0%

Spark安装

Apache Spark™ is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

https://spark.apache.org/

How to install Spark on Windows

https://spark.apache.org/downloads.html

Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing.

https://phoenixnap.com/kb/install-spark-on-windows-10

Installing with PyPi

PySpark is now available in pypi. To install just run pip install pyspark.