Install on Windows 10

Install Java

Download JDK 8 from here, install it, and check whether Java is on the path by typing:

> java -version
java version "1.8.0_301"
Java(TM) SE Runtime Environment (build 1.8.0_301-b09)
Java HotSpot(TM) 64-Bit Server VM (build 25.301-b09, mixed mode)

If it is not, set the environment variable %JAVA_HOME% to the JDK installation directory, and add %JAVA_HOME%\bin to the path.
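If Python is already available, the check described above can be sketched as a short script. This is only an illustration: the helper name `check_home_on_path` is hypothetical and not part of any tool mentioned in this document, and it reads only the environment of the current process, so open a new terminal after changing the variables.

```python
import os


def check_home_on_path(var_name, env=None):
    """Return a list of problems found for a *_HOME variable and its bin directory.

    Hypothetical helper for illustration; an empty list means the variable is set
    and its bin subdirectory appears among the PATH entries.
    """
    env = os.environ if env is None else env
    home = env.get(var_name)
    if not home:
        return [f"{var_name} is not set"]

    # Normalise case and separators so that equivalent spellings compare equal.
    bin_dir = os.path.normcase(os.path.normpath(os.path.join(home, "bin")))
    path_entries = [
        os.path.normcase(os.path.normpath(entry))
        for entry in env.get("PATH", "").split(os.pathsep)
        if entry
    ]
    if bin_dir not in path_entries:
        return [f"{bin_dir} is not on the path"]
    return []


if __name__ == "__main__":
    for problem in check_home_on_path("JAVA_HOME"):
        print(problem)
```

The same function can be called with "HADOOP_HOME" once that variable has been set in the next step.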

Install Hadoop winutils

From the repo gh:cdarlint/winutils, install the Hadoop version that matches your Apache Spark installation.

If you continue with the next part of this document, you might need a 2.7.x version of this component.

Set the variable %HADOOP_HOME% to the directory where the component is copied, and add %HADOOP_HOME%\bin to the path.
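A quick way to confirm that the component landed in the right place is to look for winutils.exe under the bin directory of %HADOOP_HOME%. The sketch below assumes that layout (bin\winutils.exe directly under the directory the variable points to); the function name `winutils_path` is made up for this example.

```python
import os
from pathlib import Path


def winutils_path(env=None):
    """Return the path to winutils.exe under HADOOP_HOME, or None if it is missing.

    Hypothetical helper for illustration; assumes the file sits at
    <HADOOP_HOME>/bin/winutils.exe.
    """
    env = os.environ if env is None else env
    home = env.get("HADOOP_HOME")
    if not home:
        return None
    candidate = Path(home) / "bin" / "winutils.exe"
    return candidate if candidate.is_file() else None


if __name__ == "__main__":
    print(winutils_path() or "winutils.exe not found under %HADOOP_HOME%\\bin")
```

If the script reports that the file was not found, check that %HADOOP_HOME% points to the versioned directory itself and not to its parent.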

Install Apache Spark

Install Spark with scoop using the command:

> scoop install spark

If everything is set, you can open any of the Spark shells, such as:

> spark-shell.cmd


> pyspark
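
Once the pyspark shell is open, a tiny job can confirm that Spark actually runs. The sketch below relies on the `spark` session object that the pyspark shell creates at startup; the function name `smoke_test` is hypothetical, chosen only for this example.

```python
def smoke_test(spark, n=5):
    """Build a DataFrame with n rows (ids 0..n-1) and return its row count."""
    return spark.range(n).count()


# Inside the pyspark shell, `spark` is already defined, so the check runs directly.
if "spark" in globals():
    print(smoke_test(spark))
```

A printed count equal to `n` (5 by default) suggests that Java, winutils, and Spark are wired together correctly.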