A Custom Apache Distribution including Spark and Hadoop, for Windows.
This Distribution has been customized to work out of the box.
So, just download it, and unzip it.
Set the Path variables for bin folders, HADOOP_HOME, SPARK_HOME, and JAVA_HOME.
That's it..! use Hadoop and Spark natively on Windows.
osDQ dedicated to create apache spark based data pipeline using JSON
.../example/samplerun.json
For those on windows, you need to have hadoop distribtion unzipped on local drive and HADOOP_HOME set. Also copy winutils.exe from here into HADOOP_HOME\bin