Study material for Apache Spark
update: 2017-05-25
Although Spark ships with a detailed API reference and some brief official guides, I still miss the ease of use, consistency, and completeness of the Pandas documentation. Hence this post, which collects and shares study material about Spark. Please feel free to contact me if you have any recommendations.
0. For users
0.1 DataFrame
- 10 Minutes to Spark DataFrame: Written by myself. It contains code snippets for the basic operations used when working with Spark.
0.2 Machine Learning
- ml-guide: Official.
1. For developers
- Mastering Apache Spark 2: Third-party. A lot of implementation details are covered in this online book.
- Contributing to Spark: Official.
- Useful Developer Tools: Official. Useful when debugging and testing.
- Building Spark: Official.