Although Spark ships with a detailed API reference and some brief official guides, I still miss the ease of use, consistency, and completeness of the Pandas documentation. Hence this post, which collects and shares study material about Spark. Please feel free to contact me if you have any recommendations.
0. For users
- 10 Minutes to Spark DataFrame: Written by me. It contains code snippets for basic operations in Spark.
0.2 Machine Learning
- ml-guide: Official.