PySpark
PySpark is the Python API for Apache Spark.
https://spark.apache.org
https://spark.apache.org/docs/latest/api/python/index.html
Tutorials
PySpark Cheat Sheet: Spark DataFrames in Python (article) - DataCamp
https://www.datacamp.com/community/blog/pyspark-sql-cheat-sheet
Paid Tutorials
Introduction to PySpark (course) - DataCamp
https://www.datacamp.com/courses/introduction-to-pyspark
See also
machine learning