pySpark is python API for spark and it is very powerful for big data processing.
In this project I have analyzed playstore data using pySpark.
We have implemented pySpark command and performed EDA using SQL and Spark
Link for dataset
https://www.kaggle.com/datasets/gauthamp10/google-playstore-apps