Repository containing portfolio of data science projects completed by me for academic, self learning, and hobby purposes. Mainly, I use Python as the language and present it in the form of Python and Jupyter notebooks.
For a more visually pleasant experience for browsing the portfolio, check out https://danafr00.github.io/
- Data Scraping (Python)
- ETL Airflow Pipeline (Python, Airflow, SQL)
- Airflow and Pyspark ETL(Python, Airflow, SQL, Pyspark, Looker)
- Stream and Batch Processing (Python, Airflow, SQL, Kafka, Bigquery, Looker)