This project demonstrates a focused data analysis and visualization workflow:
- Acquisition — Ingesting ~3 million NYC TLC taxi trip records in
.parquetformat - Exploration — Performing EDA using
SQLandPythonin a Jupyter Notebook - Presentation — Building an interactive dashboard with Streamlit to surface insights
This project shows my ability to:
- Dive into large raw datasets and extract value
- Use familiar tools for both analysis and application
- Deliver visualization and bridging the gap between data and decision-making
- Communicate clearly: raw data → insights → actionable output