8-Month Data Science Roadmap
Month 1: Python & Math for Data Science
Goal: Master Python programming and foundational math/statistics.
- Day 1-2: Install Python, Jupyter, Git
- Day 3-6: Syntax, variables, loops, conditionals
- Day 7-9: Functions, file handling, scopes
- Day 10-13: Lists, Tuples, Dicts, Sets
- Day 14-16: Classes, Inheritance, OOP
- Day 17-20: NumPy: arrays, indexing, broadcasting
- Day 21-24: Pandas: DataFrames, cleaning, filtering
- Day 25-26: Matplotlib, Seaborn basics
- Day 27-28: Git + GitHub (push, pull, clone, commit)
- Day 29: Linear Algebra: Vectors, Matrices
- Day 30: Stats: Mean, Median, Variance, Std Dev
- BONUS: Lambda, Map, Filter, Reduce, Generators
Month 2: Data Wrangling + EDA
Goal: Clean, analyze, visualize real-world datasets.
- Day 1-2: Handle nulls, duplicates
- Day 3-5: String ops, datetime, feature extraction
- Day 6-8: Outlier detection, scaling
- Day 9-11: Feature engineering
- Day 12-14: Seaborn: pairplots, heatmaps
- Day 15-17: Matplotlib: subplots, bar/pie
- Day 18-21: Titanic EDA
- Day 22-25: COVID-19/Netflix EDA
- Day 26-28: GroupBy, Pivot, Aggregations
- Day 29: EDA Report
- Day 30: Push project to GitHub
Month 3: SQL + Excel + DB Design
Goal: Analyze data using SQL + dashboards with Excel.
- Day 1-3: SELECT, WHERE, ORDER BY
- Day 4-6: GROUP BY, COUNT, SUM, HAVING
- Day 7-9: INNER/LEFT JOIN
- Day 10-12: Subqueries, CTEs, CASE WHEN
- Day 13-15: Window functions (RANK, ROW_NUMBER)
- Day 16-17: Excel charts, IFs, formatting
- Day 18-20: Pivot tables, VLOOKUP
- Day 21-23: SQL Practice on LeetCode/Mode
- Day 24-26: Mini project (sales/customer analysis)
- Day 27: Python + MySQL
- Day 28: Excel Dashboard
- Day 29-30: GitHub upload + documentation
- BONUS: Triggers, Stored Procedures, Transactions, Indexing
Month 4: Core Machine Learning
Goal: Train & evaluate ML models using scikit-learn.
- Day 1-3: ML Process, Data Splitting
- Day 4-6: Linear Regression
- Day 7-8: Ridge/Lasso
- Day 9-10: Logistic Regression
- Day 11-12: KNN Classifier
- Day 13-14: Decision Trees & Random Forest
- Day 15-17: Evaluation metrics: Accuracy, F1, ROC
- Day 18-20: Hyperparameter tuning (GridSearchCV)
- Day 21-24: House Price Prediction Project
- Day 25-28: Titanic/Heart Disease Classifier
- Day 29: Push to GitHub
- Day 30: Write blog/article (optional)
Month 5: Deep Learning & NLP
Goal: Dive into neural networks, sentiment analysis, and NLP.
- Day 1-3: Neural Net Concepts (activation, loss)
- Day 4-6: ANN with Keras
- Day 7-8: CNN for image classification
- Day 9-11: Text preprocessing (cleaning, lemmatizing)
- Day 12-14: TF-IDF, CountVectorizer
- Day 15-17: Sentiment model
- Day 18-20: Intro to LSTM, RNN
- Day 21-23: Transformers (BERT/GPT basics)
- Day 24-25: Hugging Face classification
- Day 26-28: Twitter Sentiment Project
- Day 29: GitHub upload
- Day 30: Write Notion summary
- BONUS: Specialization: NER, Language Modeling, SpaCy, HuggingFace
Month 6: Data Engineering & APIs
Goal: Learn scraping, APIs, pipelines, PySpark, Airflow.
- Day 1-3: BeautifulSoup Scraping
- Day 4-6: Build CSV scraper
- Day 7-9: OpenWeatherMap/Twitter API
- Day 10-12: Save API to SQL/CSV
- Day 13-15: Apache Spark basics
- Day 16-18: PySpark transformations
- Day 19-21: ETL pipeline in Python
- Day 22-23: Airflow DAGs & scheduling
- Day 24-26: Web scraping project
- Day 27-29: Spark project to GitHub
- Day 30: Document or blog it
- BONUS: Big Data: Hadoop, Kafka, MongoDB, Cassandra
Month 7: Deployment + End-to-End Projects
Goal: Convert models into apps & APIs, deploy online.
- Day 1-3: Flask basics
- Day 4-6: FastAPI for ML model
- Day 7-10: Project 1 - API with ML model
- Day 11-13: Streamlit UI basics
- Day 14-16: Project 2 - ML Dashboard
- Day 17-19: Dockerize your project
- Day 20-21: Deploy to HuggingFace or Render
- Day 22-25: Recommender System
- Day 26: Portfolio website (GitHub Pages or Notion)
- Day 27-30: Polish code, write readmes, record demo (optional)
Month 8: Job Readiness + Interviews
Goal: Final prep to land a job: resume, interviews, portfolio.
- Day 1-2: Resume Writing (project-based)
- Day 3-5: GitHub cleanup: READMEs, repos
- Day 6-9: Behavioral interview prep (STAR)
- Day 10-12: Python/ML questions
- Day 13-15: SQL interview prep
- Day 16-20: Mock interviews
- Day 21-24: Write blogs (Medium/Hashnode)
- Day 25-27: Apply on LinkedIn, Internshala
- Day 28-30: Revise weak areas + reflect