✅ Module 1: Introduction to Data Science & Python Basics
• What is Data Science?
• Data Science Workflow (CRISP-DM)
• Python Basics for Data Science (variables, functions, data types, loops)
• Using Jupyter Notebook
Mini Project 1: "Explore Titanic Dataset using Python"
• Load CSV, basic stats, plot simple graphs
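A minimal sketch of the load-and-summarize step, assuming pandas is installed. The rows below are invented in the style of the Titanic dataset (they are not real records), and the column names `Survived`, `Pclass`, `Sex`, and `Age` are assumptions about how the course file is laid out.

```python
import io

import pandas as pd

# Hypothetical rows in the style of the Titanic dataset (not real records).
CSV_TEXT = """Survived,Pclass,Sex,Age
1,1,female,29
0,3,male,22
1,2,female,35
0,3,male,
0,1,male,54
1,3,female,19
"""

# pd.read_csv accepts a file path or any file-like object;
# in the mini project this would be the path to the downloaded CSV.
df = pd.read_csv(io.StringIO(CSV_TEXT))

print(df.shape)       # (rows, columns)
print(df.describe())  # summary statistics for the numeric columns

# Share of survivors per group: the mean of a 0/1 column is a proportion.
survival_by_sex = df.groupby("Sex")["Survived"].mean()
print(survival_by_sex)
```

Calling `survival_by_sex.plot(kind="bar")` would turn the last result into a simple bar chart once Matplotlib is available.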
✅ Module 2: Data Handling with Pandas & NumPy
• Pandas Series & DataFrames
• Reading, cleaning, and manipulating data
• Handling missing values, duplicates
• Basic NumPy operations
Mini Project 2: "Sales Data Analysis"
• Read Excel/CSV, clean and summarize sales data
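The cleaning and summarizing steps might look like the sketch below. The sales table is hypothetical (column names `order_id`, `region`, `units`, and `unit_price` are assumptions), and filling gaps with the median is only one of several reasonable choices.

```python
import numpy as np
import pandas as pd

# Hypothetical sales records containing one duplicate row and one missing value.
sales = pd.DataFrame({
    "order_id": [1, 2, 2, 3, 4],
    "region": ["North", "South", "South", "North", "East"],
    "units": [10, 5, 5, np.nan, 8],
    "unit_price": [2.5, 4.0, 4.0, 2.5, 3.0],
})

# Drop exact duplicate rows, then work on a copy to avoid chained-assignment warnings.
clean = sales.drop_duplicates().copy()

# Fill the missing unit count with the column median (an assumption; dropping
# the row or using the mean are alternatives).
clean["units"] = clean["units"].fillna(clean["units"].median())

# Derive revenue and summarize it per region.
clean["revenue"] = clean["units"] * clean["unit_price"]
revenue_by_region = clean.groupby("region")["revenue"].sum()
print(revenue_by_region)
```

In the actual project the DataFrame would come from `pd.read_csv` or `pd.read_excel` instead of being typed in.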
✅ Module 3: Data Visualization
• Introduction to Matplotlib and Seaborn
• Line, bar, histogram, scatter, heatmap
• Customizing plots, subplots
Mini Project 3: "Visualizing COVID-19 Trends"
• Create multi-type plots using real COVID-19 datasets
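A rough sketch of a two-panel figure with Matplotlib. The daily counts are made-up numbers for illustration, not real COVID-19 data, and the output file name is arbitrary.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen so the script also runs without a display

import matplotlib.pyplot as plt
import numpy as np

# Made-up daily case counts for a 14-day window (illustration only).
days = np.arange(1, 15)
new_cases = np.array([3, 5, 8, 13, 21, 30, 41, 50, 48, 44, 37, 30, 22, 15])
cumulative = np.cumsum(new_cases)

# One row, two columns: a line plot next to a bar chart.
fig, (ax_line, ax_bar) = plt.subplots(1, 2, figsize=(10, 4))

ax_line.plot(days, cumulative, marker="o")
ax_line.set_title("Cumulative cases")
ax_line.set_xlabel("Day")
ax_line.set_ylabel("Cases")

ax_bar.bar(days, new_cases)
ax_bar.set_title("New cases per day")
ax_bar.set_xlabel("Day")

fig.tight_layout()
fig.savefig("covid_trends.png")
```

In a Jupyter notebook the `matplotlib.use("Agg")` line can be dropped, since figures are displayed inline.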
✅ Module 4: Exploratory Data Analysis (EDA)
• Descriptive stats, correlations, distributions
• Outlier detection, box plots
• Grouping and aggregations
Mini Project 4: "EDA on Student Performance Dataset"
• Analyze patterns based on gender, study time, etc.
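The EDA topics above (grouping, correlation, outlier detection) could be sketched as follows. The student records are hypothetical, and the 1.5 × IQR rule is the same convention box plots use for their whiskers.

```python
import pandas as pd

# Hypothetical student records (illustration only).
students = pd.DataFrame({
    "gender": ["F", "M", "F", "M", "F", "M", "F", "M"],
    "study_hours": [2, 1, 4, 3, 5, 2, 6, 20],
    "score": [60, 50, 72, 65, 80, 58, 88, 70],
})

# Grouping and aggregation: average score per gender.
mean_score_by_gender = students.groupby("gender")["score"].mean()

# Pearson correlation between study time and score.
corr = students["study_hours"].corr(students["score"])

# Outlier detection with the 1.5 * IQR rule.
q1 = students["study_hours"].quantile(0.25)
q3 = students["study_hours"].quantile(0.75)
iqr = q3 - q1
is_outlier = (students["study_hours"] < q1 - 1.5 * iqr) | (
    students["study_hours"] > q3 + 1.5 * iqr
)
outliers = students[is_outlier]

print(mean_score_by_gender)
print(f"correlation: {corr:.2f}")
print(outliers)
```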
✅ Module 5: Intro to Machine Learning + Scikit-learn
• ML vs Traditional Programming
• Supervised vs Unsupervised Learning
• Scikit-learn Basics
• ML Pipeline Steps
Mini Project 5: "Predict House Prices using Linear Regression"
• Build and evaluate a linear regression model
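One possible shape for the build-and-evaluate step, assuming scikit-learn is installed. The data is synthetic: prices are generated from a single hypothetical `area` feature plus noise, so the numbers say nothing about real housing markets.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic data: price depends linearly on area, plus random noise.
rng = np.random.default_rng(0)
area = rng.uniform(50, 200, size=200)
price = 1000 * area + 20000 + rng.normal(0, 5000, size=200)

# scikit-learn expects a 2-D feature matrix of shape (n_samples, n_features).
X = area.reshape(-1, 1)
X_train, X_test, y_train, y_test = train_test_split(
    X, price, test_size=0.2, random_state=0
)

model = LinearRegression().fit(X_train, y_train)
pred = model.predict(X_test)

# Evaluate on the held-out test set only.
r2 = r2_score(y_test, pred)
mae = mean_absolute_error(y_test, pred)
print(f"slope: {model.coef_[0]:.1f}, R^2: {r2:.3f}, MAE: {mae:.0f}")
```

The same split, fit, predict, and score sequence is a compact version of the ML pipeline steps listed in this module.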
✅ Module 6: Classification
• Logistic Regression
• Decision Trees
• Model Evaluation (Accuracy, Confusion Matrix, Precision/Recall)
Mini Project 6: "Email Spam Classifier"
• Build a spam detector from a labeled email dataset
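A small sketch of a text classifier with the evaluation metrics named in this module. The messages are invented, and pairing `CountVectorizer` with `LogisticRegression` is just one reasonable setup; the course may use a different feature extractor or dataset.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (
    accuracy_score,
    confusion_matrix,
    precision_score,
    recall_score,
)
from sklearn.pipeline import make_pipeline

# Invented messages for illustration (1 = spam, 0 = not spam).
train_texts = [
    "win a free prize now",
    "free money click now",
    "claim your free reward",
    "urgent offer win cash",
    "meeting moved to friday",
    "please review the report",
    "lunch tomorrow at noon",
    "notes from the project meeting",
]
train_labels = [1, 1, 1, 1, 0, 0, 0, 0]

test_texts = [
    "free cash prize",
    "project report for friday",
    "win money now",
    "lunch meeting tomorrow",
]
test_labels = [1, 0, 1, 0]

# Bag-of-words features feeding a logistic regression classifier.
model = make_pipeline(CountVectorizer(), LogisticRegression())
model.fit(train_texts, train_labels)
predicted = model.predict(test_texts)

accuracy = accuracy_score(test_labels, predicted)
# Rows are true classes, columns are predicted classes, in the order [0, 1].
matrix = confusion_matrix(test_labels, predicted, labels=[0, 1])
precision = precision_score(test_labels, predicted, zero_division=0)
recall = recall_score(test_labels, predicted, zero_division=0)

print(matrix)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
```

With a dataset this small the scores are not meaningful; the point is the order of steps and how each metric is computed.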
✅ Module 7: Clustering & Unsupervised Learning
• K-Means Clustering
• Elbow Method
• Visualizing clusters
Mini Project 7: "Customer Segmentation using Mall Dataset"
• Segment customers using K-Means
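K-Means with the elbow method could be sketched like this. The points are synthetic stand-ins for two customer features (for example income and spending score, which are assumptions about the mall dataset's columns), generated around three hypothetical centers.

```python
import numpy as np
from sklearn.cluster import KMeans

# Synthetic customers: 50 points scattered around each of three made-up centers.
rng = np.random.default_rng(42)
centers = np.array([[20, 80], [60, 50], [100, 20]])
points = np.vstack([c + rng.normal(0, 5, size=(50, 2)) for c in centers])

# Elbow method: fit K-Means for several k and record the inertia
# (within-cluster sum of squared distances). The "elbow" is the k after
# which inertia stops dropping sharply.
inertias = {}
for k in range(1, 7):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(points)
    inertias[k] = km.inertia_

for k, value in inertias.items():
    print(k, round(value, 1))

# Final segmentation with the k suggested by the elbow (3 for this data).
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(points)
```

Plotting `inertias` against k gives the elbow curve, and a scatter plot of `points` colored by `labels` visualizes the clusters.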
✅ Module 8: Microsoft Fabric for Data Science
• What is Microsoft Fabric?
• Introduction to OneLake and Data Engineering experiences
• Using Notebooks in Fabric (Spark engine)
• Connecting datasets to Power BI
• Hands-on: Load data → Clean → Analyze → Visualize
Mini Project 8: "Power BI Dashboard using Microsoft Fabric Lakehouse Data"
• Create a dashboard, e.g., for sales performance
✅ Module 9: Real-World Case Study – Business Analytics
• Case: Telecom Company – Customer Churn Prediction
  ◦ Problem understanding
  ◦ Data preparation and EDA
  ◦ Build classification model
  ◦ Present business recommendations
Capstone Project: End-to-End Churn Prediction Pipeline & Insights Dashboard
• Includes Power BI or Streamlit report
✅ Module 10: Careers, GitHub, and Portfolio Building
• Git, GitHub for version control
• Documenting and presenting projects
• Building a Data Science Portfolio
• Resume tips + Certifications (e.g., Microsoft, Google, IBM)
• Introduction to Kaggle
Optional Project: Upload and present all projects via GitHub Portfolio
✅ Final List of 8 Projects:
1. Titanic Survival Analysis
2. Retail Sales Analysis
3. COVID-19 Data Visualization
4. Student Performance EDA
5. House Price Prediction
6. Spam Classifier
7. Customer Segmentation
8. Power BI Dashboard via Microsoft Fabric
+ Capstone Case Study: Telecom Customer Churn