Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
14 views3 pages

Data Science - Modules

The document outlines a comprehensive data science curriculum consisting of ten modules, covering topics from Python basics to machine learning and data visualization. Each module includes mini projects that reinforce learning through practical applications, such as analyzing datasets and building predictive models. The course culminates in a capstone project focused on customer churn prediction for a telecom company, along with guidance on career development and portfolio building.

Uploaded by

umajha1900
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views3 pages

Data Science - Modules

The document outlines a comprehensive data science curriculum consisting of ten modules, covering topics from Python basics to machine learning and data visualization. Each module includes mini projects that reinforce learning through practical applications, such as analyzing datasets and building predictive models. The course culminates in a capstone project focused on customer churn prediction for a telecom company, along with guidance on career development and portfolio building.

Uploaded by

umajha1900
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

✅ Module 1: Introduction to Data Science & Python Basics

• What is Data Science?


• Data Science Workflow (CRISP-DM)
• Python Basics for Data Science (variables, functions, data types, loops)
• Using Jupyter Notebook

Mini Project 1: "Explore Titanic Dataset using Python"

• Load CSV, basic stats, plot simple graphs

✅ Module 2: Data Handling with Pandas & NumPy

• Pandas Series & Data Frames


• Reading, cleaning, and manipulating data
• Handling missing values, duplicates
• Basic NumPy operations

Mini Project 2: "Sales Data Analysis"

• Read Excel/CSV, clean and summarize sales data

✅ Module 3: Data Visualization Introduction to Matplotlib and Seaborn

• Line, bar, histogram, scatter, heatmap


• Customizing plots, sub plotting

Mini Project 3: "Visualizing Covid-19 Trends"

• Create multi-type plots using real Covid datasets

✅ Module 4: Exploratory Data Analysis (EDA)

• Descriptive stats, correlations, distributions


• Outlier detection, box plots
• Grouping and aggregations

Mini Project 4: "EDA on Student Performance Dataset"

• Analyse patterns based on gender, study time, etc.


✅ Module 5: Intro to Machine Learning + Scikit-learn

• ML vs Traditional Programming
• Supervised vs Unsupervised
• Scikit-learn Basics
• ML Pipeline Steps

Mini Project 5: "Predict House Prices using Linear Regression"

• Build and evaluate a linear regression model

✅ Module 6: Classification

• Logistic Regression
• Decision Trees
• Model Evaluation (Accuracy, Confusion Matrix, Precision/Recall)

Mini Project 6: "Email Spam Classifier"

• Build a spam detector from labelled email dataset

✅ Module 7: Clustering & Unsupervised Learning

• K-Means Clustering
• Elbow Method
• Visualizing clusters

Mini Project 7: "Customer Segmentation using Mall Dataset"

• Segment customers using K-Means

✅ Module 8: Microsoft Fabric for Data Science

• What is Microsoft Fabric?


• Introduction to OneLake and Data Engineering experiences
• Using Notebooks in Fabric (Spark engine)
• Connecting datasets to Power BI
• Hands-on: Load data → Clean → Analyze → Visualize

Project 8: Power BI Dashboard using Microsoft Fabric Lakehouse data


(e.g., create a dashboard for sales performance)
✅ Module 9: Real-World Case Study – Business Analytics

• Case: Telecom Company – Customer Churn Prediction


o Problem understanding
o Data preparation and EDA
o Build classification model
o Present business recommendations

Capstone Project: End-to-End Churn Prediction Pipeline & Insights Dashboard

• Includes Power BI or Streamlit report

✅ Module 10: Careers, GitHub, and Portfolio Building

• Git, GitHub for version control


• Documenting and presenting projects
• Building a Data Science Portfolio
• Resume tips + Certifications (e.g., Microsoft, Google, IBM)
• Introduction to Kaggle

Optional Project: Upload and present all projects via GitHub Portfolio

✅ Final List of 8 Projects:

1. Titanic Survival Analysis


2. Retail Sales Analysis
3. COVID-19 Data Visualization
4. Student Performance EDA
5. House Price Prediction
6. Spam Classifier
7. Customer Segmentation
8. Power BI Dashboard via Microsoft Fabric

+ Capstone Case Study: Telecom Customer Churn

You might also like