Data Science

The Executive PG Programme in Data Science by Parasmani is an outcome-focused online learning platform designed for tech professionals, offering a curriculum that includes programming tools, specializations, and industry projects. Participants receive guidance from industry experts and have access to a dedicated support team, ensuring a comprehensive learning experience. The program is validated by NASSCOM and aims to provide a transformative education that enhances career opportunities in data science.

Uploaded by

anil mishra
Copyright
© All Rights Reserved
Executive PG Programme in
DATA SCIENCE
________________
"We are surrounded by data but starved for insight"

Table of Contents

Data Mining
About Parasmani
Why Parasmani
Program Highlights
Parasmani Learning Experience
Industry Projects
Learning Path
Executive PG Programme Curriculum
About Parasmani
Parasmani is a leading outcome-focused ed-tech platform for tech professionals. Our industry-vetted approach towards teaching and training young minds helps them upskill and bag the career of their dreams.

We are a transformative learning platform devoted to creating a growth ecosystem that assists software professionals in unlocking their talent and seizing opportunities at every stage of their careers.

Learners enrolled with us are taught, guided, and mentored by top professionals and experts working at leading organizations including Google, Facebook, Intuit, Microsoft, Amazon, Hotstar, etc.

Our learners have witnessed a 5x ROI (Return on Investment) from our program. Our offerings include Parasmani Academy and Parasmani Data Science.
Why Parasmani
The new age education system welcomes you. We deliver better results than normal classes.

We add value to online education through doubt solving, on an intelligent and technologically equipped EdTech platform.

Parasmani is a revolutionary online doubt solving platform that provides on-demand tutoring across all domains of education. Our platform enables thousands of tutors to share their knowledge with students around the globe.
Program Highlights
_______________________
Executive PG Programme from Parasmani Education and Alumni Status
Get certified by Parasmani Education and gain alumni status on successful completion of the program.

_______________________
Equivalent to NSQF (National Skill Qualification Framework) level 8
Do an Executive PG Programme that satisfies NSQF level 8 criteria.

_____________________
Tools & Languages
Learn 14+ Programming Tools & Languages such as Python, Tableau, MySQL, Keras, TensorFlow and more.

_____________________
NASSCOM Future Skill Certification
India’s first Executive PG Programme, validated by and recommended by NASSCOM. Avail of a participation certificate from NASSCOM on successful program completion.

_____________________
5 Specializations
Choose from 5 specializations such as Natural Language Processing, Deep Learning, Business Intelligence/Data Analytics, Business Analytics, Data Engineering, based on your background and career aspirations and get the learning you want.

__________________
Blended Learning
Learn with the ease and flexibility of recorded sessions as well as live sessions, designed to ensure a wholesome learning experience.
Parasmani Learning Experience
___________
Student Support Team
• We have a dedicated Student Support Team for handling your queries via email or callback requests
• This support is available 7 days a week, 24x7

___________
PARASMANI BaseCamp (PRE-COVID)
• Fun-packed, informative and career building workshop sessions by industry professionals and professors
• Group activities with your peers and alumni

___________
Expert Feedback
• Personalized expert feedback on assignments and projects
• Regular live sessions by experts to clarify concept-related doubts

___________
Industry Mentors
• Receive unparalleled guidance from industry mentors, teaching assistants and graders
• Receive one-on-one feedback on submissions and personalized feedback on improvement

___________
Industry Networking
• Live sessions by experts on various industry topics
• One-on-one discussion and feedback sessions with industry mentors
New Addition
___________
Career Essential Soft-skills Program
1. Excel in your personal & professional life with PARASMANI’s Soft Skills Program
2. Study three fundamental skills - Interview & Job Search, Corporate & Business Communication and Problem Solving
3. Get access to 40+ learner hours of soft skills content delivered by the best faculty & industry experts

___________
30-Hour Programming Bootcamp for Non-tech Learners
1. Non-tech background? No need to fear programming anymore
2. A 30-hour Python Programming bootcamp, focusing on developing Basic + Intermediate Python Programming concepts to assist non-tech learners.
3. A blended learning experience delivered via interactive live sessions and assessments
Industry Projects

• IMDb Movie Analysis
• Uber Supply-Demand Gap
• Lead Scoring
• Fraud Detection
• Creditworthiness of Customers
• Speech Recognition
• Image Captioning
• Gesture Recognition
• Social Media Listening
• Telecom Churn
• Interactive Market Campaign Analysis
• Retail Giant Sales Forecasting

And many more!


Learning Path

Preparatory Course: 0 week. Tools: Python, Excel
Data Toolkit: 12 weeks. Tools: Python, Excel, mySQL
Machine Learning: 10 weeks. Tools: Python, Excel

Choose any of the 5 Specialisations
22 weeks (with 4 weeks of Capstone)

• Natural Language Processing. Tools: Python, Excel
• Deep Learning. Tools: Python, Excel, TensorFlow
• Business Analytics. Tools: Python, mySQL, Excel
• Business Intelligence/Data Analytics. Tools: Python, Power BI, Excel, mySQL, MongoDB, Shiny, Tableau
• Data Engineering. Tools: Hadoop, HBase, Sqoop, Hive, Flume, PySpark, Spark, Airflow

Certification awarded per specialisation:
• Executive PG Programme in Data Science (Natural Language Processing)
• Executive PG Programme in Data Science (Deep Learning)
• Executive PG Programme in Data Science (Business Analytics)
• Executive PG Programme in Data Science (Business Intelligence/Data Analytics)
• Executive PG Programme in Data Science (Data Engineering)
Executive PG Programme
in Data Science
COMMON CURRICULUM

PRE-PROGRAM PREPARATORY CONTENT


1. DATA ANALYSIS IN EXCEL
Taught by one of the most renowned data scientists in the country (S. Anand, CEO, Gramener), this module takes you from a beginner-level Excel user to an almost professional user.
1. INTRODUCTION TO EXCEL
2. DATA ANALYSIS IN EXCEL - I: FUNCTIONS, FORMULAE, AND CHARTS
3. DATA ANALYSIS IN EXCEL - II: PIVOTS AND LOOKUPS

2. ANALYTICS PROBLEM SOLVING
This module covers concepts of the CRISP-DM framework for business problem-solving.
1. THE CRISP-DM FRAMEWORK - BUSINESS AND DATA UNDERSTANDING
2. CRISP-DM FRAMEWORK - DATA PREPARATION, MODELLING, EVALUATION AND DEPLOYMENT

COURSE 1: DATA TOOLKIT


2. ANALYTICS PROBLEM SOLVING 2 WEEKS
Build a foundation for the most in-demand programming language of the 21st century.
1. UNDERSTANDING THE PARASMANI CODING CONSOLE
2. BASICS OF PYTHON
3. DATA STRUCTURES IN PYTHON
4. CONTROL STRUCTURE AND FUNCTIONS IN PYTHON
5. OOP IN PYTHON

*The Curriculum is subject to change as per the inputs from university or industry experts
2. PROGRAMMING IN PYTHON 1 WEEK
Learn how to approach and solve logical problems using programming.
1. LOGIC AND SYNTAX BUILDING
2. DATA STRUCTURES: LISTS, STRINGS, DICTIONARIES, AND STACKS
3. TIME COMPLEXITY
4. SEARCHING AND SORTING
5. TWO POINTERS
6. RECURSION

3. PYTHON FOR DATA SCIENCE 1 WEEK
Learn how to manipulate datasets in Python using Pandas, which is the most powerful library for data preparation and analysis.
1. INTRODUCTION TO NUMPY
2. INTRODUCTION TO MATPLOTLIB
3. INTRODUCTION TO PANDAS
4. GETTING AND CLEANING DATA

4. DATA VISUALISATION IN PYTHON 1 WEEK
Humans are visual learners, and hence no task related to data is complete without visualization. Learn to plot and interpret various graphs in Python and observe how they make data analysis and drawing insights easier.
1. INTRODUCTION TO DATA VISUALISATION
2. DATA VISUALISATION USING SEABORN

5. EXPLORATORY DATA ANALYSIS 1 WEEK
Learn how to find and analyze the patterns in the data to draw actionable insights.
1. DATA SOURCING
2. DATA CLEANING
3. UNIVARIATE ANALYSIS
4. BIVARIATE ANALYSIS AND MULTIVARIATE ANALYSIS

6. CREDIT EDA CASE STUDY 1 WEEK
Solve a real industry problem through the concepts learnt in exploratory data analysis.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

7. INFERENTIAL STATISTICS 1 WEEK
Build a strong statistical foundation and learn how to ‘infer’ insights from a huge population using a small sample.
1. BASICS OF PROBABILITY
2. DISCRETE PROBABILITY DISTRIBUTIONS
3. CONTINUOUS PROBABILITY DISTRIBUTIONS
4. CENTRAL LIMIT THEOREM

8. HYPOTHESIS TESTING 1 WEEK
Understand how to formulate and validate hypotheses for a population to solve real-life business problems.
1. CONCEPTS OF HYPOTHESIS TESTING - I: NULL AND ALTERNATE HYPOTHESIS, MAKING A DECISION, AND CRITICAL VALUE METHOD
2. CONCEPTS OF HYPOTHESIS TESTING - II: P-VALUE METHOD AND TYPES OF ERRORS
3. INDUSTRY DEMONSTRATION OF HYPOTHESIS TESTING: TWO-SAMPLE MEAN AND PROPORTION TEST, A/B TESTING

9. DATA ANALYSIS USING SQL 1 WEEK
Data in companies is definitely not stored in Excel sheets! Learn the fundamentals of databases and extract information from RDBMS using the structured query language.
1. DATABASE DESIGN
2. DATABASE CREATION IN MYSQL WORKBENCH
3. QUERYING IN MYSQL
4. JOINS AND SET OPERATIONS

10. ADVANCED SQL & BEST PRACTICES 1 WEEK
Apply advanced SQL concepts like windowing and procedures to derive insights from data and answer pertinent business questions.
1. WINDOW FUNCTIONS
2. CASE STATEMENTS, STORED ROUTINES AND CURSORS
3. QUERY OPTIMISATION AND BEST PRACTICES
4. PROBLEM-SOLVING USING SQL

11. SQL ASSIGNMENT: RSVP MOVIES 1 WEEK
In this assignment, you will work on a movies dataset using SQL to extract exciting insights.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 2: MACHINE LEARNING - I


1. LINEAR REGRESSION 2 WEEKS
Venture into the machine learning community by learning how one variable can be predicted using several other variables, through a housing dataset where you will predict the prices of houses based on various factors.
1. SIMPLE LINEAR REGRESSION
2. SIMPLE LINEAR REGRESSION IN PYTHON
3. MULTIPLE LINEAR REGRESSION
4. MULTIPLE LINEAR REGRESSION IN PYTHON
5. INDUSTRY RELEVANCE OF LINEAR REGRESSION

2. LINEAR REGRESSION ASSIGNMENT 1 WEEK
Build a model to understand the factors on which the demand for bike-sharing systems depends, and help a company optimize its revenue.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

3. LOGISTIC REGRESSION 1 WEEK
Learn your first binary classification technique by determining which customers of a telecom operator are likely to churn and which are not, to help the business retain customers.
1. UNIVARIATE LOGISTIC REGRESSION
2. MULTIVARIATE LOGISTIC REGRESSION: MODEL BUILDING AND EVALUATION
4. SOLUTION

4. CLASSIFICATION USING DECISION TREES 1 WEEK
Learn how the human decision-making process can be replicated using a decision tree and tune it to suit your needs.
1. INTRODUCTION TO DECISION TREES
2. ALGORITHMS FOR DECISION TREES CONSTRUCTION
3. HYPERPARAMETER TUNING IN DECISION TREES

5. UNSUPERVISED LEARNING: CLUSTERING 1 WEEK
Learn how to group elements into different clusters when you don’t have any pre-defined labels to segregate them through K-means clustering, hierarchical clustering, and more.
1. INTRODUCTION TO CLUSTERING
2. K-MEANS CLUSTERING
3. HIERARCHICAL CLUSTERING
4. OTHER FORMS OF CLUSTERING: K-MODE, K-PROTOTYPE, DBSCAN

6. BASICS OF NLP AND TEXT MINING 1 WEEK
Do you get annoyed by the constant spam in your mailbox? Wouldn’t it be nice if we had a program to check your spelling? In this module, learn how to build a spell checker & spam detector using techniques like phonetic hashing, bag-of-words, TF-IDF, etc.
1. REGEX AND INTRODUCTION TO NLP
2. BASIC LEXICAL PROCESSING
3. ADVANCED LEXICAL PROCESSING

7. BUSINESS PROBLEM SOLVING 1 WEEK
Learn how to approach open-ended real-world problems using data as a lever to draw actionable insights.
1. INTRODUCTION TO BUSINESS PROBLEM SOLVING
2. BUSINESS PROBLEM SOLVING: CASE STUDY DEMONSTRATIONS

8. CASE STUDY: LEAD SCORING 1 WEEK
Help the Sales team of your company identify which leads are worth pursuing through this classification case study.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

SPECIALISATION: DEEP LEARNING

COURSE 3 - MACHINE LEARNING - II


1. BAGGING & RANDOM FOREST 1 WEEK
Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.
1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

2. BOOSTING 1 WEEK
Learn about ensemble modelling through bagging and boosting, and understand how weak algorithms can be transformed into stronger ones.
1. INTRODUCTION TO BOOSTING AND ADABOOST
2. GRADIENT BOOSTING

3. MODEL SELECTION & GENERAL ML TECHNIQUES 1 WEEK
Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.
1. PRINCIPLES OF MODEL SELECTION
2. MODEL EVALUATION
3. MODEL SELECTION: BEST PRACTICES

4. PRINCIPAL COMPONENT ANALYSIS 1 WEEK
Understand important concepts related to dimensionality reduction, the basic idea and the learning algorithm of PCA, and its practical applications on supervised and unsupervised problems.
1. PRINCIPAL COMPONENT ANALYSIS AND SINGULAR VALUE DECOMPOSITION
2. PRINCIPAL COMPONENT ANALYSIS IN PYTHON

5. ADVANCED REGRESSION 1 WEEK
In this module, take a more advanced look at regression models and learn the concepts related to regularization.
1. GENERALISED LINEAR REGRESSION
2. REGULARISED REGRESSION

6. ADVANCED ML CASE STUDY 1 WEEK
Build a regularized regression model to understand the most important variables to predict house prices in Australia.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - ADVANCED MACHINE LEARNING AND DEEP LEARNING

1. TIME SERIES ANALYSIS 2 WEEKS
In this module, you will learn how to analyze and forecast a series that varies with time.
1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. WORKING WITH STATIONARY TIME SERIES
3. END-TO-END ANALYSIS OF TIME SERIES

2. INTRODUCTION TO NEURAL NETWORKS AND ANN 1 WEEK
Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural Networks or ANNs.
1. STRUCTURE OF NEURAL NETWORKS
2. FEED FORWARD IN NEURAL NETWORKS
3. BACKPROPAGATION IN NEURAL NETWORKS
4. MODIFICATIONS TO NEURAL NETWORKS
5. HYPERPARAMETER TUNING IN NEURAL NETWORKS

3. NEURAL NETWORK ASSIGNMENT 1 WEEK
Build a neural network from scratch in TensorFlow to identify the type of skin cancer from the image.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5 - ADVANCED DEEP LEARNING AND COMPUTER VISION

1. CONVOLUTIONAL NEURAL NETWORKS 2 WEEKS
Learn the basics of CNN and OpenCV and how to classify image data using various architectures which you will then implement using Python and Keras.
1. INTRODUCTION TO CONVOLUTIONAL NEURAL NETWORKS
2. BUILDING CNNS WITH PYTHON AND KERAS
3. CNN ARCHITECTURES AND TRANSFER LEARNING
4. STYLE TRANSFER AND OBJECT DETECTION

2. INTRODUCTION TO NEURAL NETWORKS AND ANN 1 WEEK
Apply CNNs to Computer Vision tasks like detecting anomalies in chest X-Ray scans.
1. INDUSTRY DEMONSTRATION: USING CNNS WITH FLOWERS IMAGES
3. INDUSTRY DEMONSTRATION: USING CNNS WITH X-RAY IMAGES

3. OBJECT DETECTION AND IMAGE SEGMENTATION (OPTIONAL) 0 WEEK
Learn the applications of DL in computer vision through industry-relevant detection algorithms such as RCNNs, YOLO and SSD.
1. FUNDAMENTALS OF OBJECT DETECTION
2. REGION-BASED DETECTORS
3. ONE-SHOT DETECTORS
4. CUSTOM OBJECT DETECTION
5. SEMANTIC SEGMENTATION

4. RECURRENT NEURAL NETWORKS 1 WEEK
Learn what makes a neural network recurrent, explore variants such as bidirectional RNNs and LSTMs, and build RNNs in Python.
1. WHAT MAKES A NEURAL NETWORK RECURRENT
2. VARIANTS OF RNNS: BIDIRECTIONAL RNNS AND LSTMS
3. BUILDING RNNS IN PYTHON

5. GESTURE RECOGNITION 2 WEEKS
Make a Smart TV system which can control the TV with the user’s hand gestures as the remote control.
1. TWO ARCHITECTURES: 3D CONVS AND CNN-RNN STACK
2. UNDERSTANDING GENERATORS
3. STARTER CODE WALKTHROUGH
4. PROBLEM STATEMENT AND FINAL SUBMISSION

COURSE 6 - CAPSTONE PROJECT

CAPSTONE PROJECT 4 WEEKS
Choose from a range of real-world industry projects on advanced topics like Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media Listening, and Speech Recognition, among many others.
1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

SPECIALISATION: NATURAL LANGUAGE PROCESSING

COURSE 3 - MACHINE LEARNING II

1. BAGGING & RANDOM FOREST 1 WEEK
Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.
1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

2. BOOSTING 2 WEEKS
Learn about ensemble modelling through bagging and boosting, and understand how weak algorithms can be transformed into stronger ones.
1. INTRODUCTION TO BOOSTING AND ADABOOST
2. GRADIENT BOOSTING

3. MODEL SELECTION & GENERAL ML TECHNIQUES 1 WEEK
Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.
1. PRINCIPLES OF MODEL SELECTION
2. MODEL EVALUATION
3. MODEL SELECTION: BEST PRACTICES

4. PRINCIPAL COMPONENT ANALYSIS 1 WEEK
Understand important concepts related to dimensionality reduction, the basic idea and the learning algorithm of PCA, and its practical applications on supervised and unsupervised problems.
1. PRINCIPAL COMPONENT ANALYSIS AND SINGULAR VALUE DECOMPOSITION
2. PRINCIPAL COMPONENT ANALYSIS IN PYTHON

5. ADVANCED REGRESSION 1 WEEK
In this module, take a more advanced look at regression models and learn the concepts related to regularization.
1. GENERALISED LINEAR REGRESSION
2. REGULARISED REGRESSION

6. ADVANCED ML CASE STUDY 1 WEEK
Build a regularized regression model to understand the most important variables to predict house prices in Australia.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - ADVANCED MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING

1. TIME SERIES FORECASTING 2 WEEKS
In this module, you will learn how to analyze and forecast a series that varies with time.
1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. WORKING WITH STATIONARY TIME SERIES
3. END-TO-END ANALYSIS OF TIME SERIES

2. NEURAL NETS FOR NLP 1 WEEK
Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural Networks or ANNs.
1. UNDERSTANDING NEURAL NETWORKS
2. LOSS FUNCTIONS AND BACK PROPAGATION
3. UNDERSTANDING TENSORFLOW
4. CASE STUDY: IMDB MOVIE REVIEW CLASSIFICATION

3. SYNTACTIC PROCESSING 2 WEEKS
Learn syntactic processing techniques such as parsing, information extraction, and conditional random fields.
1. INTRODUCTION TO SYNTACTIC PROCESSING
2. PARSING
3. INFORMATION EXTRACTION
4. CONDITIONAL RANDOM FIELDS

4. SYNTACTIC PROCESSING ASSIGNMENT 1 WEEK
Use techniques such as POS tagging and dependency parsing to extract information from unstructured text data.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5 - ADVANCED NATURAL LANGUAGE PROCESSING

1. SEMANTIC PROCESSING 2 WEEKS
Learn the most interesting area in the field of NLP and understand different techniques like word-embeddings and topic modelling to build an application that extracts opinions about socially relevant issues.
1. INTRODUCTION TO SEMANTIC PROCESSING
2. DISTRIBUTIONAL SEMANTICS
3. INDUSTRY APPLICATIONS OF DISTRIBUTIONAL SEMANTICS
4. TOPIC MODELLING

2. APPLIED DL IN NLP 2 WEEKS
Apply the concepts of DL in natural language processing problems through encoder-decoder architecture and NMTs, and implement them in TensorFlow.
1. INTRODUCTION TO MACHINE TRANSLATION
2. ATTENTION-BASED NMT MODEL
3. CUSTOM MODEL BUILDING IN TENSORFLOW

3. CASE STUDY: AUTOMATIC TICKET CLASSIFICATION 2 WEEKS
Categorize support tickets with the help of unsupervised learning and topic modelling.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 6 - CAPSTONE PROJECT
1. CAPSTONE PROJECT 4 WEEKS
Choose from a range of real-world industry projects on advanced topics like Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media Listening, and Speech Recognition, among many others.
1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION

SPECIALISATION: BUSINESS ANALYTICS

COURSE 3 - ADVANCED MACHINE LEARNING


1. BAGGING & RANDOM FOREST 1 WEEK
Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.
1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

2. MODEL SELECTION & GENERAL ML TECHNIQUES 2 WEEKS
Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.
1. PRINCIPLES OF MODEL SELECTION
2. MODEL BUILDING AND EVALUATION
3. FEATURE ENGINEERING
4. CLASS IMBALANCE

3. TIME SERIES FORECASTING 2 WEEKS
In this module, you will learn how to analyze and forecast a series that varies with time.
1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. SMOOTHING TECHNIQUES
3. INTRODUCTION TO AR MODELS
4. BUILDING AR MODELS

4. MODEL SELECTION CASE STUDY 1 WEEK
Apply your business acumen to the newly learnt machine learning techniques, and select the model most appropriate for a provided business scenario.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - DATA VISUALISATION AND STORYTELLING

1. VISUALISATION USING TABLEAU 1 WEEK
Learn basic visualization techniques using the most in-demand visualization tool in the industry.
1. DATA EXPLORATION IN TABLEAU
2. VISUALISING AND ANALYSING DATA IN TABLEAU WITH BASIC PLOTS

2. ADVANCED EXCEL 2 WEEKS
Learn Excel functions, data analysis in Excel, and advanced tools and visualisations.
1. EXCEL FUNCTIONS
2. DATA ANALYSIS IN EXCEL
3. ADVANCED TOOLS AND VISUALISATIONS

3. VISUALISATION USING POWERBI 1 WEEK
Take your visualization game a step forward by understanding how to operate PowerBI.
1. POWERBI: INTRODUCTION AND SETUP
2. VISUALISING AND ANALYSING DATA IN POWERBI
3. DATA TRANSFORMATIONS USING POWERBI

4. STRUCTURED PROBLEM SOLVING USING FRAMEWORKS 1 WEEK
Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and SPIN.
1. INTRODUCTION TO STRUCTURED PROBLEM SOLVING
2. INTERVIEWING AND FRAMEWORKS - I: 5W AND 5WHYS
3. INTERVIEWING AND FRAMEWORKS - II: SPIN
4. INDUSTRY DEMONSTRATIONS ON FRAMEWORKS
5. UNDERSTANDING BUSINESS MODEL CANVAS AND ISSUE TREE FRAMEWORK
6. INDUSTRY DEMONSTRATIONS ON ISSUE TREE FRAMEWORK
7. SPECIALISED FRAMEWORKS FOR BUSINESS PROBLEMS: 7PS, 5CS, ETC.

5. DATA STORYTELLING 2 WEEKS
Learn how to effectively strategize, communicate, and fine-grain your data analysis projects, understand how to optimally present your findings to technical and non-technical stakeholders, and upgrade your storytelling skills.
1. INTRODUCTION TO DATA STORYTELLING
2. COMPONENTS OF A GOOD STORY WITH DATA - UNDERSTANDING YOUR STAKEHOLDER AND STAKEHOLDER EMPATHY, LEVELS OF DETAILS FOR DIFFERENT STAKEHOLDERS - CXO/LEADERSHIP VS TEAM PRESENTATIONS, VISUALS, ETC.
3. GOLDEN RULES FOR DATA STORYTELLING

6. AIRBNB CASE STUDY 1 WEEK
Use your newly learnt UI tools skills to analyse an AirBnB dataset to make important business decisions. But the analysis is just a small part; can you also effectively present it using Data Storytelling to the right stakeholders?
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5: SOLVING BUSINESS REQUIREMENTS

1. OPERATIONS RESEARCH IN EXCEL 1 WEEK
Learn about the world of operations research through linear and integer optimizations.
1. INTRODUCTION & CONCEPTS OF OPTIMISATION
2. OPTIMISATION USING EXCEL
3. OPTIMISATION USING PYTHON
4. OR IN INDUSTRY - WAREHOUSE PROBLEM, ASSIGNMENT PROBLEM, JOBSHOP SCHEDULING, ETC.

2. DATA ARCHITECTURE 1 WEEK
Given a broad business challenge, describe how you would approach the development of a Machine Learning Architecture strategy using the Structured Problem Solving Method.
1. COMPONENTS OF EFFECTIVE DATA ARCHITECTURE
2. TECHNOLOGY AND INFRASTRUCTURE
3. TOOLS TO BUILD AN EFFECTIVE DATA ARCHITECTURE

3. DATA STRATEGY 2 WEEKS
Understand how to identify the right business problems (Revenue/Cost Perspective, Value Chain) using the DS project assessment framework. You will also learn how to manage a product from production to deployment and understand the overall lifecycle management of an Analytics/DS project.
1. BACKGROUND OF DATA STRATEGY
2. CORE OF DATA STRATEGY-I
3. CORE OF DATA STRATEGY-II
4. CASE STUDIES FOR DATA STRATEGY

4. BUSINESS CASE STUDY 2 WEEKS
Understand how a project in the industry is taken up and solved through a comprehensive business case study.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 6 - CAPSTONE PROJECT


1. CAPSTONE PROJECT 4 WEEKS
Solve an end-to-end real-life industry problem from a wide variety of domains.
1. POWER BI - OPTIONAL
2. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
3. PROBLEM STATEMENT
4. EVALUATION RUBRIC
5. MID SUBMISSION
6. FINAL SUBMISSION
7. SOLUTION

SPECIALISATION: BUSINESS INTELLIGENCE / DATA ANALYTICS

COURSE 3: ADVANCED DBS AND BIG DATA ANALYTICS

1. DATA MODELLING 1 WEEK
In this module, you will learn and use data modelling on a dataset to solve a business problem.
1. DATABASE DESIGN RECAP
2. BUILDING BLOCKS OF DATA MODELLING
3. PROBLEM SOLVING USING DATA MODELLING
4. DATA MODELLING: OPTIONAL ASSIGNMENT

2. ADVANCED SQL AND BEST PRACTICES 1 WEEK
Apply advanced SQL concepts like windowing and procedures to derive insights from data and answer pertinent business questions.
1. WINDOW FUNCTIONS
2. CASE STATEMENTS, STORED ROUTINES, AND CURSORS
3. QUERY OPTIMISATION AND BEST PRACTICES
4. PROBLEM SOLVING USING SQL

3. INTRODUCTION TO BIG DATA AND CLOUD 1 WEEK
Understand the basics of big data and cloud and learn to work with an EMR cluster on a cloud-based service.
1. BIG DATA AND CLOUD COMPUTING
2. AMAZON WEB SERVICES
3. BIG DATA STORAGE AND PROCESSING - HADOOP
4. EMR CLUSTER IN AWS

4. ANALYTICS USING SPARK 2 WEEKS
Use PySpark to do EDA and Predictive Analysis using Spark’s ML library.
1. EXPLORATORY DATA ANALYSIS WITH PYSPARK
2. PREDICTIVE ANALYSIS WITH SPARK MLLIB

5. BIG DATA CASE STUDY 1 WEEK
Use your analytics skills to work on a large dataset in the cloud to solve an industry problem.
1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - DATA VISUALISATION AND STORYTELLING

1. VISUALISATION USING TABLEAU 1 WEEK
Learn basic visualization techniques using the most in-demand visualization tool in the industry.
1. DATA EXPLORATION IN TABLEAU
2. VISUALISING AND ANALYSING DATA IN TABLEAU WITH BASIC PLOTS

2. ADVANCED EXCEL 1 WEEK
Learn Excel functions, data analysis in Excel, and advanced tools and visualisations.
1. EXCEL FUNCTIONS
2. DATA ANALYSIS IN EXCEL
3. ADVANCED TOOLS AND VISUALISATIONS

3. VISUALISATION USING POWERBI 1 WEEK
Take your visualization game a step forward by understanding how to operate PowerBI.
1. POWERBI: INTRODUCTION AND SETUP
2. VISUALISING AND ANALYSING DATA IN POWERBI
3. DATA TRANSFORMATIONS USING POWERBI

4. STRUCTURED PROBLEM SOLVING USING FRAMEWORKS 1 WEEK
Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and SPIN.
1. INTRODUCTION TO STRUCTURED PROBLEM SOLVING
2. INTERVIEWING AND FRAMEWORKS - I: 5W AND 5WHYS
3. INTERVIEWING AND FRAMEWORKS - II: SPIN
4. INDUSTRY DEMONSTRATIONS ON FRAMEWORKS
5. UNDERSTANDING BUSINESS MODEL CANVAS AND ISSUE TREE FRAMEWORK
6. INDUSTRY DEMONSTRATIONS ON ISSUE TREE FRAMEWORK
7. SPECIALIZED FRAMEWORKS FOR BUSINESS PROBLEMS: 7PS, 5CS, ETC.

5. DATA STORYTELLING 1 WEEK

Learn how to effectively strategize, communicate, and fine-tune your data analysis projects, understand how to optimally present your findings to technical and non-technical stakeholders, and upgrade your storytelling skills.

1. INTRODUCTION TO DATA STORYTELLING
3. COMPONENTS OF A GOOD STORY WITH DATA - UNDERSTANDING YOUR STAKEHOLDER AND STAKEHOLDER EMPATHY, LEVELS OF DETAILS FOR DIFFERENT STAKEHOLDERS - CXO/LEADERSHIP VS TEAM PRESENTATIONS, VISUALS, ETC.
4. GOLDEN RULES FOR DATA STORYTELLING

6. AIRBNB CASE STUDY 1 WEEK

Use your newly learnt UI tools skills to analyze an Airbnb dataset to make important business decisions. But the analysis is just a small part; can you also effectively present it using Data Storytelling to the right stakeholders?

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5: ADVANCED PROBLEM SOLVING AND PROGRAMMING
1. DATA STRUCTURES - SETS, DICTIONARIES, STACKS, QUEUES 1 WEEK

Learn user-defined data structures (stack, queue, and trees) in Python that help in advanced data manipulation.

1. IN-BUILT DATA STRUCTURES
2. STACK
3. QUEUE
4. TREES
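
As a rough illustration of the stack and queue topics listed above, the minimal Python sketch below backs a stack with a plain list and a queue with collections.deque. The class names Stack and Queue and their method names are hypothetical and are not taken from the course material.

```python
from collections import deque


class Stack:
    """Last-in, first-out container backed by a Python list (illustrative only)."""

    def __init__(self):
        self._items = []

    def push(self, item):
        self._items.append(item)

    def pop(self):
        # list.pop() removes and returns the most recently pushed item
        return self._items.pop()

    def is_empty(self):
        return not self._items


class Queue:
    """First-in, first-out container backed by collections.deque (illustrative only)."""

    def __init__(self):
        self._items = deque()

    def enqueue(self, item):
        self._items.append(item)

    def dequeue(self):
        # deque.popleft() removes and returns the oldest item in constant time
        return self._items.popleft()

    def is_empty(self):
        return not self._items
```

A deque is used for the queue because removing from the front of a plain list shifts every remaining element, while popleft() does not.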

2. SEARCHING AND SORTING 1 WEEK

Learn the most fundamental searching and sorting algorithms and design techniques.

1. SEARCHING
2. SORTING
3. TWO POINTERS
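
To give a flavour of the searching and two-pointers topics above, here is a minimal Python sketch of binary search and of a two-pointer scan over a sorted list. The function names binary_search and pair_with_sum are hypothetical examples, not part of the course material, and both assume the input list is already sorted in ascending order.

```python
def binary_search(sorted_values, target):
    """Return the index of target in sorted_values, or -1 if it is absent."""
    low, high = 0, len(sorted_values) - 1
    while low <= high:
        mid = (low + high) // 2
        if sorted_values[mid] == target:
            return mid
        if sorted_values[mid] < target:
            low = mid + 1   # target can only be in the right half
        else:
            high = mid - 1  # target can only be in the left half
    return -1


def pair_with_sum(sorted_values, target):
    """Return indices (left, right) of two values summing to target, or None."""
    left, right = 0, len(sorted_values) - 1
    while left < right:
        total = sorted_values[left] + sorted_values[right]
        if total == target:
            return left, right
        if total < target:
            left += 1   # need a larger sum, so move the left pointer up
        else:
            right -= 1  # need a smaller sum, so move the right pointer down
    return None
```

Binary search halves the search range on every step, and the two-pointer scan visits each element at most once.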

3. ALGORITHM ANALYSIS + RECURSION 1 WEEK

Learn how to assess the efficiency of your code using algorithm analysis techniques and learn to write recursive algorithms.

1. ALGORITHM ANALYSIS
2. TIME AND SPACE COMPLEXITY
4. RECURSION

4. ADVANCED DATABASE PROGRAMMING USING PANDAS 1 WEEK

Learn and implement advanced wrangling functions and techniques in Pandas related to date-time, multi-column aggregation, hierarchical indexing, and more.

1. ADVANCED DATA WRANGLING WITH PANDAS - I
2. ADVANCED DATA WRANGLING WITH PANDAS - II

5. PYTHON & SQL LAB 2 WEEKS

In this competitive assignment, you will solve a variety of programming questions in both SQL and Python in a timed environment. You will also demonstrate one of the questions through a video submission to help improve your interviewing skills.

1. SQL: TIMED TEST + ASSIGNMENT
2. PYTHON: TIMED TESTS I & II
3. VIDEO SUBMISSION

COURSE 6 - CAPSTONE PROJECT

1. CAPSTONE PROJECT 4 WEEKS

Solve an end-to-end real-life industry problem from a wide variety of domains.

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

SPECIALISATION: DATA ENGINEERING


COURSE 3: DATA ENGINEERING - I

1. DATA MANAGEMENT AND RELATIONAL DATABASE MODELLING 1 WEEK

Understand the concepts of data management and learn to model data in a relational database.

1. ENTERPRISE DATA MANAGEMENT
2. RELATIONAL DATABASE MODELLING
3. NORMAL FORMS AND ER DIAGRAMS

2. INTRODUCTION TO BIG DATA (OPTIONAL) 1 WEEK

In this module, you will learn what big data is, its various characteristics, and its determining factors. You will also get an idea of the various sources of big data and the wide range of big data applications in different industries such as retail, healthcare, and finance.

1. 4VS OF BIG DATA
2. BIG DATA: INDUSTRY CASE STUDIES

3. INTRODUCTION TO CLOUD AND AWS SETUP 1 WEEK

Understand what the cloud is and set up your AWS account, which will be required during the program.

1. INTRODUCTION TO CLOUD
2. AWS SETUP

4. INTRODUCTION TO HADOOP AND MAPREDUCE PROGRAMMING 1 WEEK

Understand the world of distributed data processing and storage with Hadoop. Learn to write MapReduce jobs in Python.

1. CONCEPTS RELATED TO DISTRIBUTED COMPUTING
2. HADOOP DISTRIBUTED FILE SYSTEM
3. MAPREDUCE PROGRAMMING IN PYTHON
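
The MapReduce idea named above can be sketched in plain Python as a word count. This is an in-process simulation of the map, shuffle, and reduce steps, not a Hadoop job, and it does not use any Hadoop API; the function names map_phase, shuffle_phase, reduce_phase, and word_count are hypothetical.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

In a real cluster the map and reduce functions would run on different machines and the framework would handle the grouping; here everything runs in one process to keep the data flow visible.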

5. ASSIGNMENT (OPTIONAL) 0 WEEK

Solve an assignment to brush up on the skills learnt so far.

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

6. NOSQL DATABASES AND APACHE HBASE NOSQL DATABASES AND MONGODB (OPTIONAL) 2 WEEKS

In this competitive assignment, you will solve a variety of programming questions in both SQL and Python in a timed environment. You will also demonstrate one of the questions through a video submission to help improve your interviewing skills.

1. CONCEPTS OF NOSQL DATABASES
2. INTRODUCTION TO APACHE HBASE
3. HBASE PYTHON API
4. COMPARISON OF NOSQL DATABASES

7. DATA WAREHOUSING (OPTIONAL) 0 WEEK

Understand the intricacies behind designing a data warehouse and a data lake for specific use cases.

1. INTRODUCTION TO DATA WAREHOUSE AND DATA LAKES
2. DESIGNING DATA WAREHOUSING FOR AN ETL DATA PIPELINE
3. DESIGNING DATA LAKE FOR AN ETL DATA PIPELINE

8. DATA INGESTION WITH APACHE SQOOP AND APACHE FLUME 1 WEEK

Get familiar with the challenges involved in data ingestion. Use Sqoop and Flume to ingest structured and unstructured data into Hadoop.

1. INTRODUCTION TO DATA INGESTION
2. STRUCTURED DATA INGESTION WITH SQOOP
3. UNSTRUCTURED DATA INGESTION WITH FLUME

9. MAPREDUCE PROGRAMMING ASSIGNMENT 0 WEEK

Practice MapReduce programming on a big dataset.

1. PROBLEM STATEMENT AND SAMPLE DATASET

COURSE 4 - DATA ENGINEERING - II


1. HIVE & QUERYING 2 WEEKS

Manage and query a data warehouse with Apache Hive. Learn to write optimized HQL for large-scale data analysis.

1. FUNDAMENTALS OF APACHE HIVE
2. WRITING HQL FOR DATA ANALYSIS
3. PARTITIONING AND BUCKETING WITH HIVE

2. ASSIGNMENT (OPTIONAL) 0 WEEK

Solve an assignment to brush up on the skills learnt so far.

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

3. AMAZON REDSHIFT 1 WEEK

Learn to deploy a Redshift cluster and use it for querying data.

1. DATA WAREHOUSING WITH REDSHIFT
2. ANALYSE DATA WITH REDSHIFT

4. INTRODUCTION TO APACHE SPARK 1 WEEK

Get introduced to Apache Spark, a lightning-fast big data processing engine.

1. SPARK ARCHITECTURE
2. RDD, DATAFRAME API, SPARK SQL

5. PROJECT: ETL DATA PIPELINE 2 WEEKS

Make use of Sqoop, Redshift & Spark to design an ETL data pipeline.

1. INTRODUCTION AND PROBLEM STATEMENT
2. GRADING RUBRICS AND SUBMISSION

6. AWS CLOUD INFRASTRUCTURE (OPTIONAL) 2 WEEKS

Do a deep dive into AWS Cloud.

1. THE AWS CLOUD PLATFORM
2. BUILDING AND DEPLOYING VIRTUAL MACHINES
3. AWS CLOUD STORAGE SOLUTIONS
4. APPLICATION DEPLOYMENT
5. CLOUD ADMINISTRATION AND SECURITY
6. LOAD BALANCING AND BACKUP STRATEGIES
7. CLOUD AUTOMATION
COURSE 5 - DATA ENGINEERING - III
1. OPTIMISING SPARK FOR LARGE-SCALE DATA PROCESSING 1 WEEK

Use PySpark to create large-scale data processing applications.

1. RUNNING SPARK ON MULTINODE CLUSTER
2. SPARK MEMORY & DISK OPTIMISATION
3. OPTIMISING SPARK CLUSTER ENVIRONMENT

2. APACHE FLINK(OPTIONAL) 0 WEEK

Get introduced to Apache Flink and learn to query batch data. Use the DataStream API to create a stream processing application.

1. INTRODUCTION TO APACHE FLINK
2. BATCH DATA PROCESSING WITH FLINK
3. STREAM PROCESSING WITH APACHE FLINK
4. SQL API

3. REAL-TIME DATA STREAMING WITH APACHE KAFKA 1 WEEK

Understand the basics of big data and cloud and learn to work with an EMR cluster on a cloud-based service.

1. INTRO TO REAL-TIME DATA PROCESSING ARCHITECTURES
2. FUNDAMENTALS OF APACHE KAFKA
3. SETTING UP KAFKA PRODUCER AND CONSUMER
4. KAFKA CONNECT API & KAFKA STREAMS

4. REAL-TIME DATA PROCESSING USING SPARK STREAMING 0 WEEK

Learn about the real-time data processing architecture of Apache Spark. Build Spark Streaming applications to process data in real-time.

1. SPARK STREAMING ARCHITECTURE
2. SPARK STREAMING APIS
3. BUILDING STREAM PROCESSING APPLICATION WITH SPARK
4. COMPARISON BETWEEN SPARK STREAMING AND FLINK

5. ASSIGNMENT (OPTIONAL) 0 WEEK

Solve an assignment to brush up on the skills learnt so far.

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

6. BUILDING AUTOMATED DATA PIPELINES WITH AIRFLOW 1 WEEK

Automate data pipelines with Airflow.

1. FUNDAMENTALS OF AIRFLOW
2. WORKFLOW MANAGEMENT WITH AIRFLOW
3. AUTOMATING AN ENTIRE DATA PIPELINE WITH AIRFLOW

7. ANALYTICS USING PYSPARK 1 WEEK

Use PySpark to do EDA and predictive analysis using Spark’s ML library.

1. EXPLORATORY DATA ANALYSIS WITH PYSPARK
2. PREDICTIVE ANALYSIS WITH SPARK MLLIB

8. PROJECT: REAL-TIME DATA PROCESSING 1 WEEK

Build an end-to-end real-time data processing application using Spark Streaming and Kafka.

1. INTRODUCTION AND PROBLEM STATEMENT
2. GRADING RUBRICS AND SUBMISSION

COURSE 6 - CAPSTONE PROJECT
CAPSTONE PROJECT 4 WEEKS

The capstone project will stitch all the components of data engineering together.

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

