PG Certification in Data Science

The AI-Powered Post Graduate Certification in Data Science by WHY TAP is a 7-month dual certification program designed to equip learners with essential skills in data analytics and data science. The program covers a range of topics including Excel, SQL, machine learning, deep learning, and NLP, with hands-on projects and a capstone project to ensure job readiness. Graduates receive certifications as a Certified Data Analyst and an AI-Powered Post Graduate Certificate in Data Science, along with career support services.

Uploaded by

ktamil77291

AI-Powered Post Graduate Certification in Data Science

Data is the new language. AI is the new logic. Learn both and the world listens.
Sathishkumar Kannan, MS (UK)
Founder & CEO, WHY Global Services
Founder & CEO, FaceSync Technologies Pvt Ltd

Total Duration: 7 Months

• Data Analytics: 3 Months

• Data Science & AI: 4 Months

• Mode: Blended Learning (Offline | Online)

• Certification: Dual - Certified Data Analyst + PG Certificate in Data Science

Program Overview

The AI-Powered Post Graduate Certification in Data Science by WHY TAP is an end-to-end,
industry-relevant program crafted to help learners master data-driven decision-making,
business insights, machine learning, and AI-powered solutions.

The program begins with a rigorous foundation in Data Analytics, covering Excel, SQL, Power
BI, and Python for analysis, and gradually progresses into core and advanced data science,
machine learning, deep learning, natural language processing, and model deployment.

This comprehensive pathway ensures learners graduate with hands-on experience, industry-
aligned tools, and the confidence to tackle real-world data problems. The program awards
two certifications, ensuring maximum credibility for both analytics-focused and data science-
oriented job roles.

Certifications Awarded

• Certified Data Analyst – upon completion of the Data Analytics phase (3 months)
• AI-Powered Post Graduate Certificate in Data Science – after completing the full 7-month program

Course Curriculum

Module 1: Spreadsheet & SQL Analytics

• Introduction to Data Structures in Excel & Google Sheets
• Data Cleaning and Validation Techniques
• Logical, Text, Lookup, and Date Functions (IF, VLOOKUP, INDEX-MATCH, etc.)
• Pivot Tables, Pivot Charts, and Dynamic Dashboards
• SQL Basics: SELECT, WHERE, ORDER BY, GROUP BY
• Aggregation and Filtering with HAVING
• Multi-table Queries using JOINS (INNER, LEFT, RIGHT)
• Subqueries, Nested Queries, and CASE Statements
• Introduction to Window Functions (RANK, DENSE_RANK, LEAD, LAG)
• Writing Optimized SQL Queries for Business Scenarios
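
As a rough illustration of the SQL topics in this module, the sketch below combines a JOIN, GROUP BY, HAVING, and ORDER BY in one query. It is not course material: the `customers` and `orders` tables and their rows are hypothetical, and Python's built-in `sqlite3` module is assumed only so the query can run without a database server.

```python
import sqlite3

# Hypothetical schema and rows, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'North'), (2, 'South'), (3, 'North');
    INSERT INTO orders VALUES
        (1, 1, 120.0), (2, 1, 80.0), (3, 2, 40.0), (4, 3, 200.0);
""")

# Revenue per region, keeping only regions whose total exceeds 100.
# WHERE filters rows before grouping; HAVING filters the groups afterwards.
query = """
    SELECT c.region, COUNT(o.id) AS order_count, SUM(o.amount) AS revenue
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.region
    HAVING SUM(o.amount) > 100
    ORDER BY revenue DESC
"""
rows = conn.execute(query).fetchall()
conn.close()

for region, order_count, revenue in rows:
    print(region, order_count, revenue)
```

With these sample rows, only the North region passes the HAVING filter, because the South region's total of 40 falls below the threshold.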

Module 2: Business Intelligence Tools (Power BI / Tableau)

• Introduction to Data Visualization & BI Concepts
• Connecting to Multiple Data Sources (CSV, Excel, SQL, Web)
• Data Modeling and Relationship Mapping
• Creating Calculated Columns and Measures using DAX
• Building Interactive Reports with Filters and Slicers
• Drill-through Pages, Bookmarks, and Custom Tooltips
• Creating Executive-Level KPI Dashboards
• Publishing and Sharing Reports Online
• Report Optimization and Visual Storytelling Techniques
• Best Practices for Dashboard Design and User Interaction

Module 3: Python for Analytics & EDA

• Python Syntax: Variables, Data Types, Conditionals, Loops
• Functions, Lambda Expressions, List Comprehension
• Working with Files (CSV, Excel, JSON)
• Pandas for DataFrame Operations and Cleaning
• NumPy for Mathematical Operations and Arrays
• Matplotlib & Seaborn for Visualization
• Exploratory Data Analysis: Distribution, Correlation, Outliers
• Handling Missing Data and Duplicate Records
• Introduction to Jupyter, Google Colab, and VS Code
• Introduction to Git & GitHub for Version Control

Module 4: Data Preprocessing & Feature Engineering

• Handling Missing Values with Imputation Techniques
• Identifying and Treating Outliers
• Scaling Methods: MinMax, StandardScaler, RobustScaler
• Encoding Categorical Variables: One-Hot, Label, Ordinal, Frequency
• Feature Binning and Transformation (Log, Box-Cox, etc.)
• Feature Interaction and Polynomial Features
• Feature Selection Techniques: Filter, Wrapper, Embedded
• Merging, Joining, and Concatenating Datasets
• Creating Pipelines using Scikit-learn
• Fetching & Parsing Data from APIs

Module 5: Applied Statistics & Machine Learning

• Descriptive Statistics: Mean, Median, Variance, Std Dev
• Probability Theory: Basic, Conditional, Bayes’ Theorem
• Common Distributions: Normal, Binomial, Poisson
• Inferential Statistics: Confidence Intervals, Sampling
• Hypothesis Testing: t-test, ANOVA, Chi-Square
• Introduction to Machine Learning Workflow
• Regression Models: Linear, Ridge, Lasso
• Classification Models: Logistic Regression, Decision Tree, Random Forest, KNN
• Model Performance Metrics: Confusion Matrix, Precision, Recall, ROC-AUC, F1 Score
• Cross Validation, Grid Search, and Hyperparameter Tuning
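
To make the classification metrics listed above concrete, here is a minimal sketch that derives precision, recall, and F1 score from confusion-matrix counts in plain Python. The helper names and the sample labels are hypothetical; in practice a library such as Scikit-learn would normally compute these.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary classification problem."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1  # correctly predicted positive
        elif pred == positive:
            fp += 1  # predicted positive, actually negative
        elif truth == positive:
            fn += 1  # missed a real positive
        else:
            tn += 1  # correctly predicted negative
    return tp, fp, fn, tn


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1, returning 0.0 when undefined."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical labels, invented for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

counts = confusion_counts(y_true, y_pred)
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(counts, precision, recall, f1)
```

Precision is the share of predicted positives that are correct, recall is the share of actual positives that are found, and F1 is their harmonic mean.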

Module 6: Unsupervised Learning & Time Series

• Clustering Concepts and Applications
• K-Means, Agglomerative Hierarchical Clustering
• DBSCAN for Anomaly Detection
• Dimensionality Reduction: PCA, t-SNE
• Association Rule Mining: Apriori, FP-Growth
• Time Series Concepts: Trend, Seasonality, Residual
• Smoothing Techniques and Stationarity Checks
• ARIMA, Auto-ARIMA, Seasonal Decomposition
• Forecasting with Facebook Prophet
• Evaluating Forecast Accuracy using RMSE, MAPE, MAE
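
The three forecast accuracy measures named above can be sketched in a few lines of plain Python. The function name and the sample series are hypothetical, and MAPE is expressed here as a percentage, which is one common convention.

```python
import math


def forecast_errors(actual, forecast):
    """Return MAE, RMSE, and MAPE (in percent) for two equal-length series."""
    if not actual or len(actual) != len(forecast):
        raise ValueError("series must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        # MAPE divides by the actual value, so it is undefined at zero.
        raise ValueError("MAPE is undefined when an actual value is zero")

    errors = [a - f for a, f in zip(actual, forecast)]
    n = len(errors)
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mape = 100 * sum(abs(e / a) for e, a in zip(errors, actual)) / n
    return {"mae": mae, "rmse": rmse, "mape": mape}


# Hypothetical demand figures, invented for illustration.
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 190.0, 380.0]

metrics = forecast_errors(actual, forecast)
print(metrics)
```

RMSE penalizes large misses more heavily than MAE because errors are squared before averaging, while MAPE scales each error by the actual value so that series of different magnitudes can be compared.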

Module 7: Deep Learning, NLP & Model Deployment

• Neural Network Architecture and Layers
• Activation Functions, Loss Functions, Optimizers
• ANN Implementation using TensorFlow/Keras
• Introduction to Convolutional Neural Networks (CNNs)
• Basics of Natural Language Processing (NLP)
• Text Cleaning, Tokenization, Lemmatization
• Bag of Words, TF-IDF, and Word2Vec
• Sentiment Classification using Scikit-learn
• Building Web Apps using Streamlit and Flask
• Creating APIs with FastAPI and Deploying on Render/GitHub

Program Outcomes

• Dual Certifications: Certified Data Analyst & PG Certificate in Data Science
• 15+ Hands-on Projects across business domains
• Real-world Capstone with evaluation and demo day
• Resume, LinkedIn & GitHub portfolio optimization
• Career training: Mock Interviews, Job Referrals, Internship Access
• 100% Placement Assistance via WHY TAP’s Industry Network

Tools & Technologies You Will Learn

• Excel & Google Sheets – Data cleaning, formula logic, KPI dashboards
• SQL – Querying, joins, aggregations, subqueries, window functions
• Power BI – Data modeling, visual dashboards, DAX, real-time reporting
• Python – Programming, scripting, automation, EDA
• Pandas & NumPy – Data manipulation, analysis, and computation
• Matplotlib & Seaborn – Data visualization for insights and storytelling
• Scikit-learn – Building machine learning models and evaluation metrics
• Statsmodels & SciPy – Applied statistics, A/B testing, hypothesis validation
• TensorFlow & Keras – Deep learning, artificial neural networks, model training
• NLTK / SpaCy / Word2Vec – Text preprocessing and natural language processing
• Facebook Prophet – Time series forecasting
• Streamlit & Flask – Web app development and model deployment
• FastAPI – Building scalable APIs for data-driven applications
• Git & GitHub – Version control, collaboration, and portfolio publishing
• Google Colab & Jupyter Notebooks – Interactive code environments

Skills You Will Master

• Business Analysis & KPI Reporting
• Data Visualization & Dashboard Design
• Programming & Data Automation
• Data Cleaning, Wrangling & Feature Engineering
• Statistical Thinking & Experiment Design
• Supervised & Unsupervised Machine Learning
• Time Series Forecasting & Demand Prediction
• Natural Language Processing (NLP)
• Deep Learning Model Implementation
• GitHub Portfolio Development

Summary

The AI-Powered Post Graduate Certification in Data Science by WHY TAP is a
comprehensive 7-month dual certification program that transforms learners into job-ready
data professionals.

The program begins with a strong 3-month foundation in Data Analytics, covering Excel,
SQL, Power BI, and Python for business intelligence and decision-making. It then advances
into Data Science and AI over 4 months, covering machine learning, deep learning, NLP,
time series forecasting, and model deployment using real-world tools and technologies.

Learners gain hands-on experience through 15+ practical projects, industry use cases, and a
full-fledged capstone project, culminating in a powerful GitHub portfolio and job-readiness
training.

Graduates receive two certifications:

• Certified Data Analyst

• AI-Powered Post Graduate Certificate in Data Science

The program is ideal for students, working professionals, or career switchers with no prior
coding background. It includes placement assistance, resume and LinkedIn optimization,
mock interviews, and access to WHY TAP’s job portal and industry network.
