Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
58 views8 pages

Data Analyst Syllabus (For Aundh)

Uploaded by

userabhay1111111
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
58 views8 pages

Data Analyst Syllabus (For Aundh)

Uploaded by

userabhay1111111
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

DATA ANALYST SYLLABUS

1. Introduction to Data Analyst


• Types of Data
Qualitative/ Categorical
Quantitative/ Numerical
• Exploratory Data Analysis
Learn the Structure of the Data
Uncover Patterns or Errors
Find relationships and insights
• EDA Steps
o Data Preparation
Sources of Data
Data Collection
Data Cleaning and Wrangling
Data Sanity Check
o Data Exploration (Learn about each variable, Compute Summary Statistics, Find Correlation
and Trends, Visualize the data)
o Hypothesis Generation and Further analysis
• Metrics and Analysis
o Understanding different types of metric
o User journeys and investigating abnormal behaviours
• Data Exploration
o Frequency Distribution
o Measure of Central Tendency (Mean, Median, Mode)
o Measure of Dispersion (Variance and Standard Deviation)
o Z-score
• Data Visualization
o Types of Charts
o Selecting Charts
▪ By Use-Case
▪ By Data Types
o Styling Charts
o Explainability of the Chart
2. Data Preparation using Excel
• Introduction to Excel
• Formatting Cells
• Keyboard shortcuts
• Copy Paste in Excel
• Functions
• Filters
• Sorting
• Loading and Cleaning Data
o Gathering Raw Data
o Removing Duplicates
o Fill Options
o Data Validation
• Numerical Data Types
o Operators
o Range, Average, Count, Rounding
o Variance
o Summarising
• Handling Text Data Types
o Categorical Variables
o Cases and Spaces
o Cleaning Strings
• Working with Dates and TimeStamps
o Operators
o Aggregations
o Time Between
• Logical Functions
o AND, OR, NOT, IF
o Combining logical functions
o Aggregate Logical Functions
• Data Protection
o Protecting Sheets & Cells

3. Data Analysis & Visualisation Using Excel


• Summary Statistics
o Measure of Central Tendencies
o Range, Variance
• Referencing
o LOOKUP functions
o Index Match
• Summarising with Pivot tables
o Introduction to pivot tables
o Slicer, Multiple Pivot tables
• Data Visualisation
o Charts
o Formatting Charts
o Building Dashboards
• Using Pivot Tables
• What If-Analysis
o Scenario Analysis
o Sensitivity Analysis
o Growth Rate
o What-If Analysis in Excel
• Forecasting
o Seasonality
o Reducing Bias
o Confidence Intervals
o Moving Averages
o Weighted Averages
o Techniques in Excel

4. Data Analysis using SQL


• Introduction to Databases
• Difference between SQLite, MySQL, PostgreSQL etc
• Database Terminologies (Tables/ Relations, Record/Raws, Schema, Field, Unique Identifiers,
Primary Key, Relationships, Foreign Keys, Constraints)
• Overview of SQL (DDL, DML, Queries)
• Data Types (NULL, INTEGER, REAL, FLOT, NUMERIC, TEXT, CHAR / VARCHAR, BLOB)
• DDL (Create DB, Create Table, etc)
• Anatomy of a Query
• SELECT, FROM, WHERE Clause
• Aliasing
• Operators
o Relational Operators (<, <=,>,>=, +, < >, !=)
o Logical Operators (AND, OR, NOT)
o LIKE, BETWEEN
• GROUPBY, DISTINCT, HAVING Clause
• SORT, LIMIT
• Order of Queries

5. Advanced Data Analysis Using SQL


• Numerical Data Types
o Numeric Types, Operators
o Range, Average and Mean
o Variance, Rounding and Summarising
• Exploring Distributions
• Summarising
o Correlation function
o Median/Percentile Discrete and Continous
• Character Data Types
o Data Types, Categorical Variables
o Grouping, Counting and Ordering
o Cases and Spaces
o Searching in Strings, Trimming Spaces
o Splitting, Concatenating, Full Text Search
• Working with Dates and Timestamps
o Types and Formats
o Comparisons and Operations
o Components and Aggregation
o Aggregating with date/time series
o Time between Events
▪ Lead and Log
▪ Average time between events
▪ Change in time series
• Working with Arrays
o Arrays in PostgreSQL (CREATE, INSERT etc)
o Accessing Arrays
o Searching Arrays
o Array Functions and Operators

6. Data Manipulation with SQL


• Joins and Set Operations
o Relationships between Tables
o Inner Joins
o Outer Joins
o Joins on Join
o Cross Joins
o Self Joins
o Equi and Non Equi Joins
o Set Operators
• Data Manipulation Techniques
o Case Statements
o Subqueries
o Correlated Subqueries
o Nested Subqueries
o Common Table Expression
o Window Functions

7. Database Design with SQL


• Introduction to OLTP and OLAP
• Storing Data
o Types of Data
o Data Warehouses
o Data Lakes
o ETL and ELT
• Data Modelling and Schema Design
o Conceptual Data Model
o Logical Data Model
o Physical Data Model
• Dimension Modelling
o Fact Tables, Dimension Tables
o Star Schema
o Snowflake Schema
• Normalization and Denormalization
• Database Views
• Scalability
o Partitioning (Vertical Partitioning, Horizontal Partitioning)
o Sharding
• Data Integration
o Data Sources, Transformation, Unified Data Model
o Update Cadence
o ETL

8. Introduction to Data Visualisation


• Data Visualisation
o Why is visualisation important?
o Visualisation Framework
o Chart Types
o Trend Visualisation
o Tips and Tricks for Visualisation
• Data Story Telling
o What is Data Storytelling?
o Biases- When do they appear?
o Biases- Formal Terminology

9. Data Preparation & Exploration with Tableau


• Introduction to Tableau
• Loading Data in Tableau
o Data Sources and Loading Data Types
o Joins and Relationships
o Fields in Data and their types
o Dimensions of Measures
o Data Roles
o Navigating UI elements
• Combining Data
o Unions, Joins, Relationships
• Filtering and Sorting
o Types of filters
o Filtering on Dimensions, Measures
o Sorting and Filtering through Selections
• Aggregation
o Aggregating Measures
o Aggregating Dimensions
o Scatter Plot and Aggregations
• Calculated Fields and Table Calculations
o Functions/ Operators
o Formatting Numbers
o Type Conversions
o Level of Details Expression (LOD)
o Table Calculations

10. Data Visualisation


• Chart Types with Tableau
• Exploratory Analysis using Visualisation of Trends
o Reference Lines, Trend Lines and Forecasting
o BarCharts and Line Charts for discrete and continuous data
o Discrete Time Analysis
o Quick Tables
o Formatting through Colours
o Bubble Chart
• Mapping your Data
o Geographic Data Types
o Geocoding
o Creating Maps
• Dashboards and Stories
o Introduction to Dashboards
o Introduction to Stories (Story Points,etc)
o Creating Dashboards and Stories
o Building a KPI Dashboard
o Updating the Tooltip
• Analysis
o Seasonality Analysis
o YoY Analysis
o YTD Calculation
o Calculating Growth
o CAGR Analysis
o Moving Rolling Calculations
o Cohort Analysis

11. Essentials of Python for Data Analysis


• Python Environment setup
• Python Data Types
o Variables
o Python is Dynamically Typed
o Rules for Naming variables and Naming Conventions
o Overview of Python Data Types
▪ Numeric
▪ Sequence
▪ Set
▪ Dictionary
o Literals or constants
o Type Conversion
• Operators and Expression
o Arithmetic Operators
o Expressions
o Operator Precedence
o Arithmetic Operators on All Data Types
o Assignment Operators
o Relational Operator
o Logical Operators
o Boolean
o Special Operators
o Mathematical
• Conditional Statements
• Loops & Control Flow
• String
• Lists
• Tuple

12. Advanced Data Analysis using Python


• NumPy
o Introduction to NumPy and importing NumPy
o Array Creation
o NumPy Attributes
o Creating Different Types of Arrays
o Accessing elements of an Array
o NumPy Slicing
o Reshaping & Flattening an Array
o Data Types in NumPy
o Operators
o Data Analysis using NumPy
• Pandas
o Datasets in Python
o DataFrame
o Accessing Data
o Filtering
o Traversing Data Frame
o Sorting
o Merging-Data/ Joins in Pandas
▪ Inner Joins
▪ Outer Joins
▪ Self Joins
▪ Merging on indexes
▪ Filtering Joins
• Concatenating Data Vertically
• Data Integrity Check
• Reshaping Data

13. Data Visualisation using Python


• Visualisation
o Introduction to MatPlotlib
o Line Plots, Scatter Plots, Histograms, BarPlots and Vertical BarPlots
o Customising Plots
o Introduction to SeaBorn
o Relational Plots and SubPlots
o Customising Scatter Plots
• EDA Using Pandas
o Feature Engineering
o Summary Statistics
o Data Validation and Cleanup
o Group Summary Statistics
o Pivot Tables
o Explicit Indexes
o Checking for Missing Values
o Handling Outliers
o Patterns over date time
o Correlation
o Updating CSV Files

14. Interview Prep & Portfolio Building


• Data Analysis Process Interview questions
• Technical interview questions
• SQL interview questions
• Building your resume
• Building your data Portfolio
• Graduation Test & Projects

BONUS CONTENT
Descriptive and Inferential Statistics
• Z-core
• Central Limit Theorem, p-value
• Hypothesis testing
• Power Analysis

◆◆ NEWGEN CORPORATE TRAINING CENTER ◆◆


OFFLINE / ONLINE BATCH

Address: NEWGEN CORPORATE TRAINING CENTER, Bremen Chowk, Office No.217, West Avenue,

Above Atithi Hotel, Aundh, Pune – 411027


URL: http://www.newgensoftech.com

Mail Id: [email protected]

Insta Id: @Newgen_Softech

You might also like