AZURE DATA
ENGINEERING
Course Syllabus
Page #1
Trainer's
Profile
16 years of professional experience
10 years of training experience in Data Analytics and Data
Engineering
Microsoft certified in Azure Data Engineering, SQL and
POWER BI.
Over 4000+ hours of training delivered.
Trained over 1600+ students across multiple technologies
from people in different industries.
Expertise in batch, one-to-one and corporate trainings.
Creator and lead instructor @cloudanddatauniverse
youtube channel.
Over 100 projects delivered across multiple technologies.
Page #2
About
Course
In this course, you will learn Azure Cloud, ADF, SQL
Server, Python, PySpark, Databricks, Synapse and Azure
DevOps.
You will be trained by following a scenario-based approach.
Our instructor Mr. Yusuf Didighar has immense experience in
training & consulting and we are sure your learning journey
with us indeed will be path-breaking for your career!
The best thing about this training is it will be live and
interactive. This will help you to clear your doubts and make
your understanding better as we progress.
Session recordings will be shared for future reference.
A soft copy of the Course completion certificate will be
provided at the end of the batch.
CDU will help with building a resume as well as interview
preparation.
Page #3
Azure Data Engineering Syllabus
Azure Cloud
Introduction
Cloud Computing
Top cloud providers
Benefits of Cloud
Computing
Introduction to Azure
Sign up for Azure Trial
How to upgrade to a pay-
as-you-go
Provision Azure Data lake
gen2
Provision Azure SQL DB
Provision Azure Data
Factory
Provision Azure Databricks
Provision Azure Synapse
CLOUD AND DATA UNIVERSE
Page #4
Azure Data Engineering Syllabus
SQL
Introduction Advanced SQL Concepts
Introduction to SQL Creating and Managing
Different vendors in SQL Views
Installation of Microsoft Creating and Managing
SQL Server Stored Procedures
Installing Sample Database Creating and Managing
Functions
Creating and Managing
Index
SQL Fundamentals
CTE
Querying tables
Query Optimization
Inserting data
Creating, Deleting and
Managing tables
Data Warehousing
Sorting and Filtering Data
Introduction
Grouping and Pivoting
OLAP & OLTP
Data
Star & SnowFlake Schema
Creating constraints on
Incremental Load
tables
ETL process
Importing and Exporting
Facts & Dimensions
data from various sourses
SCD types
Joining tables
ACID Properties
String Functions
Date Functions
Subquery
Case statement
Window functions
Data types
Combining tables
CLOUD AND DATA UNIVERSE
Page #5
Azure Data Engineering Syllabus
Azure Data Factory
Introduction Transformations
What is ETL? Select
ETL vs ELT Sort
Top tools in ETL Filter
Introduction to Azure Data Derived
Factory Conditional split
Cast
Surrogate
Aggregate
Activities
Pivot
Copy Data
Unpivot
Set Variable
Rank
Get Metadata
Window
For Each
Union
Append Variable
Lookup
Delete
Exists
Lookup
Join
Stored procedure
Flatten
Filter
Parse
IF
Stringify
Switch
Execute Pipeline
Script ADF Features
Validation Introduction to Dataflow
Integration Runtimes
Incremental loads
Triggers
Monitoring Pipelines
CLOUD AND DATA UNIVERSE
Page #6
Azure Data Engineering Syllabus
Python Fundamentals
Introduction Libraries
What is Python? Matplotlib
Installation Numpy
Pandas
Fundamentals Advanced
Print Functions-Introduction
Variables Functions-Scope of variable
Concatenation Functions-args&kwargs
Data Types Lambda
Interpolation map
operators filter
If statement reduce
loops Classes
Collections - List
Collections - Tuple
Collections - Set
Collections - Dictionary
List comprehensions
Date Functions
Exception handling
string functions
Regex
CLOUD AND DATA UNIVERSE
Page #7
Azure Data Engineering Syllabus
Big Data Fundamentals
Introduction
Introduction to Big data
History & evolution of Big
data
Understanding Distributed
computing
Hadoop ecosystem
overview
Azure data engineering
stack
Different file types
CLOUD AND DATA UNIVERSE
Page #8
Azure Data Engineering Syllabus
Spark & Databricks
Spark DataFrames
Introduction Introduction to DataFrame
Spark components & API Creating data frame from
Spark architecture RDD
Transformations Imposing schema
Actions Read options
Lineage graph Write options
DAG Rename columns
Add columns
Select columns
RDD Drop columns
Remove duplicates
Introduction to RDD
Sort columns
Creating RDD from text file
Filter
Various operations on RDD
Union
Conditional columns
Casting
Databricks Aggregations
Introduction Pivot
Sign up for Databricks Unpivot
Community Edition Window functions
Setting up Cluster Joins
Understanding dbutils Join Algorithms
Accessing & mounting Partitioning
Azure Data lake gen2 Bucketing
Creating & monitoring jobs
CLOUD AND DATA UNIVERSE
Page #9
Azure Data Engineering Syllabus
Synapse
Introduction
Getting started with
Synapse
Setting up synapse
Synapse Fundamentals
Pool concept
Dedicated & Serverless
pools
Pricing
Databases
Spark pool
Creating pipelines
Creating notebooks
CLOUD AND DATA UNIVERSE
Page #10
Azure Data Engineering Syllabus
Azure DevOps
Introduction
Introduction to DevOps
Getting started with Azure
DevOps
Setting up Azure
components with Azure
DevOps
Code versioning
Code versioning
Creating Repository
Creating branches
Setting policies
Code check-in
Pull requests
Code merging
Deployment
Deployment
Creating build pipelines
Creating release pipelines
CLOUD AND DATA UNIVERSE
Page #11