Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
5 views10 pages

Ultimate Data Engineering Masters Program v1

Uploaded by

reheki6971
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views10 pages

Ultimate Data Engineering Masters Program v1

Uploaded by

reheki6971
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

7 months Intensive Course

Validity - 2 Years
Key Tools and Technologies Covered in this
Program

MICROSOFT
AZURE

AZURE DATA
FACTORY
PYSPARK | AZURE | AWS
END-TO-END PROJECTS
CURRICULUM
Milestone 1 - Distributed Processing Fundamentals &
PySpark
Week 1 : Big Data - The Big Picture
Week 2 : Distributed Storage & Data Lake
Week 3 : Distributed Processing Fundamentals
Week 4 : Apache Spark Core APIs
Week 5 : Spark APIs - Dataframes & Spark SQL
Week 6 : Spark Dataframe Transformations
Week 7 : Apache Spark Caching In-depth
Week 8 : Apache Spark Architecture
Week 9 : Apache Spark Internals
Week 10 : Apache Spark Optimizations
Week 11 : More on Spark Optimizations
Week 12 : GIT GITHUB & CICD
Week 13 : Apache Hive

Milestone-1 Power Modules to Kickstart your Career


Apache Spark Project
Apache Spark Interview Questions
Resume LinkedIn & Naukri Profile Building
Data Structures & Algorithms
Milestone 2
Week 14 : Azure Cloud Fundamentals
Week 15 - 22 : Azure Databricks In-Depth (8 Weeks)
Week 23 - 24 : Azure Data Factory (2 Weeks)

Milestone-2 Power Modules to Kickstart your Career


Azure Cloud Capstone Project
Azure Interview Questions
Overview of Azure Databricks In-Depth (8 Weeks)
Module
→ What is Databricks and Why Databricks
→ Databricks Free Edition vs Azure Databricks
→ High Level Architecture of Databricks
→ Different Cluster Creation Modes
→ Different Types of Tables in Databricks
→ Iceberg Managed Tables in Databricks
→ Magic Commands in Databricks
→ Databricks Utilities
→ Lakehouse Architecture
→ Delta Lake in Depth
→ Volumes
→ Databricks Copy Into
→ Autoloader
→ Lakeflow Declarative Pipelines (Earlier called as DLT)
→ Implementing a Medallion Architecture
→ Governance using Unity Catalog
→ Lakeflow Connect
→ Lakeflow Jobs
→ Deployment – Databricks Asset Bundles
Milestone 3
Week 25 : Data Modeling
Week 26 - 27 : Spark Structured Streaming (2 Weeks)
Week 28 : Apache Kafka
Week 29 : Big Data on AWS Cloud - Athena & EC2
Week 30 : Big Data on AWS Cloud - EMR
Week 31 : Big Data on AWS Cloud - GLUE
Week 32 : Big Data on AWS Cloud - Redshift, Lambda, S3

Milestone-3 Power Modules to Kickstart your Career


AWS Project Pipeline

You might also like