7-Month Intensive Course
Validity - 2 Years
Key Tools and Technologies Covered in this Program
MICROSOFT AZURE | AZURE DATA FACTORY
PYSPARK | AZURE | AWS
END-TO-END PROJECTS
CURRICULUM
Milestone 1 - Distributed Processing Fundamentals & PySpark
Week 1 : Big Data - The Big Picture
Week 2 : Distributed Storage & Data Lake
Week 3 : Distributed Processing Fundamentals
Week 4 : Apache Spark Core APIs
Week 5 : Spark APIs - DataFrames & Spark SQL
Week 6 : Spark DataFrame Transformations
Week 7 : Apache Spark Caching In-depth
Week 8 : Apache Spark Architecture
Week 9 : Apache Spark Internals
Week 10 : Apache Spark Optimizations
Week 11 : More on Spark Optimizations
Week 12 : Git, GitHub & CI/CD
Week 13 : Apache Hive
Milestone-1 Power Modules to Kickstart your Career
Apache Spark Project
Apache Spark Interview Questions
Resume, LinkedIn & Naukri Profile Building
Data Structures & Algorithms
Milestone 2
Week 14 : Azure Cloud Fundamentals
Week 15 - 22 : Azure Databricks In-Depth (8 Weeks)
Week 23 - 24 : Azure Data Factory (2 Weeks)
Milestone-2 Power Modules to Kickstart your Career
Azure Cloud Capstone Project
Azure Interview Questions
Overview of the Azure Databricks In-Depth (8 Weeks) Module
→ What is Databricks and Why Databricks
→ Databricks Free Edition vs Azure Databricks
→ High Level Architecture of Databricks
→ Different Cluster Creation Modes
→ Different Types of Tables in Databricks
→ Iceberg Managed Tables in Databricks
→ Magic Commands in Databricks
→ Databricks Utilities
→ Lakehouse Architecture
→ Delta Lake in Depth
→ Volumes
→ Databricks COPY INTO
→ Auto Loader
→ Lakeflow Declarative Pipelines (formerly Delta Live Tables, DLT)
→ Implementing a Medallion Architecture
→ Governance using Unity Catalog
→ Lakeflow Connect
→ Lakeflow Jobs
→ Deployment – Databricks Asset Bundles
Milestone 3
Week 25 : Data Modeling
Week 26 - 27 : Spark Structured Streaming (2 Weeks)
Week 28 : Apache Kafka
Week 29 : Big Data on AWS Cloud - Athena & EC2
Week 30 : Big Data on AWS Cloud - EMR
Week 31 : Big Data on AWS Cloud - Glue
Week 32 : Big Data on AWS Cloud - Redshift, Lambda, S3
Milestone-3 Power Modules to Kickstart your Career
AWS Project Pipeline