Data Engineering Syllabus1 PDF

The document outlines an advanced graduate program in Data Engineering designed by former Google employees, featuring a job-assured syllabus for 2024. It covers topics including Python, SQL, PySpark, AWS concepts, and Git, aimed at preparing students for careers in data engineering. Contact information and links for further inquiries are also provided.

Uploaded by email.yashdaga81
Copyright: © All Rights Reserved

ADVANCE GRADUATE PROGRAMME

DATA ENGINEERING

*DESIGNED AND CURATED BY EX-GOOGLE EMPLOYEES IN THE UK. READY-TO-WORK SYLLABUS.

[email protected] +91-8618519825 | +44 7894574003 www.browsejobs.in


ADVANCE PROGRAMME IN DATA ENGINEERING

JOB ASSURED PROGRAMME

2024

Python Basics
Python Intro, Comments, Variables, Data Types, Numbers, Casting, Strings,
Booleans, Operators, Lists, Tuples, Sets, Dictionaries, If Else,
While Loop | For Loops, Functions, Lambda Functions

Pandas
Intro, Series, DataFrames, Read CSV, Read JSON, Analyse Data, Cleaning Data,
Cleaning Empty Cells, Cleaning Wrong Format, Cleaning Wrong Data,
Removing Duplicates, Pandas Correlations, Pandas Plotting

Python Advanced
Arrays, Classes | Objects, Inheritance, Iterators, Polymorphism, Scope,
Modules, Dates, Math, JSON, RegEx, PIP, Try Except, User Input,
File Handling (Read, Write & Delete Files)

SQL Advanced
Join, Left Join, Right Join, Self Join, Group By, Having, Exists, Case,
Stored Procedures, Operators, Create DB | Table, Drop DB | Table,
Alter DB | Table, Primary Key, Foreign Key, Views

PySpark
Features, Advantages, Modules and Packages, Cluster Managers, Installation,
Architecture, SparkSession, SparkContext, RDD, Parallelize,
Repartition or Coalesce, Broadcast Variables, Accumulator

SQL
Intro, Syntax, Select, Select Distinct, Where, Order By, And | Or | Not,
Insert Into, Null Values, Update, Delete, Select Top, Min and Max,
Count | Sum | Avg, Like | Wildcards, In | Between, Aliases

PySpark DataFrame
Create an Empty DataFrame, Convert RDD to DF, Convert DF to Pandas, show(),
StructType and StructField, Column Class, select(), collect(), withColumn(),
withColumnRenamed(), where() & filter(), drop() & dropDuplicates(),
orderBy() and sort(), groupBy(), join()

PySpark DataFrame (cont)
union() & unionAll(), unionByName(), UDF, transform(), apply(), map(),
flatMap(), foreach(), sample() vs sampleBy(), fillna() & fill(), pivot(),
partitionBy(), MapType

PySpark SQL Functions
Aggregate Functions, Window Functions, Date and Timestamp Functions,
JSON Functions

AWS Concepts
Intro, Cloud Computing, Benefits, EC2, EC2 Instance Types, EC2 Pricing,
EC2 Scaling, EC2 Auto Scaling, AWS Load Balancing, AWS Lambda,
AWS Containers, AWS Availability Zones, AWS CloudFormation,
AWS Elastic Beanstalk, AWS S3, AWS RDS, AWS Redshift, IAM, CloudWatch,
AWS Cloud Compliance

Apache Airflow
Fundamental Concepts, Working with TaskFlow, Building a Running Pipeline,
Object Storage

GIT
Intro, New Files, Staging Environment, Commit, Help, Branch, Branch Merge,
Pull from GitHub, Push to GitHub, GitHub Branch, Pull Branch from GitHub,
Push Branch to GitHub, GitHub Flow, GitHub Fork, Git Clone from GitHub,
Git Ignore, Revert | Reset | Amend

INDIA UK UAE

INDIA

+91 7847006048 | +91 9861163654 | +91 8431814749 | +91 8618519825

UK

+44 7825 904092 | +44 7894574003

[email protected]

www.browsejobs.in
