Shaik 200 Questions Data Engineer Interview Guide

The document is an interview guide for advanced data engineers, covering key topics in Python, SQL, and ETL processes. It includes questions and answers on file handling, API creation, SQL joins, window functions, and Azure Data Factory components. The guide emphasizes practical skills such as exception handling, data loading techniques, and query optimization.

Uploaded by

smasif6565

Shaik - Advanced Data Engineer Interview Guide

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have in-place methods such as append(); tuples provide only non-mutating methods such as count() and index().
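
A minimal sketch of the difference, using made-up sample values. It shows that a list accepts in-place changes, that item assignment on a tuple raises TypeError, and that a tuple can be used as a dictionary key.

```python
# Lists can be changed in place; tuples cannot.
numbers_list = [1, 2, 3]
numbers_list.append(4)          # in-place mutation works on a list

numbers_tuple = (1, 2, 3)
try:
    numbers_tuple[0] = 99       # item assignment is not supported on a tuple
    tuple_was_modified = True
except TypeError:
    tuple_was_modified = False

# A tuple of hashable items is itself hashable, so it can be a dict key.
# The coordinate pair below is illustrative sample data.
coordinates = {(40.7, -74.0): "New York"}
```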

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Use try/except blocks. Add a finally clause for cleanup that must always run, and use raise to throw exceptions.
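
The three keywords can be sketched together as follows. The function names parse_positive_int and safe_parse, and the use of a list as a log, are hypothetical choices made for illustration only.

```python
def parse_positive_int(text):
    """Convert text to a positive integer, raising ValueError otherwise."""
    value = int(text)                      # int() raises ValueError on bad input
    if value <= 0:
        # raise throws an exception explicitly
        raise ValueError(f"expected a positive integer, got {value}")
    return value


def safe_parse(text, log):
    """Return the parsed value, or None if parsing fails."""
    try:
        return parse_positive_int(text)
    except ValueError as exc:
        log.append(f"error: {exc}")        # handle only the expected error type
        return None
    finally:
        log.append("done")                 # runs whether or not an error occurred
```

Catching a specific exception type, rather than a bare except, keeps unrelated bugs from being silently swallowed.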

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
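
A minimal sketch of that approach. The function name count_words is hypothetical, and lowercasing the text is an added assumption so that "The" and "the" count as the same word; punctuation is not stripped.

```python
from collections import Counter


def count_words(path):
    """Return a Counter mapping each whitespace-separated word to its count."""
    with open(path, encoding="utf-8") as handle:
        # read() loads the whole file; split() breaks on any whitespace
        words = handle.read().lower().split()
    return Counter(words)
```

Calling count_words(path).most_common(10) would then list the ten most frequent words. For files too large to read at once, updating the Counter line by line is a common alternative.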

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Decorate view functions with @app.route, passing methods=['GET', 'POST'] to declare the allowed HTTP methods. Use jsonify() to return JSON responses.
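
A small sketch of such an endpoint, assuming Flask is installed. The /items route, the items_endpoint function, and the in-memory items list are hypothetical names chosen for illustration; a real service would use persistent storage and input validation.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)
items = []  # hypothetical in-memory store, reset whenever the process restarts


@app.route("/items", methods=["GET", "POST"])
def items_endpoint():
    if request.method == "POST":
        # silent=True returns None instead of raising on a missing or invalid JSON body
        payload = request.get_json(silent=True) or {}
        items.append(payload)
        return jsonify(payload), 201       # 201 Created
    return jsonify(items=items)            # GET lists everything stored so far
```

The endpoint can be exercised without starting a server through app.test_client(), which is convenient for unit tests.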

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the right.
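
The difference can be seen on a tiny in-memory SQLite database driven from Python. The employees and departments tables and their rows are made-up sample data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (id INTEGER, name TEXT, dept_id INTEGER);
    CREATE TABLE departments (id INTEGER, name TEXT);
    INSERT INTO employees VALUES (1, 'Ana', 10), (2, 'Ben', 20), (3, 'Cy', NULL);
    INSERT INTO departments VALUES (10, 'Data'), (30, 'Ops');
""")

# INNER JOIN keeps only employees whose dept_id matches a department.
inner_rows = conn.execute("""
    SELECT e.name, d.name
    FROM employees e
    INNER JOIN departments d ON e.dept_id = d.id
    ORDER BY e.id
""").fetchall()

# LEFT JOIN keeps every employee; unmatched department columns come back as NULL.
left_rows = conn.execute("""
    SELECT e.name, d.name
    FROM employees e
    LEFT JOIN departments d ON e.dept_id = d.id
    ORDER BY e.id
""").fetchall()
```

Here inner_rows holds one row (Ana), while left_rows holds all three employees, with None in the department column for Ben and Cy.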

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
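
A quick way to check this query is to run it against an in-memory SQLite table from Python. The sample rows are made up; note that the top salary appears twice, which the query handles because it compares values rather than row positions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [("Ana", 90000), ("Ben", 120000), ("Cy", 120000), ("Dee", 75000)],
)

# Highest salary strictly below the overall maximum.
second_highest = conn.execute("""
    SELECT MAX(salary) FROM employees
    WHERE salary < (SELECT MAX(salary) FROM employees)
""").fetchone()[0]

# With only one distinct salary there is no second highest, so MAX yields NULL.
conn.execute("DELETE FROM employees WHERE salary < 120000")
no_second = conn.execute("""
    SELECT MAX(salary) FROM employees
    WHERE salary < (SELECT MAX(salary) FROM employees)
""").fetchone()[0]
```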

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

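A: Window functions perform calculations across a set of rows related to the current row without collapsing them into one row, e.g., ROW_NUMBER() and RANK(). The OVER() clause defines the window, optionally with PARTITION BY and ORDER BY.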
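
A short sketch using Python's sqlite3 module, assuming the bundled SQLite is version 3.25 or later (the first release with window functions). The table and rows are made-up sample data; two employees tie on salary to show how ROW_NUMBER and RANK differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Ana", "Data", 90000), ("Ben", "Data", 120000),
     ("Cy", "Data", 120000), ("Dee", "Ops", 75000)],
)

# Each row is kept; the window functions add a per-department position.
rows = conn.execute("""
    SELECT name, dept, salary,
           ROW_NUMBER() OVER (PARTITION BY dept ORDER BY salary DESC, name) AS row_num,
           RANK()       OVER (PARTITION BY dept ORDER BY salary DESC)       AS salary_rank
    FROM employees
    ORDER BY dept, row_num
""").fetchall()
```

ROW_NUMBER assigns 1, 2, 3 within the Data department, while RANK gives the two tied salaries rank 1 and skips to 3 for the next row.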

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?


A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
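
As a small illustration of indexing and reading an execution plan, the sketch below uses SQLite's EXPLAIN QUERY PLAN from Python. The orders table, the idx_orders_customer index, and the plan helper are hypothetical; other database engines expose plans through their own commands and the output format differs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(i % 100, i * 1.5) for i in range(1000)],
)

# Select only the needed columns and filter with a WHERE clause.
query = "SELECT id, amount FROM orders WHERE customer_id = ?"


def plan(sql, params):
    """Return SQLite's plan description for a query as a single string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return " | ".join(row[-1] for row in rows)   # last column holds the detail text


plan_before = plan(query, (42,))                 # full table scan expected
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
plan_after = plan(query, (42,))                  # index search expected
```

Comparing plan_before and plan_after shows the planner switching from scanning the whole table to searching the new index.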

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Activities, Datasets, Linked Services, Triggers, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

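A: Use a Foreach Loop Container with the Foreach File enumerator, map each file path to a variable, set the connection manager's ConnectionString with an expression that uses that variable, and place a Data Flow Task inside the loop to load each file.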

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
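
The two strategies can be compared with a loose analogy in plain Python. This is not SSIS code: lookup_join and merge_join are hypothetical helpers, rows are (key, value) pairs, and merge_join assumes both inputs are already sorted by key with unique keys.

```python
def lookup_join(rows, reference):
    """Cache the small reference set in a dict, then probe it for every row."""
    cache = {key: value for key, value in reference}
    # Unmatched rows are kept here with None; this is a choice made for the sketch.
    return [(key, payload, cache.get(key)) for key, payload in rows]


def merge_join(left, right):
    """Inner join of two key-sorted inputs by advancing two cursors in step."""
    result, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        left_key, right_key = left[i][0], right[j][0]
        if left_key == right_key:
            result.append((left_key, left[i][1], right[j][1]))
            i += 1
            j += 1
        elif left_key < right_key:
            i += 1                      # left key has no partner yet; advance left
        else:
            j += 1                      # right key has no partner yet; advance right
    return result
```

The lookup version needs the reference data to fit in memory but accepts input in any order; the merge version streams both sides but depends on the sort order.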

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: In mapping data flows, enable the "Allow schema drift" option. In copy pipelines, use schema mapping with dynamic datasets, or parameterize the schema structure using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
Shaik - Advanced Data Engineer Interview Guide

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?


Shaik - Advanced Data Engineer Interview Guide

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance


Shaik - Advanced Data Engineer Interview Guide

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?


Shaik - Advanced Data Engineer Interview Guide

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering


Shaik - Advanced Data Engineer Interview Guide

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().
Shaik - Advanced Data Engineer Interview Guide

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic


Shaik - Advanced Data Engineer Interview Guide

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
Shaik - Advanced Data Engineer Interview Guide

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
Shaik - Advanced Data Engineer Interview Guide

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Shaik - Advanced Data Engineer Interview Guide

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?


Shaik - Advanced Data Engineer Interview Guide

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?


Shaik - Advanced Data Engineer Interview Guide

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.


Shaik - Advanced Data Engineer Interview Guide

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering


Shaik - Advanced Data Engineer Interview Guide

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance


Shaik - Advanced Data Engineer Interview Guide

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic


Shaik - Advanced Data Engineer Interview Guide

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.
Shaik - Advanced Data Engineer Interview Guide

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
Shaik - Advanced Data Engineer Interview Guide

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

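A: Lists are mutable, whereas tuples are immutable. Lists have mutating methods like append(), while tuples offer only count() and index(). Because they cannot change, tuples of hashable items can be used as dictionary keys.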
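
A few lines are enough to show the difference. The variable names below are arbitrary and chosen only for illustration.

```python
point_list = [1, 2]
point_list.append(3)  # lists can grow and change in place

point_tuple = (1, 2)
try:
    point_tuple[0] = 99  # tuples reject item assignment
except TypeError as exc:
    error = str(exc)

# Tuples of hashable items are hashable, so they can serve as dict keys.
distances = {(0, 0): 0.0, (3, 4): 5.0}

print(point_list)         # [1, 2, 3]
print(error)
print(distances[(3, 4)])  # 5.0
```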

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Shaik - Advanced Data Engineer Interview Guide

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?


Shaik - Advanced Data Engineer Interview Guide

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.


Shaik - Advanced Data Engineer Interview Guide

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?


Shaik - Advanced Data Engineer Interview Guide

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering


Shaik - Advanced Data Engineer Interview Guide

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance


Shaik - Advanced Data Engineer Interview Guide

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic


Shaik - Advanced Data Engineer Interview Guide

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.


Shaik - Advanced Data Engineer Interview Guide

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.
Shaik - Advanced Data Engineer Interview Guide

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Shaik - Advanced Data Engineer Interview Guide

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?


Shaik - Advanced Data Engineer Interview Guide

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
Shaik - Advanced Data Engineer Interview Guide

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?


Shaik - Advanced Data Engineer Interview Guide

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering


Shaik - Advanced Data Engineer Interview Guide

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?


Shaik - Advanced Data Engineer Interview Guide

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic


Shaik - Advanced Data Engineer Interview Guide

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
Shaik - Advanced Data Engineer Interview Guide

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance


Shaik - Advanced Data Engineer Interview Guide

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Shaik - Advanced Data Engineer Interview Guide

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?


Shaik - Advanced Data Engineer Interview Guide

A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Datasets, Linked Services, and Integration Runtime.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
Shaik - Advanced Data Engineer Interview Guide

SQL - Joins / Windows / Performance

Q: What is the difference between INNER JOIN and LEFT JOIN?

A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the

right.

SQL - Joins / Windows / Performance

Q: Write a SQL query to get the second highest salary.

A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);

SQL - Joins / Windows / Performance

Q: What is a window function in SQL?

A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,

OVER().

SQL - Joins / Windows / Performance

Q: How do you optimize a SQL query for large datasets?

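A: Index the columns used in filters and joins, select only the columns you need instead of SELECT *, filter early with selective WHERE clauses, and analyze execution plans to confirm the indexes are actually used.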
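
To make "analyze execution plans" concrete, the sketch below compares SQLite's plan for the same query before and after adding an index, using Python's built-in sqlite3 module. The table, the index name idx_employees_dept, and the data are hypothetical, and other databases expose plans through their own commands and formats.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (id INTEGER PRIMARY KEY, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees (dept, salary) VALUES (?, ?)",
    [(f"dept_{i % 50}", i) for i in range(5000)],
)

# Select only the needed column and filter on dept.
QUERY = "SELECT salary FROM employees WHERE dept = ?"


def query_plan(connection) -> str:
    """Return the plan text; the detail is the last column of each plan row."""
    plan_rows = connection.execute("EXPLAIN QUERY PLAN " + QUERY, ("dept_7",)).fetchall()
    return " | ".join(row[-1] for row in plan_rows)


plan_before = query_plan(conn)   # expected to be a full table scan
conn.execute("CREATE INDEX idx_employees_dept ON employees (dept)")
plan_after = query_plan(conn)    # expected to search via the new index
conn.close()

print("before:", plan_before)
print("after: ", plan_after)
```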

ETL - SSIS / ADF / Data Engineering

Q: What are the main components of Azure Data Factory?

A: Pipelines, Activities, Datasets, Linked Services, Triggers, and Integration Runtimes.

ETL - SSIS / ADF / Data Engineering

Q: How do you use SSIS to load data from multiple files?

A: Use a Foreach Loop Container with expressions and a Data Flow Task.

ETL - SSIS / ADF / Data Engineering

Q: What is the difference between Lookup and Merge Join in SSIS?

A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.

ETL - SSIS / ADF / Data Engineering

Q: How do you handle schema drift in ADF?

A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.

Python - File Handling / API / Logic

Q: What is the difference between a list and a tuple in Python?

A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.

Python - File Handling / API / Logic

Q: How do you handle exceptions in Python?


Shaik - Advanced Data Engineer Interview Guide

A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.

Python - File Handling / API / Logic

Q: Write a Python script to read a file and count word frequencies.

A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.

Python - File Handling / API / Logic

Q: How do you create a REST API in Flask?

A: Define routes with the @app.route decorator, passing methods=['GET', 'POST'] to set the allowed HTTP methods. Use jsonify() to return JSON responses.
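A minimal sketch of such an API, assuming Flask is installed. The /items route, the in-memory items list, and the payload shape are made up for illustration.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)
items = []  # in-memory store for the sketch; a real service would use a database

@app.route("/items", methods=["GET", "POST"])
def items_endpoint():
    if request.method == "POST":
        # silent=True yields None instead of an error on a missing or invalid body.
        payload = request.get_json(silent=True) or {}
        if "name" not in payload:
            return jsonify(error="name is required"), 400
        item = {"id": len(items) + 1, "name": payload["name"]}
        items.append(item)
        return jsonify(item), 201
    # GET: return everything stored so far.
    return jsonify(items=items)
```

Saved as a module, this can be served locally with the flask run command, or exercised without a server through app.test_client().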
