Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
Shaik - Advanced Data Engineer Interview Guide
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
Shaik - Advanced Data Engineer Interview Guide
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
Shaik - Advanced Data Engineer Interview Guide
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
Shaik - Advanced Data Engineer Interview Guide
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
Shaik - Advanced Data Engineer Interview Guide
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
Shaik - Advanced Data Engineer Interview Guide
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
Shaik - Advanced Data Engineer Interview Guide
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
Shaik - Advanced Data Engineer Interview Guide
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
Shaik - Advanced Data Engineer Interview Guide
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
Shaik - Advanced Data Engineer Interview Guide
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
Shaik - Advanced Data Engineer Interview Guide
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
Shaik - Advanced Data Engineer Interview Guide
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
Shaik - Advanced Data Engineer Interview Guide
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
Shaik - Advanced Data Engineer Interview Guide
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
Shaik - Advanced Data Engineer Interview Guide
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
Shaik - Advanced Data Engineer Interview Guide
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
Shaik - Advanced Data Engineer Interview Guide
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
Shaik - Advanced Data Engineer Interview Guide
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
Shaik - Advanced Data Engineer Interview Guide
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
Shaik - Advanced Data Engineer Interview Guide
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
Shaik - Advanced Data Engineer Interview Guide
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
Shaik - Advanced Data Engineer Interview Guide
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
Shaik - Advanced Data Engineer Interview Guide
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
Shaik - Advanced Data Engineer Interview Guide
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
Shaik - Advanced Data Engineer Interview Guide
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Shaik - Advanced Data Engineer Interview Guide
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
Shaik - Advanced Data Engineer Interview Guide
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
Shaik - Advanced Data Engineer Interview Guide
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
Shaik - Advanced Data Engineer Interview Guide
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Shaik - Advanced Data Engineer Interview Guide
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
Shaik - Advanced Data Engineer Interview Guide
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Shaik - Advanced Data Engineer Interview Guide
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
Shaik - Advanced Data Engineer Interview Guide
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.
Python - File Handling / API / Logic
Q: What is the difference between a list and a tuple in Python?
A: Lists are mutable, whereas tuples are immutable. Lists have methods like append(), while tuples do not.
Python - File Handling / API / Logic
Q: How do you handle exceptions in Python?
A: Using try-except blocks. You can also use finally for cleanup and raise to throw exceptions.
Python - File Handling / API / Logic
Q: Write a Python script to read a file and count word frequencies.
A: Use open(), read(), split(), and collections.Counter to count word frequencies in a file.
Python - File Handling / API / Logic
Q: How do you create a REST API in Flask?
A: Use Flask decorators like @app.route and methods=['GET', 'POST']. Use jsonify() to return responses.
SQL - Joins / Windows / Performance
Shaik - Advanced Data Engineer Interview Guide
Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns only matching rows; LEFT JOIN returns all rows from the left table with matched rows from the
right.
SQL - Joins / Windows / Performance
Q: Write a SQL query to get the second highest salary.
A: SELECT MAX(salary) FROM employees WHERE salary < (SELECT MAX(salary) FROM employees);
SQL - Joins / Windows / Performance
Q: What is a window function in SQL?
A: Window functions perform calculations across a set of rows related to the current row, e.g., ROW_NUMBER, RANK,
OVER().
SQL - Joins / Windows / Performance
Q: How do you optimize a SQL query for large datasets?
A: Use indexes, avoid SELECT *, use proper WHERE clauses, analyze execution plans.
ETL - SSIS / ADF / Data Engineering
Q: What are the main components of Azure Data Factory?
A: Pipelines, Datasets, Linked Services, and Integration Runtime.
ETL - SSIS / ADF / Data Engineering
Q: How do you use SSIS to load data from multiple files?
A: Use a Foreach Loop Container with expressions and a Data Flow Task.
ETL - SSIS / ADF / Data Engineering
Q: What is the difference between Lookup and Merge Join in SSIS?
A: Lookup is for referencing small reference data; Merge Join combines sorted datasets based on join keys.
ETL - SSIS / ADF / Data Engineering
Q: How do you handle schema drift in ADF?
A: Use schema mapping and dynamic datasets, or parameterize schema structures using JSON settings.