Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
9 views3 pages

Complete Data Analyst Data Science Interview QA Diksha

The document is a comprehensive Q&A guide for Data Analyst and Data Scientist roles, covering self-introduction, technical skills, machine learning concepts, BI tools, project experiences, behavioral questions, and scenario-based inquiries. It includes specific questions and detailed answers on topics such as SQL, Python, data visualization, and project management. The guide aims to prepare candidates for interviews by providing insights into common questions and effective responses.

Uploaded by

dikshadatir20
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views3 pages

Complete Data Analyst Data Science Interview QA Diksha

The document is a comprehensive Q&A guide for Data Analyst and Data Scientist roles, covering self-introduction, technical skills, machine learning concepts, BI tools, project experiences, behavioral questions, and scenario-based inquiries. It includes specific questions and detailed answers on topics such as SQL, Python, data visualization, and project management. The guide aims to prepare candidates for interviews by providing insights into common questions and effective responses.

Uploaded by

dikshadatir20
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Complete Q&A Guide for Data Analyst

and Data Scientist Roles


1. Self-Introduction & Gap Explanation
Q: Tell me about yourself.
A: I’m Diksha Datir, an IT engineering graduate with skills in Python, SQL, Power BI,
Tableau, and ML. I’ve worked on projects like Credit Risk Modeling and IPL dashboards, and
interned at Blinkit analyzing delivery data. I’m eager to contribute to a data-driven team.

Q: You graduated in 2023. What have you done during the 2-year gap?
A: I used the time to upskill myself through certifications (IBM, HCIA-AI), internships, and
project work. I focused on Python, SQL, machine learning, and BI tools and completed
several real-world datasets and dashboards to stay current and hands-on.

2. Technical (SQL, Python, Excel)


Q: What is the difference between INNER JOIN and LEFT JOIN?
A: INNER JOIN returns matching rows from both tables; LEFT JOIN returns all rows from the
left table and matched rows from the right.

Q: How do you find duplicate values in a SQL table?


A: By using GROUP BY with HAVING COUNT(*) > 1 on the relevant column.

Q: How do you handle missing values in Python?


A: Use Pandas functions like dropna(), fillna(), or imputation strategies such as mean,
median, or domain-specific methods.

Q: What are list, tuple, set, and dictionary in Python?


A: List: ordered & mutable; Tuple: ordered & immutable; Set: unordered & unique; Dict:
key-value pair mapping.

3. Machine Learning
Q: What’s the difference between supervised and unsupervised learning?
A: Supervised uses labeled data (e.g., classification); unsupervised finds patterns in
unlabeled data (e.g., clustering).
Q: What is overfitting and how do you prevent it?
A: Overfitting is when a model learns noise instead of patterns. Prevent using cross-
validation, pruning, regularization, or simpler models.

Q: Explain the confusion matrix.


A: It shows TP, TN, FP, FN — helps calculate accuracy, precision, recall, and F1-score for
classification models.

Q: Which algorithms have you worked with?


A: Logistic Regression, Random Forest, Gradient Boosting, Linear Regression, K-Means,
Decision Trees.

4. BI Tools & Data Visualization


Q: Difference between Tableau and Power BI?
A: Tableau offers deeper customization; Power BI is cost-effective and better with Microsoft
tools. I’ve used both in projects.

Q: What visuals would you use for sales over time?


A: Line chart for trends, bar chart for categories. Also area charts if needed for volume.

Q: What is DAX in Power BI?


A: DAX (Data Analysis Expressions) is a formula language to create calculated columns and
KPIs in Power BI.

5. Project-Based Questions
Q: Explain your Blinkit internship project.
A: I analyzed 50,000+ delivery records using Python, built dashboards in Power BI/Tableau,
and delivered insights that improved delivery efficiency by 15%.

Q: Describe your Credit Risk Modeling project.


A: Built a classification model using Logistic Regression, Random Forest, GBM. Evaluated
with ROC-AUC, precision, recall. Used cross-validation for robustness.

Q: What was the purpose of the IPL Dashboard?


A: To visualize team/player stats over seasons. Used Power Query for data prep and DAX for
custom KPIs like win %, strike rate.
6. Behavioral & Situational
Q: Tell me about a challenge you faced in a project.
A: In the rainfall prediction project, model accuracy was low. I improved it by tuning
hyperparameters and using MLflow for tracking.

Q: How do you prioritize tasks when under pressure?


A: I break them into smaller parts, use priority matrices, and focus on business-critical
deliverables first.

Q: How do you explain technical findings to a non-technical audience?


A: I simplify language, use visualizations, and focus on what the data means for business
goals, not the technical jargon.

7. Scenario-Based Questions
Q: Sales dropped suddenly — how do you analyze?
A: Check product trends, customer behavior, marketing activity, web traffic, and competitor
pricing. Use SQL and dashboards for investigation.

Q: How would you build a dashboard for management?


A: Understand key KPIs, use filters, DAX/measures, ensure interactivity, and focus on visual
storytelling.

You might also like