DSP 51 Mock Test II

The document contains a mock test with multiple-choice questions and subjective questions related to linear regression, model building, text analytics, and ROC curves. It also includes coding problems focused on decision trees and KMeans clustering using datasets. The questions assess understanding of statistical concepts, data preprocessing, and machine learning techniques.

Uploaded by

jhonnybhai888

MOCK TEST II

MCQs
You are given a multiple linear regression model: Y = β0 + β1x1 + β2x2 + β3x3
For each predictor, the null hypothesis states that the variable is insignificant,
i.e., that its coefficient βi = 0. If you fail to reject the null hypothesis, you
treat the predictor as insignificant.
For example, if you fail to reject the null hypothesis for x1, you treat x1 as
insignificant: the data give no evidence that β1 differs from 0.
If the null hypothesis is rejected, you conclude that βi ≠ 0.
Questions 1 and 2 refer to the passage above.
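
As an illustration of the test described above, the following is a minimal sketch that fits the model by ordinary least squares on simulated data and computes a t-statistic (estimate divided by standard error) for each coefficient. The data, the seed, and the choice that only x3 affects Y are assumptions made for this example. For large samples, |t| above roughly 2 corresponds to rejecting βi = 0 at about the 5% level.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
X = rng.normal(size=(n, 3))                    # columns play the role of x1, x2, x3
y = 1.0 + 2.0 * X[:, 2] + rng.normal(size=n)   # assumption: only x3 truly affects Y

# Design matrix with a column of ones for the intercept beta0.
A = np.column_stack([np.ones(n), X])
beta, *_ = np.linalg.lstsq(A, y, rcond=None)

# Standard errors from the residual variance and (A'A)^-1.
residuals = y - A @ beta
dof = n - A.shape[1]
sigma2 = residuals @ residuals / dof
se = np.sqrt(np.diag(sigma2 * np.linalg.inv(A.T @ A)))

# t-statistic for H0: beta_i = 0.
t_stats = beta / se
for name, b, t in zip(["beta0", "beta1", "beta2", "beta3"], beta, t_stats):
    print(f"{name}: estimate={b:.3f}, t={t:.2f}")
```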

Question 1
If β1=β2=0 holds and β3 = 0 fails to hold, then what can you conclude?
A. There is a high correlation between x1 and x2.
B. There is a linear relationship between the outcome variable (Y) and x3.
C. There is a linear relationship between the outcome variable and x1, x2.

Question 2
If β1 = β2 = β3 = 0 holds true, what can you conclude?
A. There is no linear relationship between Y and any of the three
independent variables.
B. There is a linear relationship between Y and all three independent
variables.
C. There is a linear relationship among x1, x2 and x3.

Question 3
Suppose you need to build a model on a dataset that contains 2 categorical
variables with 2 and 4 levels, respectively. How many dummy variables should
you create for model building?
A. 4
B. 5
C. 6
D. 8
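
The dummy-variable encoding that this question refers to can be sketched with pandas. The DataFrame below is made up for illustration (its column names and level counts are assumptions and deliberately differ from the question). With `drop_first=True`, a categorical variable with k levels produces k - 1 dummy columns, because the dropped level is implied when all the others are 0.

```python
import pandas as pd

# Hypothetical data: "plan" has 3 levels, "region" has 2 levels.
df = pd.DataFrame({
    "plan": ["basic", "standard", "premium", "basic"],
    "region": ["north", "south", "north", "south"],
})

# drop_first=True keeps k - 1 dummy columns per categorical variable.
dummies = pd.get_dummies(df, drop_first=True)
print(list(dummies.columns))
print("number of dummy variables:", dummies.shape[1])
```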

Question 4
In a dataset with mean 50 and standard deviation 12, what will be the value of a
variable with an initial value of 20 after you standardise it?
A. 1.9
B. -1.9
C. 2.5
D. -2.5
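
Standardisation subtracts the mean and divides by the standard deviation, z = (x - mean) / std. A minimal sketch follows; the function name and the numbers are chosen for illustration and are not the values from the question.

```python
def standardise(x, mean, std):
    """Return the z-score of x given the mean and standard deviation."""
    if std <= 0:
        raise ValueError("std must be positive")
    return (x - mean) / std

# Illustrative values only: (70 - 60) / 8
z = standardise(70, mean=60, std=8)
print(z)
```

A value below the mean gives a negative z-score, and a value equal to the mean gives 0.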

Question 5

Which of the following variables are negatively correlated with the target variable
based on the summary statistics report given above? (More than one option may
be correct.)
A. Tenure
B. TotalCharges
C. MonthlyCharges
D. TechSupport_Yes

Subjective Questions
1. Before doing text analytics, the text must be cleaned. There are three kinds of
words present in any text corpus. What are they? Give two reasons why they must be
removed.
2. NLTK provides different types of tokenisers that you can use in different
applications. Briefly explain what they are and when each should be used.
3. Why can’t linear regression be used in place of logistic regression for binary
classification?
4. Developing hypotheses will be a key part of your job role as a data scientist
when you're working on real-world problems. You need to bring all your domain
knowledge to the forefront and try to identify the potential root causes of the
given problem. Develop hypotheses for the following question: "What factors
contribute most significantly to customer churn in a subscription-based streaming
service?" (e.g., Netflix, Amazon Prime, etc.)
5. ROC stands for Receiver Operating Characteristic. The name emerged from
electrical engineering around the Second World War, when electrical and radar
engineers used such curves to detect enemy planes. Since then, the concept has
found applications in many fields, machine learning being one of the latest.
"What is the significance of the ROC curve in Logistic Regression, and how does
it help in evaluating the model's performance?"
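
For question 5, the ROC curve plots the true positive rate against the false positive rate as the classification threshold varies, and the area under it (AUC) summarises performance across all thresholds. The sketch below computes both with scikit-learn for a logistic regression model. The synthetic dataset, the seed, and the train/test split are assumptions for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score, roc_curve
from sklearn.model_selection import train_test_split

# Synthetic binary classification data (illustrative only).
X, y = make_classification(n_samples=500, n_features=5, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

model = LogisticRegression().fit(X_train, y_train)
probs = model.predict_proba(X_test)[:, 1]   # predicted probability of class 1

# One (FPR, TPR) point per candidate threshold on the predicted probability.
fpr, tpr, thresholds = roc_curve(y_test, probs)
auc = roc_auc_score(y_test, probs)
print(f"AUC = {auc:.3f}")
```

An AUC near 0.5 indicates a model no better than random guessing, while an AUC near 1 indicates that positives are almost always scored above negatives.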

Coding Problems
1. Decision Tree - Bank Marketing Dataset
Description
You are given the 'Portuguese Bank' marketing dataset which contains data about a
telemarketing campaign run by the bank to sell a product (term deposit - a type of
investment product).

Each row represents a 'prospect' to whom phone calls were made to sell the product.
There are various attributes describing the prospects, such as age, profession,
education level, previous loans taken by the person etc. Finally, the target variable
is 'purchased' (1/0), 1 indicating that the person had purchased the product. A sample
of the training data is attached below (note that 'id' shouldn't be used to train the
model) :

[Sample of the training data: table not recoverable from the source]

As an analyst, you want to predict whether a person will purchase the product or not.
This will help the bank reduce their marketing costs since one can then target only the
prospects who are likely to buy. Build a decision tree with default
hyperparameters to predict whether a person will buy the product or not. You have
to write the predictions to the file bank_predictions.csv in the following format (note the
column names carefully):
bank_predicted id
0 2041
1 399
0 1400
0 3709
1 2111
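
One possible shape for a solution is sketched below. Because the real data files are not included here, the sketch uses tiny made-up train and test frames; the `age` and `job` columns and all values are hypothetical stand-ins. Only the target name `purchased`, the exclusion of `id` from training, the output file name, and the output column names come from the problem statement. A fixed `random_state` is added purely for repeatability; every other hyperparameter is left at its default.

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

# Hypothetical stand-ins for the real training and test files.
train = pd.DataFrame({
    "id": [1, 2, 3, 4, 5, 6],
    "age": [25, 40, 33, 58, 47, 29],
    "job": ["admin", "technician", "admin", "retired", "technician", "student"],
    "purchased": [0, 1, 0, 1, 0, 1],
})
test = pd.DataFrame({
    "id": [2041, 399, 1400],
    "age": [31, 60, 45],
    "job": ["student", "retired", "admin"],
})

features = ["age", "job"]  # 'id' is deliberately excluded from training

# One-hot encode train and test together so both end up with the same columns.
combined = pd.get_dummies(
    pd.concat([train[features], test[features]]), drop_first=True, dtype=int
)
X_train = combined.iloc[:len(train)]
X_test = combined.iloc[len(train):]

tree = DecisionTreeClassifier(random_state=0)  # defaults, seeded for repeatability
tree.fit(X_train, train["purchased"])

# Column order and names follow the required output format.
submission = pd.DataFrame({
    "bank_predicted": tree.predict(X_test),
    "id": test["id"].to_numpy(),
})
submission.to_csv("bank_predictions.csv", index=False)
```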

2. Clustering KMeans
Description:
Given below is a data set on the education status of Indian states.

[Data set on the education status of Indian states: table not recoverable from the source]

Which parameters do you think are the most important for segmenting the
states? How did you decide this? How will you check whether the segmentation is
good or whether you need to use different factors? How do the clusters formed
without scaling differ from the clusters formed after scaling?
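
The effect of scaling on KMeans can be sketched on made-up data. The two features below (a rate in percent and a head count) are assumptions for illustration and are not taken from the education data set. KMeans uses Euclidean distance, so without scaling the feature with the largest numeric range dominates the clustering. The silhouette score is one common way to check how well separated the resulting clusters are.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
# Hypothetical data: two groups differ on a small-scale feature (a rate in %),
# while a large-scale feature (a head count) carries no group structure.
rate = np.concatenate([rng.normal(60, 2, 20), rng.normal(85, 2, 20)])
count = rng.normal(500_000, 100_000, 40)
X = np.column_stack([rate, count])
true_group = np.array([0] * 20 + [1] * 20)

def cluster(data):
    """Run KMeans with k=2 and return the cluster label of each row."""
    return KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(data)

def agreement(labels):
    """Share of rows matching the true groups, ignoring label permutation."""
    match = np.mean(labels == true_group)
    return max(match, 1 - match)

X_scaled = StandardScaler().fit_transform(X)
labels_raw = cluster(X)
labels_scaled = cluster(X_scaled)

print("agreement without scaling:", agreement(labels_raw))
print("agreement with scaling:   ", agreement(labels_scaled))
print("silhouette (scaled):      ", silhouette_score(X_scaled, labels_scaled))
```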
