10/8/21, 9:33 AM Big Data Computing - - Unit 9 - Week-7
Assessment submitted.
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
X
[email protected]
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Big Data Computing (course)
Course
Thank you for taking the Week -
outline 7:Assignment-7.
How does an
NPTEL online
course work?
Week - 7:Assignment-7
Week-0 Your last recorded submission was on 2021-10-08, 09:33 Due date: 2021-10-13, 23:59 IST.
IST
Week-1
1) Suppose you are using a bagging based algorithm say a Random Forest in model 1 point
building. Which of the following can be true?
Week-2
1. Number of tree should be as large as possible
Week-3
2. You will have interpretability after using Random Forest
Week-4
Only 1
Only 2
Week-5
Both 1 and 2
Week-6
None of the mentioned
2) To apply bagging to regression trees which of the following is/are true in such case 1 point
Week-7 ?
Decision Trees
1. We build the N regression with N bootstrap sample
for Big Data
Analytics
2. We take the average the of N regression tree
(unit? 3. Each tree has a high variance with low bias
unit=67&lesson=68)
1 and 2
Big Data
2 and 3
Predictive
Analytics
1 and 3
(Part-I) (unit?
1, 2 and 3
unit=67&lesson=69)
3) In which of the following scenario a gain ratio is preferred over Information Gain ? 1 point
Big Data
Predictive
When a categorical variable has very small number of category
Analytics
(Part-II) (unit?
Number of categories is the not the reason
unit=67&lesson=70)
When a categorical variable has very large number of category
https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=67&assessment=98 1/3
10/8/21, 9:33 AM Big Data Computing - - Unit 9 - Week-7
Week-7:
None of the mentioned
Assessment submitted.
Lecture
X material (unit? 4) Which of the following is/are true about Random Forest and Gradient Boosting 1 point
unit=67&lesson=71) ensemble methods ?
Feedback for
Week 7 (unit? 1. Both methods can be used for classification task
unit=67&lesson=72) 2. Random Forest is use for classification whereas Gradient Boosting is use for regression
task
Quiz: Week -
3. Random Forest is use for regression whereas Gradient Boosting is use for Classification
7:Assignment-
task
7
(assessment?
4. Both methods can be used for regression task
name=98)
1 and 2
Text Transcripts
2 and 3
2 and 4
Books
1 and 4
5) Given an attribute table shown below, which stores the basic information of attribute 1 point
a, including the row identifier of instance row_id , values of attribute values (a) and class labels of
instances c.
Which of the following attribute will first provide the pure subset ?
Humidity
https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=67&assessment=98 2/3
10/8/21, 9:33 AM Big Data Computing - - Unit 9 - Week-7
Wind
Assessment submitted.
Outlook
X
None of the mentioned
6) True or False ?
1 point
Bagging provides an averaging over a set of possible datasets, removing noisy and non-stable
parts of models.
True
False
7) Hundreds of trees can be aggregated to form a Random forest model. Which of the 1 point
following is true about any individual tree in Random Forest?
1. Individual tree is built on a subset of the features
2. Individual tree is built on all the features
3. Individual tree is built on a subset of observations
4. Individual tree is built on full set of observations
1 and 3
1 and 4
2 and 3
2 and 4
8) Boosting any algorithm takes into consideration the weak learners. Which of the 1 point
following is the main reason behind using weak learners ?
Reason I-To prevent overfitting
Reason II- To prevent underfitting
Reason I
Reason II
Both the Reasons
None of the Reasons
You may submit any number of times before the due date. The final submission will be
considered for grading.
Submit Answers
https://onlinecourses.nptel.ac.in/noc21_cs86/unit?unit=67&assessment=98 3/3