Module 4 Supervised Learning

What is Ensemble Learning, with an example?


• Ensemble learning refers to a machine learning approach in which the predictions from multiple
models are merged to enhance the accuracy of the final prediction.
 E.g., Suppose you are a movie director and you have created a short movie on a very
important and interesting topic. Now, you want to take preliminary feedback (ratings) on the
movie before making it public. What are the possible ways by which you can do that?
o A: You may ask one of your friends to rate the movie for you.
o B: Another way could be by asking 5 colleagues of yours to rate the movie.
o C: How about asking 50 people to rate the movie?
o With option C, the aggregated rating from a large, diverse group is the most reliable;
ensemble learning applies the same idea by combining many models instead of many reviewers.

Simple Ensemble Techniques-


1) Max Voting
2) Averaging
3) Weighted Averaging

a) Max Voting
i) The max voting method is generally used for classification problems.
ii) In this technique, multiple models are used to make predictions for each data point.
iii) The predictions by each model are considered as a ‘vote’.
iv) The predictions which we get from the majority of the models are used as the final
prediction.
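The max voting steps above can be sketched in plain Python. The labels and model outputs below are hypothetical, and the "models" are just hard-coded prediction lists standing in for real classifiers:

```python
from collections import Counter

def max_vote(predictions):
    """Return the majority-voted label for each data point.

    predictions: list of per-model prediction lists, all the same length.
    """
    final = []
    for votes in zip(*predictions):  # one tuple of votes per data point
        final.append(Counter(votes).most_common(1)[0][0])
    return final

# Three hypothetical classifiers predicting the same 4 data points
model_a = ["cat", "dog", "dog", "cat"]
model_b = ["cat", "cat", "dog", "dog"]
model_c = ["dog", "cat", "dog", "cat"]

print(max_vote([model_a, model_b, model_c]))  # ['cat', 'cat', 'dog', 'cat']
```

Each data point takes the label predicted by the majority of the three models.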

b) Averaging
i) Similar to the max voting technique, multiple predictions are made for each data point
in averaging.
ii) In this method, we take an average of predictions from all the models and use it to
make the final prediction.
iii) Averaging can be used for making predictions in regression problems.

c) Weighted Averaging
i) This is an extension of the averaging method.
ii) All models are assigned different weights defining the importance of each model for
prediction.
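Averaging and weighted averaging can be sketched together; passing equal weights reduces the weighted form to a plain average. The ratings below are hypothetical:

```python
def weighted_average(predictions, weights):
    """Combine regression predictions as a weighted mean per data point.

    predictions: list of per-model prediction lists; weights: one weight per model.
    """
    total = sum(weights)
    return [sum(w * p for w, p in zip(weights, point)) / total
            for point in zip(*predictions)]

# Hypothetical ratings from three reviewers for two movies
ratings = [[4.0, 5.0], [4.5, 4.0], [5.0, 3.0]]
print(weighted_average(ratings, [1, 1, 1]))  # plain averaging: [4.5, 4.0]
print(weighted_average(ratings, [3, 1, 1]))  # first reviewer's vote counts triple
```

With weights [3, 1, 1] the first model's opinion dominates, which is useful when one model is known to be more reliable.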

Advanced Ensemble techniques-


1. Bagging
2. Boosting
3. Stacking
a. Bagging
i. The idea behind bagging is to combine the results of multiple models.
ii. If all the models are created on the same set of data, there is a high chance
that they will give the same result, since they are getting the same input;
bagging avoids this by training each model on a different bootstrapped subset.
iii. Bootstrapping is a sampling technique in which we create subsets of
observations from the original dataset, with replacement.
iv. Multiple subsets are created from the original dataset, selecting
observations with replacement.
v. A base model (weak model) is created on each of these subsets.
vi. The models run in parallel and are independent of each other.
vii. The final predictions are determined by combining the predictions from all
the models.
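The bagging steps above can be sketched as follows. To keep the example self-contained, the "base model" here is just the mean of its bootstrap sample (a placeholder for a real weak learner), and the combination step averages the models, as in the regression case:

```python
import random

def bootstrap_sample(data, rng):
    """Draw a subset of the same size as data, sampling with replacement."""
    return [rng.choice(data) for _ in data]

def bagging_predict(data, n_models=10, seed=0):
    """Toy bagging: fit one 'model' (here, a sample mean) per bootstrap
    subset, then combine the models by averaging their outputs."""
    rng = random.Random(seed)
    model_outputs = [sum(s) / len(s)
                     for s in (bootstrap_sample(data, rng) for _ in range(n_models))]
    return sum(model_outputs) / len(model_outputs)

data = [1.0, 2.0, 3.0, 4.0, 5.0]
print(bagging_predict(data))  # an average of the per-model means (between 1.0 and 5.0)
```

Because every subset is drawn with replacement, the models see different inputs and their combined output is more stable than any single one.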
b. Boosting
i. Boosting is a sequential process, where each subsequent model attempts to
correct the errors of the previous model.
ii. The succeeding models are dependent on the previous model.
 Algorithm
i. A subset is created from the original dataset.
ii. Initially, all data points are given equal weights.
iii. A base model is created on this subset.
iv. This model is used to make predictions on the whole dataset.
v. Errors are calculated using the actual values and predicted values.
vi. The observations which are incorrectly predicted, are given higher
weights.
vii. Another model is created, and predictions are made on the dataset.
viii. Similarly, multiple models are created, each correcting the errors of the
previous model.
ix. The final model (strong learner) is the weighted mean of all the models
(weak learners).
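The reweighting step at the heart of the algorithm above (steps vi and vii) can be sketched in isolation. The boost factor of 2.0 is an arbitrary illustrative choice, not a value prescribed by any particular boosting algorithm:

```python
def update_weights(weights, correct, factor=2.0):
    """One boosting round: increase the weights of misclassified points
    (factor is a hypothetical boost rate), then renormalize to sum to 1."""
    boosted = [w if ok else w * factor for w, ok in zip(weights, correct)]
    total = sum(boosted)
    return [w / total for w in boosted]

# Four data points, equal weights initially; points 2 and 4 were misclassified
weights = [0.25, 0.25, 0.25, 0.25]
correct = [True, False, True, False]
weights = update_weights(weights, correct)
print(weights)  # misclassified points now carry more weight
```

The next model trained on these weights pays more attention to the points the previous model got wrong, which is what makes boosting sequential.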

c. Stacking
i. Stacking is an ensemble learning technique that uses predictions from
multiple models (for example decision tree, knn or svm) to build a new
model.
ii. This model is used for making predictions on the test set.
iii. The train set is split into 10 parts.
iv. A base model (suppose a decision tree) is fitted on 9 parts and predictions
are made for the 10th part. This is done for each part of the train set.
v. The base model (in this case, decision tree) is then fitted on the whole train
dataset.
vi. Using this model, predictions are made on the test set.
vii. Steps 2 to 4 are repeated for another base model (say knn) resulting in
another set of predictions for the train set and test set.
viii. The predictions from the train set are used as features to build a new
model.
ix. This model is used to make final predictions on the test prediction set.

Evaluating an ML model-

• How well is my model doing? Is it a useful model?


• Will training my model on more data improve its performance?
• Do I need to include more features?
Metrics-

• Classification metrics
o When performing classification predictions, there are four types of outcomes that
can occur.
 True positives are when you predict an observation belongs to a class and it does belong to that
class.

 True negatives are when you predict an observation does not belong to a class and it does not
belong to that class.

 False positives occur when you predict an observation belongs to a class when it does not.

 False negatives occur when you predict an observation does not belong to a class when in fact it
does.
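The four outcome counts can be computed directly from a pair of label lists. The labels below are hypothetical:

```python
def confusion_counts(actual, predicted, positive="yes"):
    """Count TP, TN, FP, FN for a binary classification."""
    tp = tn = fp = fn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1      # predicted positive, actually positive
        elif p != positive and a != positive:
            tn += 1      # predicted negative, actually negative
        elif p == positive:
            fp += 1      # predicted positive, actually negative
        else:
            fn += 1      # predicted negative, actually positive
    return tp, tn, fp, fn

actual    = ["yes", "yes", "no", "no", "yes"]
predicted = ["yes", "no",  "no", "yes", "yes"]
print(confusion_counts(actual, predicted))  # (2, 1, 1, 1)
```

All of the metrics below are simple ratios of these four counts.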

Accuracy-

• The most commonly used metric to judge a model, but not always a clear indicator of
performance. It is especially misleading when classes are imbalanced.

TP + TN
-----------------------
TP + FP + TN + FN
Precision-

• Percentage of positive instances out of the total predicted positive instances.


• Here the denominator (TP + FP) is the total number of instances the model predicted as positive in the whole given dataset.

TP
-------------
TP + FP
Recall / Sensitivity / True Positive Rate-

• Percentage of positive instances out of the total actual positive instances.


• Therefore the denominator (TP + FN) here is the actual number of positive instances present in the
dataset.

TP
-------------
TP + FN
Specificity-

• Percentage of negative instances out of the total actual negative instances.


• Therefore the denominator (TN + FP) here is the actual number of negative instances present in the
dataset. It is like recall, but the focus shifts to the negative instances.

TN
-------------
TN + FP

F1 score-
• It is the harmonic mean of precision and recall.
• It takes the contribution of both, so the higher the F1 score, the better.
• Because of the product in the numerator, if either precision or recall goes low, the final F1 score goes down significantly.

2 X Precision X Recall
-------------------------------
Precision + Recall
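The three formulas above translate directly to code. The counts below (tp=8, fp=2, fn=8) are hypothetical:

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F1 from confusion-matrix counts."""
    precision = tp / (tp + fp)                             # TP / (TP + FP)
    recall = tp / (tp + fn)                                # TP / (TP + FN)
    f1 = 2 * precision * recall / (precision + recall)     # harmonic mean
    return precision, recall, f1

p, r, f1 = precision_recall_f1(tp=8, fp=2, fn=8)
print(p, r, round(f1, 3))  # 0.8 0.5 0.615
```

Note how the F1 score (0.615) sits closer to the lower of the two values (recall, 0.5) than a plain average would, which is the point of using the harmonic mean.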
ROC curve-
• ROC stands for Receiver Operating Characteristic; the curve plots the true positive rate (TPR)
against the false positive rate (FPR) for various threshold values.
• As the threshold is lowered, TPR increases but FPR also increases.
• Among the candidate threshold values, we want the one that brings the curve closest to the top-left
corner (high TPR, low FPR).
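A point on the ROC curve can be computed for any threshold from predicted scores and true labels. The scores and labels below are hypothetical:

```python
def tpr_fpr(scores, labels, threshold):
    """TPR and FPR when scores >= threshold are predicted positive (label 1)."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
    fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 1)
    fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
    tn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 0)
    return tp / (tp + fn), fp / (fp + tn)

scores = [0.9, 0.8, 0.6, 0.4, 0.3, 0.1]
labels = [1,   1,   0,   1,   0,   0]
for t in (0.7, 0.5, 0.2):
    print(t, tpr_fpr(scores, labels, t))  # lower thresholds raise both TPR and FPR
```

Sweeping the threshold and plotting the resulting (FPR, TPR) pairs traces out the ROC curve.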

Regression metrics-
• Evaluation metrics for regression models are quite different from the metrics above.
• Classification is only concerned with whether a prediction was correct or incorrect, whereas
regression metrics measure how far the prediction is from the true value.
o Explained variance: - Explained variance compares the variance within the expected
outcomes and compares that to the variance in the error of our model. This metric essentially
represents the amount of variation in the original dataset that our model can explain.
o Mean squared error: - Mean squared error is simply defined as the average of squared
differences between the predicted output and the true output. Squared error is commonly used
because it is agnostic to whether the prediction was too high or too low, it just reports that the
prediction was incorrect.
o R2 coefficient represents the proportion of variance in the outcome that our model can
predict based on its features.
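Mean squared error and the R² coefficient follow directly from their definitions above. The true and predicted values below are hypothetical:

```python
def mse(y_true, y_pred):
    """Mean squared error: average of squared differences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def r2(y_true, y_pred):
    """R^2: 1 - (residual sum of squares / total sum of squares)."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.0, 7.5, 9.0]
print(mse(y_true, y_pred))  # 0.125
print(r2(y_true, y_pred))   # 0.975
```

An R² of 0.975 means the model explains 97.5% of the variance in the outcome; squaring the errors in MSE makes over- and under-predictions count equally.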

Bias vs Variance-
• In general, a machine learning model analyses the data, finds patterns in it and makes predictions.
• While training, the model learns these patterns in the dataset and applies them to test data for
prediction.
• While making predictions, a difference occurs between the values predicted by the model and the
actual/expected values; this difference is known as bias error, or error due to bias.

 Low Bias: A low bias model will make fewer assumptions about the form of the target function.
 High Bias: A model with a high bias makes more assumptions, and the model becomes unable to
capture the important features of our dataset. A high bias model also cannot perform well on new
data.
• Variance tells how much a random variable differs from its expected value.
• A model should not vary too much from one training dataset to another, which means the algorithm
should be good at capturing the hidden mapping between input and output variables.
• Variance errors are either of low variance or high variance.
 Low variance means there is a small variation in the prediction of the target function with
changes in the training data set.
 High variance shows a large variation in the prediction of the target function with changes in the
training dataset.

Different Combinations of Bias-Variance-


• Low-Bias, Low-Variance:
The combination of low bias and low variance shows an ideal machine learning model. However, it
is rarely achievable in practice.
• Low-Bias, High-Variance: With low bias and high variance, model predictions are inconsistent but
accurate on average. This case occurs when the model learns a large number of parameters and hence
leads to overfitting.
• High-Bias, Low-Variance: With high bias and low variance, predictions are consistent but inaccurate
on average. This case occurs when a model does not learn well from the training dataset or uses too
few parameters. It leads to underfitting problems in the model.
• High-Bias, High-Variance:
With high bias and high variance, predictions are inconsistent and inaccurate on average.
 In summary, a model with high bias is limited from learning the true trend and underfits
the data.
 A model with high variance learns too much from the training data and overfits the data.
 The best model sits somewhere in the middle of the two extremes.
