Module 10- Part II
Boosting Models
AdaBoost, GBM, XGBoost
Prof. Pedram Jahangiry
Class Modules
• Module 1- Introduction to Machine Learning
• Module 2- Setting up Machine Learning Environment
• Module 3- Linear Regression (Econometrics approach)
• Module 4- Machine Learning Fundamentals
• Module 5- Linear Regression (Machine Learning approach)
• Module 6- Penalized Regression (Ridge, LASSO, Elastic Net)
• Module 7- Logistic Regression
• Module 8- K-Nearest Neighbors (KNN)
• Module 9- Classification and Regression Trees (CART)
• Module 10- Bagging and Boosting
• Module 11- Dimensionality Reduction (PCA)
• Module 12- Clustering (KMeans – Hierarchical)
Road map: ML Algorithms
• Supervised
  – Regression: Linear / Polynomial regression, Penalized regression, KNN, SVR, Tree-based regression models
  – Classification: Logistic regression, KNN, SVM / SVC, Tree-based classification models
• Unsupervised
  – Clustering: K-Means, Hierarchical
  – Dimensionality Reduction: Principal Component Analysis (PCA)
• Tree-based models:
  1. Decision Trees (DTs)
  2. Bagging, Random Forests
  3. Boosting
Topics
Part I
1. Bagging vs Boosting
2. AdaBoost
3. Gradient Boosting Machine (GBM)
4. XGBoost
Part II
Pros and Cons
Part I
1. Bagging vs Boosting
2. AdaBoost
3. Gradient Boosting Machine (GBM)
4. XGBoost
Bagging vs Boosting
• Bagging consists of creating many “copies” of the training data (each copy slightly different from the others), applying the weak learner to each copy to obtain multiple weak models, and then combining them.
• In bagging, the bootstrapped trees are independent of each other.
• Boosting consists of using the “original” training data and iteratively creating multiple models with a weak learner. Each new model differs from the previous ones in that the weak learner, in building each new model, tries to “fix” the errors that the previous models made.
• In boosting, each tree is grown using information from the previous trees (a sketch of the contrast follows below).
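A minimal scikit-learn sketch of this contrast, with illustrative settings (the dataset, tree depth, and number of estimators are assumptions, not from the slides). BaggingClassifier fits each weak tree on an independent bootstrap copy of the data, while AdaBoostClassifier grows each tree using information from the previous ones. Depending on the scikit-learn version, the base-learner argument is `estimator` (newer) or `base_estimator` (older).

```python
# Illustrative sketch: bagging (independent bootstrapped trees)
# vs. boosting (sequential trees that try to fix earlier errors).
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Bagging: many bootstrap "copies" of the training data, one weak tree per copy.
bagging = BaggingClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),   # weak learner
    n_estimators=200, bootstrap=True, random_state=0,
).fit(X_train, y_train)

# Boosting: the original data, trees built iteratively to correct earlier mistakes.
boosting = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),   # stump
    n_estimators=200, random_state=0,
).fit(X_train, y_train)

print("Bagging accuracy :", accuracy_score(y_test, bagging.predict(X_test)))
print("Boosting accuracy:", accuracy_score(y_test, boosting.predict(X_test)))
```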
AdaBoost (Adaptive Boosting)
• A forest of weak learners (trees that split on only one feature: stumps).
• Each tree (stump) depends on the previous tree’s errors rather than being independent.
1) Start with the usual splitting criteria.
2) Each tree (stump) gets a different weight based on its prediction accuracy.
3) Each observation gets a weight based on how poorly it was predicted (e.g., misclassified observations get more weight).
4) Aggregation is done based on each weak learner’s weight (a step-by-step sketch follows below).
Source: Towards Data Science
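A hand-rolled sketch of steps 1–4 for discrete AdaBoost with labels in {-1, +1}; the dataset, number of rounds, and variable names are illustrative assumptions rather than the course code.

```python
# Step-by-step AdaBoost loop (discrete AdaBoost, labels in {-1, +1}).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
y = np.where(y == 1, 1, -1)                          # relabel to {-1, +1}

n, n_rounds = len(y), 50
w = np.full(n, 1.0 / n)                              # equal observation weights to start
stumps, alphas = [], []

for _ in range(n_rounds):
    stump = DecisionTreeClassifier(max_depth=1)      # 1) weak learner: a single stump
    stump.fit(X, y, sample_weight=w)
    pred = stump.predict(X)
    err = w[pred != y].sum() / w.sum()               # weighted error of this stump
    alpha = 0.5 * np.log((1 - err) / (err + 1e-10))  # 2) stump weight from its accuracy
    w *= np.exp(-alpha * y * pred)                   # 3) up-weight misclassified observations
    w /= w.sum()
    stumps.append(stump)
    alphas.append(alpha)

# 4) Aggregate: weighted vote of all stumps.
scores = sum(a * s.predict(X) for a, s in zip(alphas, stumps))
print("Training accuracy:", np.mean(np.sign(scores) == y))
```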
AdaBoost
Key features:
• Adaptive: updates the weights of misclassified instances at each step.
• Tends to be sensitive to noise and outliers.
• Can be used with various base classifiers, but is most commonly used with decision stumps (see the sketch below).
• AdaBoost is old: it is a popular boosting technique introduced by Yoav Freund and Robert Schapire in 1996.
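A small sketch of swapping the base classifier, assuming scikit-learn’s AdaBoostClassifier (the argument is named `estimator` in recent versions and `base_estimator` in older ones); the depths and other settings are illustrative.

```python
# Illustrative: AdaBoost with different base classifiers (stump vs. deeper tree).
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=1)

for depth in (1, 3):                                  # depth 1 = classic decision stump
    model = AdaBoostClassifier(
        estimator=DecisionTreeClassifier(max_depth=depth),
        n_estimators=100, learning_rate=0.5, random_state=1,
    )
    score = cross_val_score(model, X, y, cv=5).mean()
    print(f"Base tree depth {depth}: CV accuracy = {score:.3f}")
```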
Gradient Boosting Machine (GBM)
Source: GeeksforGeeks
• In gradient boosting, each weak learner corrects its predecessor’s errors.
• Unlike AdaBoost, the weights of the training instances are not tweaked; instead, each predictor is trained using the residual errors of its predecessor as labels.
• Unlike AdaBoost, each tree can be larger than a stump. However, the trees are still small. By fitting a small tree to the residuals, the GBM slowly improves f̂ in areas where it does not perform well.
• The learning rate shrinks the contribution of each tree. There is a trade-off between the learning rate and the number of trees: a smaller learning rate slows the process down further, allowing more and differently shaped trees to attack the residuals.
• Aggregation is done by adding the first tree’s predictions and a scaled (shrunken) version of the following trees (a residual-fitting sketch follows below).
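A hand-rolled sketch of this residual-fitting loop for squared-error regression (the dataset, tree depth, learning rate, and number of trees are assumptions for illustration), followed by the equivalent call to scikit-learn’s built-in GradientBoostingRegressor.

```python
# GBM idea by hand: fit small trees to the current residuals and add a
# shrunken (learning-rate-scaled) correction at each step.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=500, n_features=10, noise=10.0, random_state=0)

learning_rate, n_trees = 0.1, 200
prediction = np.full_like(y, y.mean())               # start from a constant prediction
trees = []

for _ in range(n_trees):
    residuals = y - prediction                       # errors of the ensemble so far
    tree = DecisionTreeRegressor(max_depth=2)        # small tree, larger than a stump
    tree.fit(X, residuals)                           # residuals become the new "labels"
    prediction += learning_rate * tree.predict(X)    # add a shrunken correction
    trees.append(tree)

print("Training MSE:", np.mean((y - prediction) ** 2))

# The same trade-off (learning_rate vs. n_estimators) with scikit-learn:
from sklearn.ensemble import GradientBoostingRegressor
gbm = GradientBoostingRegressor(n_estimators=200, learning_rate=0.1, max_depth=2)
gbm.fit(X, y)
```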
Extreme Gradient Boosting (XGBoost)
• XGBoost is a refined and customized version of a gradient boosting decision tree system, created
with performance and speed in mind.
• Extreme refers to the fact that the algorithms and methods have been customized to push the limit
of what is possible for gradient boosting algorithms.
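A minimal sketch using the xgboost package’s scikit-learn-style wrapper (this assumes xgboost is installed; the hyperparameter values below are illustrative, not recommendations).

```python
# Minimal XGBoost classifier sketch with a few of its many knobs.
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = XGBClassifier(
    n_estimators=300,       # number of boosted trees
    learning_rate=0.1,      # shrinkage applied to each tree's contribution
    max_depth=3,            # small trees, as in GBM
    subsample=0.8,          # row subsampling per tree
    colsample_bytree=0.8,   # column subsampling per tree
    reg_lambda=1.0,         # L2 regularization on leaf weights
    random_state=0,
)
model.fit(X_train, y_train)
print("Test accuracy:", accuracy_score(y_test, model.predict(X_test)))
```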
Put it all together!
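To put the three boosting models side by side, here is a rough comparison sketch on one synthetic dataset (settings are arbitrary; this is an illustration, not a benchmark).

```python
# AdaBoost vs. GBM vs. XGBoost on the same data, via 5-fold cross-validation.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score
from xgboost import XGBClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=42)

models = {
    "AdaBoost": AdaBoostClassifier(n_estimators=200, random_state=42),
    "GBM": GradientBoostingClassifier(n_estimators=200, learning_rate=0.1,
                                      max_depth=2, random_state=42),
    "XGBoost": XGBClassifier(n_estimators=200, learning_rate=0.1,
                             max_depth=2, random_state=42),
}
for name, model in models.items():
    score = cross_val_score(model, X, y, cv=5).mean()
    print(f"{name:>8}: CV accuracy = {score:.3f}")
```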
Part II
Pros and Cons
XGBoost’s Pros and Cons
Pros:
Cons:
• XGBoost is more difficult to understand, visualize, and tune than AdaBoost and random forests.
• There is a multitude of hyperparameters that can be tuned to increase performance (a tuning sketch follows below).
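A small sketch of tuning a handful of those hyperparameters with scikit-learn’s GridSearchCV (the grid below is deliberately tiny and purely illustrative).

```python
# Grid search over a few XGBoost hyperparameters.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from xgboost import XGBClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

param_grid = {
    "n_estimators": [100, 300],
    "learning_rate": [0.05, 0.1],
    "max_depth": [2, 4],
    "subsample": [0.8, 1.0],
}
search = GridSearchCV(XGBClassifier(random_state=0), param_grid, cv=3)
search.fit(X, y)
print("Best parameters :", search.best_params_)
print("Best CV accuracy:", round(search.best_score_, 3))
```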
Class Modules
✓ Module 1- Introduction to Machine Learning
✓ Module 2- Setting up Machine Learning Environment
✓ Module 3- Linear Regression (Econometrics approach)
✓ Module 4- Machine Learning Fundamentals
✓ Module 5- Linear Regression (Machine Learning approach)
✓ Module 6- Penalized Regression (Ridge, LASSO, Elastic Net)
✓ Module 7- Logistic Regression
✓ Module 8- K-Nearest Neighbors (KNN)
✓ Module 9- Classification and Regression Trees (CART)
✓ Module 10- Bagging and Boosting
• Module 11- Dimensionality Reduction (PCA)
• Module 12- Clustering (KMeans – Hierarchical)