Math Behind AdaBoost Algorithm in 3 steps…
Sampath · May 1, 2020
Hi guys, whenever we participate in data science hackathons, the first algorithm that comes to mind is boosting, which improves the accuracy of our model. But unfortunately, we often don't know the math behind these boosting algorithms.
In this article, we are going to master the math behind the boosting algorithms in a simple manner. There are different types of boosting algorithms:
1. AdaBoost (Adaptive Boosting)
2. Gradient Boosting
3. XGBoost
In this article, we will focus on AdaBoost; Gradient Boosting and XGBoost will be covered in an upcoming article.
I am assuming that you have a basic understanding of how a Decision Tree works. If you're not sure of your understanding, I would request you to go through the Decision Tree algorithm before you read on.
Terminology related to boosting algorithms:
Most blogs and books use the term "Weak Learner". In AdaBoost, the weak learners are typically stumps.
A tree with just one node and two leaves is called a stump, as shown in Fig. 1.
Fig 1
AdaBoost:
AdaBoost is best used to boost the performance of decision trees on binary classification problems, and it works best with weak learners. Let me explain the 3 ideas behind AdaBoost:-
1. AdaBoost combines a lot of “weak learners”(stumps) to make classifications.
2. Some stumps get more say (information) in the classification than others.
3. Each stump is made by taking the previous stump's mistakes into account.
Now we will go through the math behind AdaBoost, and then you will understand the above 3 points for sure.
We will consider a simple dataset to understand the concept clearly. The dataset is shown below in Fig. 2.
Fig.2
The first thing we do is assign a weight to each and every sample (data point) that indicates how important it is to be correctly classified. Now we create a new column with the name "Sample Weight".
Note:- The “Sample Weight” is different from “Patient Weight”.
At the start, all samples get the same weight, and that makes all samples equally important. The formula to calculate the initial Sample Weight is:
Sample Weight = 1 / (total number of samples)
Here we have 8 observations in our sample dataset, so each sample weight is 1/8 = 0.125.
After adding the "Sample Weight" column, the dataset looks as shown below:-
Fig.3
However, after we make the first stump, these weights will change in order to guide how the next stump is created.
Initially, we will ignore the "Sample Weight" column because all the weights are the same. Now we need to make the first stump.
We start by seeing how well "Chest Pain" classifies the samples, and then see how the other variables (Blocked Arteries, Patient Weight) classify the samples.
Chest Pain:-
Fig.4
Of the 5 samples in the leaf that predicts Heart Disease, 3 were correctly classified as having Heart Disease and 2 were incorrectly classified.
Of the 3 samples in the leaf that predicts no Heart Disease, 2 were correctly classified as not having Heart Disease, and 1 was incorrectly classified.
Blocked Arteries:-
Fig.5
Of the 6 samples in the leaf that predicts Heart Disease, 3 were correctly classified as having Heart Disease and 3 were incorrectly classified.
Of the 2 samples in the leaf that predicts no Heart Disease, 1 was correctly classified as not having Heart Disease, and 1 was incorrectly classified.
Patient Weight:-
Fig.6
Of the 3 samples in the leaf that predicts Heart Disease, all 3 were correctly classified as having Heart Disease, and none were incorrectly classified.
Of the 5 samples in the leaf that predicts no Heart Disease, 4 were correctly classified as not having Heart Disease, and 1 was incorrectly classified.
Now we will calculate the Gini Index for these three stumps.
Steps to calculate the Gini Index for a split:-
1. Calculate the Gini score for each sub-node as the sum of squared probabilities of success (in our case "Correct") and failure ("Incorrect"): p² + q².
2. Calculate the Gini Index for the split as 1 − (the weighted sum of the sub-node scores).
Now, I will explain with an example for better understanding. We start with "Chest Pain" (refer to Fig. 4):
1. Gini score for sub-node "Yes" = (3/5)² + (2/5)² = 0.52, and for sub-node "No" = (2/3)² + (1/3)² = 0.556.
2. Gini Index for the split "Chest Pain" = 1 − ((5/8) × 0.52 + (3/8) × 0.556) = 0.47.
In the same way, we calculate the Gini Index for "Blocked Arteries" and "Patient Weight": the Gini Index for Blocked Arteries is 0.5 and for Patient Weight is 0.2.
Chest Pain: 0.47
Blocked Arteries: 0.5
Patient Weight: 0.2
The Gini Index for “Patient Weight” is the lowest, so this would be the first stump.
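For concreteness, here is a minimal Python sketch (my own, not the author's code) that reproduces the three Gini Index values from the correct/incorrect counts above:

```python
def gini_index(leaves, total):
    """Gini Index for a split, following the article's two steps.

    leaves: list of (correct, incorrect) counts, one pair per leaf.
    """
    weighted_score = 0.0
    for correct, incorrect in leaves:
        n = correct + incorrect
        p, q = correct / n, incorrect / n
        weighted_score += (n / total) * (p ** 2 + q ** 2)
    return 1 - weighted_score

total = 8
print(round(gini_index([(3, 2), (2, 1)], total), 2))  # Chest Pain       -> 0.47
print(round(gini_index([(3, 3), (1, 1)], total), 2))  # Blocked Arteries -> 0.5
print(round(gini_index([(3, 0), (4, 1)], total), 2))  # Patient Weight   -> 0.2
```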
Now we need to determine how much "Amount of Say" (information) this stump will have in the final classification.
The formula to calculate the "Amount of Say" is:
Amount of Say = ½ × ln((1 − Total Error) / Total Error)
First, we need to calculate “Total Error” for stumps. The Total Error for a stump is
the sum of the weights associated with the incorrectly classified samples.
Total Error for Chest Pain:- it made 3 errors, i.e. 1/8 + 1/8 + 1/8 = 3/8.
Total Error for Blocked Arteries:- it made 4 errors, i.e. 1/8 + 1/8 + 1/8 + 1/8 = 4/8.
Total Error for Patient Weight:- it made 1 error, i.e. 1/8.
Note:- Because all the sample weights add up to 1, Total Error will always be between 0 and 1.
0 indicates a perfect stump; 1 indicates a horrible stump.
Now, the Amount of Say for "Patient Weight" is ½ × ln((1 − 1/8) / (1/8)) = ½ × ln(7) ≈ 0.97.
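As a quick check, here is a small sketch (again mine, not from the article) of the Amount of Say calculation for all three stumps; notice that a stump that is wrong half the time gets an Amount of Say of 0:

```python
import math

def amount_of_say(total_error):
    # 1/2 * ln((1 - Total Error) / Total Error); assumes 0 < total_error < 1
    return 0.5 * math.log((1 - total_error) / total_error)

print(round(amount_of_say(1 / 8), 2))  # Patient Weight   -> 0.97
print(round(amount_of_say(3 / 8), 2))  # Chest Pain       -> 0.26
print(round(amount_of_say(4 / 8), 2))  # Blocked Arteries -> 0.0
```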
Guys, whenever you attend a webinar or lecture on boosting, one point always comes up: boosting increases the weight of misclassified samples in the follow-up tree. But how? Now it's time to learn how to update the sample weights.
Basically, boosting will increase the sample weight for incorrectly classified samples and decrease the sample weight for correctly classified samples. The formula for updating the weights of incorrectly classified samples is:
New Sample Weight = Sample Weight × e^(Amount of Say)
There is one misclassified sample. Its sample weight is 0.125 and the Amount of Say of "Patient Weight" is 0.97.
New Sample Weight = 0.125 × e^0.97 ≈ 0.125 × 2.64 ≈ 0.33. The New Sample Weight for the misclassified sample is 0.33, which is more than the old weight.
The formula for updating the weights of correctly classified samples is:
New Sample Weight = Sample Weight × e^(−Amount of Say)
There are seven correctly classified samples. The sample weight of each of them is 1/8 and the Amount of Say of "Patient Weight" is 0.97.
New Sample Weight = 0.125 × e^(−0.97) ≈ 0.125 × 0.38 ≈ 0.05. The New Sample Weight for the correctly classified samples is 0.05, which is less than the old weight.
Note:- Here in the above scenario the "Sample Weight" is the same for all 7 samples, so we calculate the "New Sample Weight" only once. If the sample weights were different, we would need to calculate the "New Sample Weight" for each and every sample individually.
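Here is a sketch of the update rule in Python (an assumed example: I place the misclassified sample at index 3, which the article does not specify):

```python
import math

say = 0.97                                   # Amount of Say of the Patient Weight stump
weights = [0.125] * 8                        # initial sample weights (1/8 each)
misclassified = [i == 3 for i in range(8)]   # assumption: the 4th sample was wrong

# increase the weight of wrong samples, decrease the weight of right ones
new_weights = [w * math.exp(say if wrong else -say)
               for w, wrong in zip(weights, misclassified)]
print([round(w, 2) for w in new_weights])
# -> [0.05, 0.05, 0.05, 0.33, 0.05, 0.05, 0.05, 0.05]
```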
After adding the New Sample Weight Column to our Dataset…
Guys, if you observe, the New Sample Weight of the misclassified sample increased from 0.125 to 0.33, and the correctly classified samples' weights decreased from 0.125 to 0.05.
Right now, if you add up the New Sample Weights, you get about 0.68. So we divide each sample weight by 0.68 to get the normalized values. Now we consider the Normalized Weights as the New Sample Weights, as shown below.
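In code, the normalization step looks like this (a sketch; with the unrounded weights the sum is closer to 0.66, while the article's 0.68 comes from the rounded 0.33 and 0.05):

```python
import math

say = 0.97
new_weights = [0.125 * math.exp(say if i == 3 else -say) for i in range(8)]

total = sum(new_weights)                      # ~0.66 before rounding
normalized = [w / total for w in new_weights]
print(round(normalized[3], 2))   # misclassified sample -> ~0.5
print(round(normalized[0], 2))   # each correct sample  -> 0.07
print(round(sum(normalized), 2)) # -> 1.0
```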
Before building the next stump, we need to create a new dataset.
We start by making a new, empty dataset the same size as the original. We pick a random number between 0 and 1 and see where the number falls, using the normalized sample weights like a distribution. Guys, hold on, let me explain with an example for better understanding.
If the random number lies between 0 and 0.07, we put the first sample (data point) into the new dataset; if it lies between 0.071 and 0.14, we put the second sample, and so on.
I picked a random number eight times between 0 and 1 and got 0.78, 0.56, 0.94, 0.24, 0.68, 0.32, 0.13, 0.73. Our new dataset is created based on these numbers. The new dataset is shown below:
New Dataset
Ultimately, the wrongly classified sample was added to the new collection of samples (dataset) 4 times, reflecting its larger sample weight.
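Here is a sketch of that sampling trick in Python (mine, not the article's); it builds the cumulative ranges 0–0.07, 0.071–0.14, … and draws eight random numbers, just as described above:

```python
import bisect
import math
import random
from itertools import accumulate

say = 0.97
weights = [0.125 * math.exp(say if i == 3 else -say) for i in range(8)]
total = sum(weights)
cutoffs = list(accumulate(w / total for w in weights))  # cumulative ranges

# draw 8 random numbers and see which range each one falls into
new_indices = [min(bisect.bisect_left(cutoffs, random.random()), 7)
               for _ in range(8)]
print(new_indices)  # the heavy sample (index 3) tends to appear ~4 times

# random.choices(range(8), weights=weights, k=8) does the same in one call
```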
From now on, we use the new collection of samples as the dataset and repeat the same procedure as above (see the sketch after this list), i.e.
1. Assign equal weights to all samples.
2. Find the stump that does the best job of classifying the new collection of samples.
3. Calculate the Total Error and Amount of Say, and use them to compute the New Sample Weights.
4. Normalize the New Sample Weights.
5. Repeat the above 4 steps until the desired number of stumps is built (or the samples are classified well enough).
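Putting the five steps together, here is a compact end-to-end sketch (my own, under the resampling simplifications above). `train_stump` is a hypothetical helper: it should return the best stump (e.g. by Gini Index) plus a per-sample correct/incorrect flag:

```python
import math
import random

def adaboost(X, y, train_stump, n_stumps=6):
    """Sketch of the resampling variant of AdaBoost described above."""
    n = len(y)
    data_X, data_y = list(X), list(y)
    stumps = []
    for _ in range(n_stumps):
        weights = [1 / n] * n                          # 1. equal weights
        stump, correct = train_stump(data_X, data_y)   # 2. best stump
        error = sum(w for w, c in zip(weights, correct) if not c)
        error = min(max(error, 1e-10), 1 - 1e-10)      # guard against log(0)
        say = 0.5 * math.log((1 - error) / error)      # 3. Amount of Say
        weights = [w * math.exp(say if not c else -say)
                   for w, c in zip(weights, correct)]
        total = sum(weights)
        weights = [w / total for w in weights]         # 4. normalize
        picks = random.choices(range(n), weights=weights, k=n)  # new dataset
        data_X = [data_X[i] for i in picks]
        data_y = [data_y[i] for i in picks]
        stumps.append((say, stump))                    # 5. repeat
    return stumps
```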
So that is how the errors that the first tree makes influence how the second tree is
made… and the errors that the second tree makes influence how the third tree is
made, … and so on
Finally, we need to talk about how the forest of stumps created by AdaBoost makes a classification.
Imagine 6 stumps were created by the AdaBoost algorithm. Out of the 6 stumps, 4 classified the patient as having Heart Disease, and the other 2 classified the patient as not having Heart Disease.
The Amounts of Say for the 4 "Heart Disease" stumps add up to 0.97 + 0.32 + 0.78 + 0.63 = 2.7, and the Amounts of Say of the other 2 stumps add up to 0.41 + 0.82 = 1.23.
Ultimately, the patient is classified as having Heart Disease because of the larger total Amount of Say (2.7).
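In code, the final vote is just a weighted sum per class (a sketch using the article's illustrative numbers):

```python
# each stump's (Amount of Say, prediction) from the example above
stumps = [(0.97, "Heart Disease"), (0.32, "Heart Disease"),
          (0.78, "Heart Disease"), (0.63, "Heart Disease"),
          (0.41, "No Heart Disease"), (0.82, "No Heart Disease")]

totals = {}
for say, prediction in stumps:
    totals[prediction] = totals.get(prediction, 0.0) + say

print(totals)                       # Heart Disease: ~2.7, No Heart Disease: ~1.23
print(max(totals, key=totals.get))  # -> Heart Disease
```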
In this article, we looked at AdaBoost, one of the ensemble-modeling methods used to enhance prediction power, and discussed the math behind it.
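If you want to try AdaBoost in practice without hand-rolling the loop, scikit-learn's AdaBoostClassifier uses decision stumps (depth-1 trees) as its default weak learner. A quick sketch with made-up data, not the article's dataset:

```python
import numpy as np
from sklearn.ensemble import AdaBoostClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))               # 3 made-up features
y = (X[:, 2] > 0).astype(int)               # label driven by the third feature

model = AdaBoostClassifier(n_estimators=6)  # 6 stumps, as in the example above
model.fit(X, y)
print(model.estimator_weights_)             # per-stump weights, analogous to the Amount of Say
print(model.predict(X[:5]))
```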
Guys, if you have any queries don’t hesitate to comment…