Decision Tree Regression Fully Explained by Example

The document explains decision tree regression, detailing how it constructs models in a tree structure by partitioning data into subsets based on decision nodes and leaf nodes. It describes the ID3 algorithm for building trees, emphasizing the use of standard deviation reduction to determine the best attributes for splitting the data. The process includes calculating standard deviations, determining when to stop branching, and assigning averages to leaf nodes based on the number of instances.

Decision Tree - Regression

Decision tree builds regression or classification models in the form of a tree structure. It breaks a dataset down into smaller and smaller subsets while an associated decision tree is incrementally developed. The final result is a tree with decision nodes and leaf nodes. A decision node (e.g., Outlook) has two or more branches (e.g., Sunny, Overcast and Rainy), each representing a value of the attribute tested. A leaf node (e.g., Hours Played) represents a decision on the numerical target. The topmost decision node in a tree, which corresponds to the best predictor, is called the root node. Decision trees can handle both categorical and numerical data.
and boost

Decision Tree Algorithm

The core algorithm for building decision trees, called ID3, was developed by J. R. Quinlan. It employs a top-down, greedy search through the space of possible branches with no backtracking. The ID3 algorithm can be used to construct a decision tree for regression by replacing Information Gain with Standard Deviation Reduction.

Standard Deviation
A decision tree is built top-down from a root node and involves partitioning the data into subsets that contain instances with similar values (homogeneous). We use standard deviation to calculate the homogeneity of a numerical sample. If the numerical sample is completely homogeneous, its standard deviation is zero.

a) Standard deviation for one attribute:

Standard Deviation (S) is used for tree building (branching).

Coefficient of Variation (CV) is used to decide when to stop branching. We can use Count (n) as well.

Average (Avg) is the value stored in the leaf nodes.
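
To make these three quantities concrete, here is a minimal Python sketch. The 14 "Hours Played" values are an assumption, since the original data table is an image and is not reproduced in this text; with these values the standard deviation works out to the 9.32 quoted in Step 1 below.

```python
import math

# Assumed "Hours Played" values for the 14-day weather example
# (the original data table is an image, so these are illustrative).
hours_played = [25, 30, 46, 45, 52, 23, 43, 35, 38, 46, 48, 52, 44, 30]

def average(values):
    """Avg: the value assigned to a leaf node."""
    return sum(values) / len(values)

def std_dev(values):
    """S: population standard deviation, used for branching."""
    avg = average(values)
    return math.sqrt(sum((x - avg) ** 2 for x in values) / len(values))

def coeff_of_variation(values):
    """CV: S / Avg as a percentage, used to decide when to stop branching."""
    return std_dev(values) / average(values) * 100

print(round(std_dev(hours_played), 2))             # 9.32
print(round(coeff_of_variation(hours_played), 1))  # 23.4
print(round(average(hours_played), 1))             # 39.8
```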

b) Standard deviation for two attributes (target and predictor):
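
The formula itself appears as an image on the original page. In words, the standard deviation of the target T for a predictor X is the average of the branch standard deviations, weighted by the fraction of rows that fall into each branch. A sketch reusing std_dev from above, and assuming each row is stored as a dict of attribute values:

```python
from collections import defaultdict

def weighted_std_dev(rows, attribute, target):
    """S(T, X): the branch standard deviations of the target T,
    weighted by the fraction of rows in each branch of attribute X."""
    branches = defaultdict(list)
    for row in rows:
        branches[row[attribute]].append(row[target])
    n = len(rows)
    return sum(len(vals) / n * std_dev(vals) for vals in branches.values())
```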

Standard Deviation Reduction


The standard deviation reduction is based on the decrease in standard deviation after a dataset is split on an attribute. Constructing a decision tree is all about finding the attribute that returns the highest standard deviation reduction (i.e., the most homogeneous branches).
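
In code, the reduction is simply the difference between the two quantities defined above (a sketch building on the earlier helpers):

```python
def std_dev_reduction(rows, attribute, target):
    """SDR(T, X) = S(T) - S(T, X): the drop in standard deviation
    obtained by splitting the rows on the given attribute."""
    before = std_dev([row[target] for row in rows])
    after = weighted_std_dev(rows, attribute, target)
    return before - after
```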

Step 1: The standard deviation of the target is calculated.

Standard deviation (Hours Played) = 9.32

Step 2: The dataset is then split on the different attributes. The standard deviation for each branch is calculated. The
resulting standard deviation is subtracted from the standard deviation before the split. The result is the standard
deviation reduction.

Step 3: The attribute with the largest standard deviation reduction is chosen for the decision node.
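
Steps 1 to 3 amount to computing the SDR for every candidate attribute and keeping the largest. A minimal sketch, assuming the four predictors of the weather example:

```python
def best_split(rows, attributes, target):
    """Steps 1-3: score every attribute by SDR and pick the largest."""
    scores = {a: std_dev_reduction(rows, a, target) for a in attributes}
    best = max(scores, key=scores.get)
    return best, scores

# Hypothetical call; `dataset` would hold the 14 weather rows as dicts:
# best, scores = best_split(dataset,
#                           ["Outlook", "Temp", "Humidity", "Windy"],
#                           "Hours Played")
# In the worked example, "Outlook" gives the largest reduction and becomes the root.
```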

Step 4a: The dataset is divided based on the values of the selected attribute. This process is run recursively on the non-
leaf branches, until all data is processed.

In practice, we need some termination criteria. For example, we stop when the coefficient of variation (CV) for a branch becomes smaller than a certain threshold (e.g., 10%) and/or when too few instances (n) remain in the branch (e.g., 3).
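
A sketch of that check, with the 10% CV threshold and the minimum of 3 instances used in this example as parameters:

```python
def should_stop(values, cv_threshold=10.0, min_instances=3):
    """Stop branching when the subset is homogeneous enough (low CV)
    or too small to split any further."""
    return coeff_of_variation(values) < cv_threshold or len(values) <= min_instances
```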

Step 4b: "Overcast" subset does not need any further splitting because its CV (8%) is less than the threshold (10%). The
related leaf node gets the average of the "Overcast" subset.

Step 4c: However, the "Sunny" branch has a CV (28%) greater than the threshold (10%), so it needs further splitting.
We select "Windy" as the best node after "Outlook" because it has the largest SDR; its branches are FALSE and TRUE.

Because the number of data points in both branches (FALSE and TRUE) is equal to or less than 3, we stop further branching and assign the average of each branch to the related leaf node.

Step 4d: Moreover, the "Rainy" branch has a CV (22%), which is more than the threshold (10%). This branch needs further splitting. We select "Temp" as the best node because it has the largest SDR.

Because the number of data points in all three branches (Cool, Hot and Mild) is equal to or less than 3, we stop further branching and assign the average of each branch to the related leaf node.

When the number of instances at a leaf node is more than one, we calculate their average as the final value for the target.
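
Putting the pieces together, the whole procedure can be sketched as one short recursive function (an illustrative outline building on the helpers above, not the original implementation):

```python
def build_tree(rows, attributes, target):
    """Recursively split on the attribute with the largest SDR;
    leaves hold the average of the remaining target values."""
    targets = [row[target] for row in rows]
    if should_stop(targets) or not attributes:
        return round(average(targets), 1)   # leaf: average as the predicted value
    best, _ = best_split(rows, attributes, target)
    remaining = [a for a in attributes if a != best]
    node = {}
    for value in set(row[best] for row in rows):
        subset = [row for row in rows if row[best] == value]
        node[value] = build_tree(subset, remaining, target)
    return {best: node}
```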

Exercise

Try to invent a new algorithm that constructs a decision tree from data using multiple linear regression (MLR) instead of the average at the leaf nodes.
