Decision Tree Algorithm
Supervised ML
What is a Decision Tree?
A decision tree is a type of supervised learning algorithm (having a predefined target variable) that is mostly used in classification problems. It works for both categorical and continuous input and output variables. In this technique, we split the population or sample into two or more homogeneous sets (or sub-populations) based on the most significant splitter/differentiator among the input variables.
Structure of a Decision Tree
Root Node: Represents the entire population or sample; it further gets divided into two or more homogeneous sets.
Splitting: The process of dividing a node into two or more sub-nodes.
Decision Node: When a sub-node splits into further sub-nodes, it is called a decision node.
Leaf/Terminal Node: Nodes that do not split are called leaf or terminal nodes.
Branch/Sub-Tree: A subsection of the entire tree is called a branch or sub-tree.
Parent and Child Node: A node that is divided into sub-nodes is called the parent node of those sub-nodes, whereas the sub-nodes are the children of the parent node.
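These terms map naturally onto a simple node data structure. The sketch below is purely illustrative (the class and field names are our own, not taken from any particular library): a decision node stores the attribute it splits on and its child nodes, while a leaf/terminal node stores a prediction.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Node:
    feature: Optional[str] = None       # attribute a decision node splits on (None for a leaf)
    threshold: Optional[float] = None   # split point, if the attribute is continuous
    children: List["Node"] = field(default_factory=list)  # sub-nodes produced by the split
    prediction: Optional[str] = None    # class label stored in a leaf/terminal node

    def is_leaf(self) -> bool:
        # A node that does not split further is a leaf/terminal node.
        return not self.children

# The root node is simply the topmost Node; every Node with children is a
# parent (decision) node, and its children form the branches/sub-trees.
```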
How does the Decision Tree Algorithm Work?
The basic idea behind any decision tree algorithm is as follows (a recursive code sketch follows this list):
Select the best attribute using an Attribute Selection Measure (ASM) to split the records.
Make that attribute a decision node and break the dataset into smaller subsets.
Start building the tree by repeating this process recursively for each child until one of the following conditions is met:
All the tuples belong to the same class (target value).
There are no more remaining attributes.
There are no more instances.
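As a rough illustration of this recursion (not any particular library's implementation), the sketch below assumes the dataset is a list of dicts plus a list of class labels, represents the tree with plain dicts for brevity, and takes the attribute-selection function as a parameter; concrete selection measures such as entropy, information gain, and the Gini index are sketched in the sections below.

```python
def majority_class(labels):
    # Fallback prediction for a leaf: the most common class in the subset.
    return max(set(labels), key=labels.count)

def build_tree(rows, labels, attributes, choose_best_attribute):
    # Stopping condition 1: all the tuples belong to the same class.
    if len(set(labels)) == 1:
        return {"leaf": labels[0]}
    # Stopping conditions 2 and 3: no remaining attributes or no remaining instances.
    if not attributes or not rows:
        return {"leaf": majority_class(labels) if labels else None}

    # Select the best attribute with an Attribute Selection Measure (ASM)
    # and make it a decision node.
    best = choose_best_attribute(rows, labels, attributes)
    node = {"attribute": best, "branches": {}}
    remaining = [a for a in attributes if a != best]

    # Break the dataset into smaller subsets, one per value of the chosen
    # attribute, and repeat the process recursively for each child.
    for value in set(row[best] for row in rows):
        sub_rows = [r for r in rows if r[best] == value]
        sub_labels = [l for r, l in zip(rows, labels) if r[best] == value]
        node["branches"][value] = build_tree(sub_rows, sub_labels, remaining,
                                             choose_best_attribute)
    return node
```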
Attribute Selection Measures
An attribute selection measure is a heuristic for selecting the splitting criterion that partitions the data in the best possible manner. It is also known as a splitting rule because it helps us determine breakpoints for tuples at a given node. An ASM assigns a rank to each feature (or attribute) by explaining the given dataset, and the attribute with the best score is selected as the splitting attribute. In the case of a continuous-valued attribute, split points for the branches also need to be defined.
The most popular selection measures are:
Entropy
Gini Index
Chi-Square
Gain Ratio.
What is Entropy?
Entropy is a measure of the uncertainty or impurity in a dataset. It quantifies the
amount of disorder or randomness. In the context of a decision tree, entropy helps
to determine how informative a particular split is.
High Entropy: Indicates high disorder, meaning the data is diverse and
uncertain.
Low Entropy: Indicates low disorder, meaning the data is more homogeneous
and certain.
The formula for entropy H for a binary classification problem is:
H(S) = −p₊ log₂(p₊) − p₋ log₂(p₋)
where:
p₊ is the proportion of positive examples in the dataset S
p₋ is the proportion of negative examples in the dataset S
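As a quick sanity check of this formula, here is a small Python sketch (the function name entropy is ours, not from a library); it generalises to more than two classes by summing −p log₂(p) over the classes actually present.

```python
import math

def entropy(labels):
    # H(S) = -p_plus*log2(p_plus) - p_minus*log2(p_minus), generalised to any
    # number of classes; only classes actually present contribute a term.
    n = len(labels)
    h = 0.0
    for c in set(labels):
        p = labels.count(c) / n
        h -= p * math.log2(p)
    return h

print(entropy(["+", "+", "+", "+"]))   # 0.0 -> all one class: low entropy
print(entropy(["+", "+", "-", "-"]))   # 1.0 -> 50/50 split: high entropy
```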
What is Information Gain?
Information Gain (IG) is a measure of the effectiveness of an attribute in
classifying the training data. It quantifies the reduction in entropy (uncertainty)
achieved by splitting the dataset based on an attribute.
The formula for Information Gain is:
Gain(S, A) = Entropy(S) − Σv (|Sv| / |S|) * Entropy(Sv)
where the sum runs over the values v of attribute A, and:
• S is the original dataset
• A is the attribute being evaluated
• Sv is the subset of S for which attribute A has value v
• Entropy(S) is the entropy of the original dataset
• Entropy(Sv) is the entropy of the subset Sv
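A short sketch of this computation, assuming the entropy() helper from the previous snippet and the same list-of-dicts dataset layout (both are our own conventions for this example):

```python
def information_gain(rows, labels, attribute):
    # Gain(S, A) = Entropy(S) - sum over values v of A of (|Sv| / |S|) * Entropy(Sv)
    total = entropy(labels)
    n = len(labels)
    weighted = 0.0
    for value in set(row[attribute] for row in rows):
        sub_labels = [l for r, l in zip(rows, labels) if r[attribute] == value]
        weighted += (len(sub_labels) / n) * entropy(sub_labels)
    return total - weighted

# An attribute that separates the classes perfectly has gain equal to the
# parent entropy (here 1.0); an uninformative attribute has gain 0.
rows = [{"wind": "weak"}, {"wind": "weak"}, {"wind": "strong"}, {"wind": "strong"}]
labels = ["+", "+", "-", "-"]
print(information_gain(rows, labels, "wind"))   # 1.0
```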
Gini index
Another decision tree algorithm, CART (Classification and Regression Trees), uses the Gini method to create split points. The Gini index of a dataset D is:
Gini(D) = 1 − Σ pi²
where pi is the probability that a tuple in D belongs to class Ci.
The Gini index considers a binary split for each attribute, and you compute a weighted sum of the impurity of each partition. If a binary split on attribute A partitions data D into D1 and D2, the Gini index of D is:
GiniA(D) = (|D1| / |D|) * Gini(D1) + (|D2| / |D|) * Gini(D2)
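A minimal sketch of both formulas (the function names are ours, not from a library):

```python
def gini(labels):
    # Gini(D) = 1 - sum of pi^2, where pi is the proportion of class Ci in D.
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def gini_split(labels_d1, labels_d2):
    # Weighted Gini index of a binary split of D into D1 and D2:
    # GiniA(D) = (|D1|/|D|) * Gini(D1) + (|D2|/|D|) * Gini(D2)
    n = len(labels_d1) + len(labels_d2)
    return (len(labels_d1) / n) * gini(labels_d1) + (len(labels_d2) / n) * gini(labels_d2)

print(gini(["yes", "yes", "no", "no"]))           # 0.5 -> perfectly mixed node
print(gini_split(["yes", "yes"], ["no", "no"]))   # 0.0 -> the split yields pure partitions
```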
Decision tree algorithms:
CART (Classification and Regression Trees) → uses the Gini index (classification) as its metric.
ID3 (Iterative Dichotomiser 3) → uses the entropy function and information gain as its metrics.
Information Gain:
By using information gain as a criterion, we try to estimate the information contained in each attribute, borrowing a few ideas from information theory.
The randomness or uncertainty of a random variable X is measured by its entropy.
Consider a binary classification problem with only two classes, a positive and a negative class:
If all examples are positive or all are negative, the entropy is zero, i.e., low.
If half of the records belong to the positive class and half to the negative class, the entropy is one, i.e., high.
By calculating the entropy measure of each attribute, we can calculate its information gain. Information gain measures the expected reduction in entropy due to sorting on the attribute.
Entropy can be calculated using the formula:
Entropy = −p log₂(p) − q log₂(q)
where p and q are the probabilities of success and failure, respectively, in that node.
Entropy is also used with a categorical target variable. The algorithm chooses the split that has the lowest entropy compared to the parent node and the other candidate splits. The lower the entropy, the better.
Steps to calculate entropy for a split:
Calculate the entropy of the parent node.
Calculate the entropy of each individual node of the split, then take the weighted average of all sub-nodes in the split.
PROCEDURE
First the entropy of the total dataset is calculated.
The dataset is then split on the different attributes.
The entropy for each branch is calculated, then added proportionally to get the total entropy for the split.
The resulting entropy is subtracted from the entropy before
the split.
The result is the Information Gain, or decrease in entropy.
The attribute that yields the largest IG is chosen for the decision
node.
EXAMPLE
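As a small, purely hypothetical illustration of the procedure (the counts below are made up for this example): suppose the parent node holds 10 records, 6 positive and 4 negative, and a candidate attribute splits them into one branch of 7 records (5 positive, 2 negative) and one branch of 3 records (1 positive, 2 negative).

```python
import math

def entropy_from_counts(pos, neg):
    # Entropy of a node given its class counts; empty classes contribute 0.
    total = pos + neg
    h = 0.0
    for count in (pos, neg):
        if count:
            p = count / total
            h -= p * math.log2(p)
    return h

# Step 1: entropy of the total dataset (parent node): 6 positive, 4 negative.
parent = entropy_from_counts(6, 4)                  # ~0.971

# Steps 2-3: entropy of each branch, added proportionally to get the
# total entropy for the split.
branch_a = entropy_from_counts(5, 2)                # 7 records, ~0.863
branch_b = entropy_from_counts(1, 2)                # 3 records, ~0.918
split = (7 / 10) * branch_a + (3 / 10) * branch_b   # ~0.880

# Steps 4-5: subtract to get the Information Gain for this attribute.
gain = parent - split                               # ~0.091
print(round(parent, 3), round(split, 3), round(gain, 3))
```

The attribute with the largest gain across all candidates would become the decision node.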
How can we avoid overfitting in decision trees?
Overfitting is a practical problem when building a decision tree model. A model is considered to be overfitting when the algorithm keeps going deeper and deeper into the tree to reduce the training-set error but ends up with an increased test-set error, i.e., the prediction accuracy of our model goes down. This generally happens when the tree builds many branches due to outliers and irregularities in the data.
Two approaches which we can use to avoid overfitting are:
Pre-Pruning
Post-Pruning
Pre-Pruning
In pre-pruning, we stop the tree construction a bit early. It is preferred not to split a node if its goodness measure is below a threshold value, but it is difficult to choose an appropriate stopping point.
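In practice, pre-pruning usually means capping tree growth through hyperparameters before training. As one possible illustration using scikit-learn's DecisionTreeClassifier (the specific threshold values below are arbitrary choices for the sketch, not recommendations):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Pre-pruning: stop the construction early by limiting depth and by refusing
# to split nodes that are too small or whose split gains too little.
tree = DecisionTreeClassifier(
    max_depth=3,                  # do not grow the tree beyond this depth
    min_samples_split=10,         # do not split a node with fewer samples than this
    min_impurity_decrease=0.01,   # do not split unless impurity drops by at least this
    random_state=0,
)
tree.fit(X_train, y_train)
print("test accuracy:", tree.score(X_test, y_test))
```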
Post-Pruning
In post-pruning, we first go deeper and deeper into the tree to build a complete tree. If the tree shows the overfitting problem, pruning is then done as a post-processing step. We use cross-validation data to check the effect of our pruning: using the cross-validation data, we test whether expanding a node makes an improvement or not.
If it shows an improvement, then we can continue expanding that node.
But if it shows a reduction in accuracy, the node should not be expanded, i.e., it should be converted to a leaf node.
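One concrete way to post-prune is cost-complexity pruning, which scikit-learn exposes through ccp_alpha. The sketch below grows a full tree, generates candidate pruned trees, and keeps the one that does best on held-out data (a single validation split is used here in place of full cross-validation):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

# First grow a complete (unpruned) tree, then get candidate pruning strengths.
full_tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
path = full_tree.cost_complexity_pruning_path(X_train, y_train)

# Evaluate each pruned tree on held-out data and keep the best: branches whose
# removal does not reduce (or even improves) accuracy end up pruned away.
best_alpha, best_score = 0.0, -1.0
for alpha in path.ccp_alphas:
    pruned = DecisionTreeClassifier(random_state=0, ccp_alpha=alpha).fit(X_train, y_train)
    score = pruned.score(X_val, y_val)
    if score >= best_score:
        best_alpha, best_score = alpha, score

print("chosen ccp_alpha:", best_alpha, "validation accuracy:", best_score)
```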