MODULE III
DECISION TREE
A decision tree is a flowchart-like tree structure, where
▪ each internal node denotes a test on an attribute,
▪ each branch represents an outcome of the test,
▪ each leaf node (terminal node) holds a class label.
A decision tree is a hierarchical model for supervised learning
whereby the local region is identified in a sequence of recursive splits
in a smaller number of steps.
▪ A decision tree is composed of internal decision nodes and
terminal leaves.
▪ Each decision node m implements a test function fm(x) with
discrete outcomes labeling the branches.
▪ Given an input, at each node, a test is applied and one of the
branches is taken depending on the outcome.
▪ This process starts at the root and is repeated recursively until a
leaf node is hit, at which point the value written in the leaf
constitutes the output.
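As a quick illustration, the traversal just described can be sketched in a few lines of Python. The node classes, attribute names and the example tree below are illustrative assumptions (the tree happens to match the Play Tennis tree derived later in this module), not part of any particular library:

from dataclasses import dataclass
from typing import Callable, Dict, Union

@dataclass
class Leaf:
    label: str                           # the value written in the leaf

@dataclass
class DecisionNode:
    test: Callable[[dict], str]          # test function f_m(x) with discrete outcomes
    branches: Dict[str, Union["DecisionNode", Leaf]]

def predict(node, x):
    """Start at the root, apply the test at each node, follow the branch
    for its outcome, and return the label of the leaf that is hit."""
    while isinstance(node, DecisionNode):
        node = node.branches[node.test(x)]
    return node.label

# Illustrative tree over hypothetical attributes:
root = DecisionNode(
    test=lambda x: x["outlook"],
    branches={
        "overcast": Leaf("yes"),
        "sunny": DecisionNode(lambda x: x["humidity"],
                              {"high": Leaf("no"), "normal": Leaf("yes")}),
        "rain": DecisionNode(lambda x: x["wind"],
                             {"weak": Leaf("yes"), "strong": Leaf("no")}),
    },
)
print(predict(root, {"outlook": "sunny", "humidity": "normal"}))  # -> yes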
Feature Selection Method
If a dataset consists of n attributes, then deciding which attribute is to
be placed at the root or at different levels of the tree as internal nodes
is a complicated problem.
▪ The most important problem in implementing the decision tree
algorithm is deciding which feature should be tested at the root node
and which at each subsequent level.
Popular feature selection measures are
•Information gain
•Gini index
•Gain Ratio
Entropy
The degree to which a subset of examples contains only a single class
is known as purity, and any subset composed of only a single class is
called a pure class.
▪ Informally, entropy is a measure of “impurity” in a dataset.
▪ Entropy is measured in bits.
▪ If there are only two possible classes, entropy values can range
from 0 to 1.
▪ For n classes, entropy ranges from 0 to log2(n).
▪ In each case, the minimum value indicates that the sample is
completely homogeneous, while the maximum value indicates
that the data are as diverse as possible.
Entropy is a measure of the randomness in the information being
processed.
▪ The higher the entropy, the harder it is to draw any conclusions
from that information.
▪ Consider a segment S of a dataset having c class labels.
▪ Let pi be the proportion of examples in S having the ith class
label.
▪ The entropy of S is defined as
Entropy(S) = − Σ (i = 1 to c) pi log2(pi)
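A minimal Python sketch of this definition (the function name and the example label lists are illustrative):

import math
from collections import Counter

def entropy(labels):
    """Entropy (in bits) of a segment, given its list of class labels."""
    n = len(labels)
    return sum(-count / n * math.log2(count / n)
               for count in Counter(labels).values())

print(entropy(["yes", "yes", "no", "no"]))    # 1.0  (two classes, 50/50 split)
print(entropy(["yes"] * 9 + ["no"] * 5))      # ≈ 0.940 (9 "yes" / 5 "no")
print(entropy(["yes", "yes", "yes", "yes"]))  # 0.0  (a pure segment)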
Information Gain
▪ Information gain tells how important a given attribute of the
feature vectors is.
▪ Used to decide the ordering of attributes in the nodes of a
decision tree.
▪ Let S be a set of examples, A be a feature (or attribute), Sv be the
subset of S with A = v, and Values(A) be the set of all possible values of A.
▪ Then the information gain of an attribute A relative to the set S,
denoted by Gain(S, A), is defined as
Gain(S, A) = Entropy(S) − Σ (v ∈ Values(A)) (|Sv| / |S|) Entropy(Sv)
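A direct Python translation of this formula, reusing the entropy() sketch above; the dictionary-based representation of examples is an assumption made for illustration:

def information_gain(examples, labels, attribute):
    """Gain(S, A), where examples is a list of dicts {attribute name: value},
    labels the corresponding class labels, and attribute the name of A.
    Reuses entropy() from the earlier sketch."""
    n = len(labels)
    gain = entropy(labels)
    for v in set(x[attribute] for x in examples):
        sv_labels = [y for x, y in zip(examples, labels) if x[attribute] == v]
        gain -= len(sv_labels) / n * entropy(sv_labels)   # |Sv|/|S| * Entropy(Sv)
    return gain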
▪ Example of Information Gain
▪ Consider the previous data for target concept “Play Tennis”.
▪ Calculation of Gain(S, outlook):
▪ The values of the attribute “outlook” are “sunny”, “overcast”
and “rain”.
▪ Calculate Entropy(Sv) for v = sunny, v = overcast and v = rain.
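The calculation can be checked numerically. The class counts below assume the standard 14-example Play Tennis data (9 “yes” and 5 “no” overall; sunny: 2 yes / 3 no, overcast: 4 yes / 0 no, rain: 3 yes / 2 no):

import math

def H(pos, neg):
    """Entropy of a segment with pos positive and neg negative examples."""
    total = pos + neg
    return sum(-c / total * math.log2(c / total) for c in (pos, neg) if c > 0)

entropy_S = H(9, 5)                                  # ≈ 0.940
weighted  = (5/14) * H(2, 3) + (4/14) * H(4, 0) + (5/14) * H(3, 2)
print(entropy_S - weighted)                          # ≈ 0.247 = Gain(S, outlook)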
ID3 Algorithm
▪ Algorithm used to generate a decision tree.
▪ The ID3 algorithm was invented by Ross Quinlan.
▪ ID3 follows the Occam’s razor principle: it attempts to create the
smallest possible decision tree.
Step 1: Create a root node for the tree.
Step 2: Note that not all examples are positive (class label “yes”), not
all examples are negative (class label “no”), and the number of
features is not zero. (If all examples shared one label, or no features
remained, the node would simply become a leaf.)
Step 3: Decide which feature is to be placed at the root node.
For this, calculate the information gains corresponding to each of the
four features.
Step 4: Find the highest information gain, which is the maximum
among Gain(S, outlook), Gain(S, temperature), Gain(S, humidity)
and Gain(S, wind).
•Highest information gain = max{0.2469, 0.0293, 0.151, 0.048} =
0.2469
•This corresponds to the feature “outlook”.
•Therefore, place “outlook” at the root node.
Step 5: Split the root node on “outlook”, giving branches for “sunny”,
“overcast” and “rain”. For the branch “sunny”, let S(1) = S_outlook=sunny.
The highest information gain among Gain(S(1), temperature),
Gain(S(1), humidity) and Gain(S(1), wind) is Gain(S(1), humidity), so split
this node on “humidity”; the branches “high” and “normal” lead to Node4
and Node5 in the figure.
Step 6: All the examples in the dataset corresponding to Node4 in the
figure have the same class label “no”, and all the examples
corresponding to Node5 have the same class label “yes”.
•So represent Node4 as a leaf node with value “no” and Node5 as a
leaf node with value “yes”.
•Similarly, all the examples corresponding to Node2 have the same
class label “yes”. So convert Node2 into a leaf node with value “yes”.
•Finally, let S(2) = S_outlook=rain. The highest information gain among
Gain(S(2), temperature), Gain(S(2), humidity) and
Gain(S(2), wind) = max{0.02, 0.02, 0.9710} is Gain(S(2), wind), so split
this node on “wind”.
•The branches resulting from this split, corresponding to the values
“weak” and “strong” of “wind”, lead to leaf nodes with class labels
“yes” and “no”.
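Steps 1–6 can be summarised as a short recursive function. The sketch below reuses the entropy() and information_gain() helpers from the earlier sketches and the same assumed dictionary representation of examples; it is an illustrative sketch, not a reference implementation of Quinlan’s ID3:

from collections import Counter

def id3(examples, labels, attributes):
    """Return a nested-dict decision tree (or a class label for a leaf)."""
    # Base case 1: all examples share one label -> leaf with that label.
    if len(set(labels)) == 1:
        return labels[0]
    # Base case 2: no attributes left -> leaf with the majority label.
    if not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Recursive case: split on the attribute with the highest information gain.
    best = max(attributes, key=lambda a: information_gain(examples, labels, a))
    tree = {best: {}}
    for v in set(x[best] for x in examples):
        sub = [(x, y) for x, y in zip(examples, labels) if x[best] == v]
        sub_x, sub_y = [x for x, _ in sub], [y for _, y in sub]
        tree[best][v] = id3(sub_x, sub_y, [a for a in attributes if a != best])
    return tree

# On the Play Tennis data this produces the tree derived above:
# {"outlook": {"overcast": "yes",
#              "sunny": {"humidity": {"high": "no", "normal": "yes"}},
#              "rain":  {"wind": {"weak": "yes", "strong": "no"}}}}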
Gini Index
•Consider a data set S having r class labels c1, …, cr.
•Let pi be the proportion of examples having the class label ci.
•The Gini index of the data set S, denoted by Gini(S), is defined by
Gini(S) = 1 − Σ (i = 1 to r) pi²
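A minimal Python sketch of the Gini index (names are illustrative):

from collections import Counter

def gini(labels):
    """Gini(S) = 1 - sum of pi**2 over the class proportions pi."""
    n = len(labels)
    return 1 - sum((count / n) ** 2 for count in Counter(labels).values())

print(gini(["yes", "yes", "no", "no"]))   # 0.5 (maximum impurity for 2 classes)
print(gini(["yes", "yes", "yes"]))        # 0.0 (a pure data set)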
Gain Ratio
Let S be a set of examples, A be a feature having c different values,
and let the set of values of A be denoted by Values(A).
•The information gain of A relative to S, denoted by Gain(S, A), is
Gain(S, A) = Entropy(S) − Σ (v ∈ Values(A)) (|Sv| / |S|) Entropy(Sv)
•The split information of A relative to S, denoted by
SplitInformation(S, A), is
SplitInformation(S, A) = − Σ (i = 1 to c) (|Si| / |S|) log2(|Si| / |S|)
•where S1, …, Sc are the c subsets of examples resulting from
partitioning S by the c values of the attribute A.
The gain ratio of A relative to S is defined as
GainRatio(S, A) = Gain(S, A) / SplitInformation(S, A)
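Putting the three formulas together in Python, again reusing the earlier entropy() and information_gain() sketches and the same assumed example representation:

import math
from collections import Counter

def split_information(examples, attribute):
    """SplitInformation(S, A) = - sum over i of |Si|/|S| * log2(|Si|/|S|)."""
    n = len(examples)
    counts = Counter(x[attribute] for x in examples)
    return sum(-c / n * math.log2(c / n) for c in counts.values())

def gain_ratio(examples, labels, attribute):
    """GainRatio(S, A) = Gain(S, A) / SplitInformation(S, A)."""
    si = split_information(examples, attribute)
    if si == 0:                      # A takes only a single value on S
        return 0.0
    return information_gain(examples, labels, attribute) / si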
Surrogate Splits
▪ Surrogate splits are used to classify test samples having missing
values.
▪ A surrogate split mimics the outcome of the actual (primary) split
using a different attribute, so the split can still be applied when the
primary attribute value is missing.
▪ In effect, another decision rule is created to predict which branch of
the actual split an example would take.
▪ The number of surrogates that can be used depends on the training
data
Ensemble Methods
Ensemble methods combine several decision trees to produce
better predictive performance than a single decision tree. The
main principle behind the ensemble model is that a group of weak
learners come together to form a strong learner.
Techniques to build ensembles of decision trees:
1. Bagging
2. Boosting
Bagging:
▪ Is used when our goal is to reduce the variance of a decision
tree.
▪ Each tree is built from a subset of the training dataset obtained by
randomly sampling the training dataset with replacement.
▪ This sampling technique is called bootstrapping, and the overall
procedure (bootstrap aggregating) is called bagging.
▪ When training the decision trees on the bootstrapped samples, the
goal is to build very deep trees that overfit their own sample. Each
such tree has low bias but very high variance. Because multiple
trees are combined for the final prediction in the ensemble, the
errors of the individual overfitting trees average out: the result is a
model that does not overfit to any one sample, so the ensemble
keeps the low bias of the deep trees while having much lower
variance. So, while each tree closely models its own data sample,
the combined ensemble is a more powerful and more robust model.
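A minimal bagging sketch using scikit-learn (assumed to be available); the default base learner of BaggingClassifier is a decision tree, and the iris data is only a stand-in for any training set:

from sklearn.datasets import load_iris
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# 50 trees, each fitted on a bootstrap sample drawn with replacement,
# and combined by majority vote at prediction time.
bagging = BaggingClassifier(n_estimators=50, bootstrap=True, random_state=0)
print(cross_val_score(bagging, X, y, cv=5).mean())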
Boosting:
▪ Boosting means that each tree is dependent on prior trees.
▪ The algorithm learns by fitting the residual of the trees that
preceded it.
▪ Thus, boosting in a decision tree ensemble tends to improve
accuracy with some small risk of less coverage.
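A minimal boosting sketch using scikit-learn’s GradientBoostingClassifier (assumed to be available), whose shallow trees are each fit to the residual errors of the trees that preceded them, as described above; the iris data is again only a placeholder:

from sklearn.datasets import load_iris
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# Trees are added sequentially; each new shallow tree corrects the
# residual errors of the current ensemble.
boosting = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1,
                                      max_depth=3, random_state=0)
print(cross_val_score(boosting, X, y, cv=5).mean())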