1. Decision Trees
A Decision Tree is a machine learning model that is represented as a flowchart-like structure, where:
Internal nodes represent a decision or test on an attribute.
Branches represent the outcome of the decision or test.
Leaf nodes represent the result or output, such as a class label (in classification) or a value (in
regression).
The tree is structured in a way that helps decision-making based on the features (attributes) of the data.
The flow from the root to the leaf nodes provides a decision rule that helps predict the class or value for
a given set of features.
How Decision Trees Work:
1. Start at the Root Node: The root node represents the entire dataset. We begin by selecting a
feature (attribute) that best splits the data into different classes or outcomes. This split is
determined by specific criteria like Gini Impurity, Information Gain, or Variance Reduction.
2. Split Data Based on Features: At each internal node, the dataset is split based on the feature
that provides the best separation between classes or predicts the target value the best.
3. Continue Splitting: This process continues recursively at each internal node until we reach the
leaf nodes. These leaf nodes hold the final decision (class label for classification or value for
regression).
4. Make a Prediction: For new, unseen data, the prediction is made by following the tree structure
from the root to a leaf node, applying the decisions (tests) along the way.
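The four steps above can be sketched in a few lines of Python. The sketch below is only an illustration and assumes scikit-learn is available; the built-in iris dataset stands in for "the data".

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Steps 1-3: fit() performs the recursive splitting, here with the Gini
# criterion and a depth limit so the tree stays small.
clf = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=0)
clf.fit(X, y)

# Step 4: predict() routes a new, unseen sample from the root to a leaf.
print(clf.predict([[5.1, 3.5, 1.4, 0.2]]))  # a setosa-like sample -> class 0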
Decision Rules:
Definition: A decision rule is a simple "if-then" condition derived from the decision tree.
Example: Consider a decision tree for classifying whether someone will buy a product based on
their age and income:
o If Age ≤ 30 and Income > 50,000, then "Buy Product" (Class 1).
o If Age > 30 and Income ≤ 50,000, then "Don't Buy Product" (Class 0).
These rules are extracted from the paths leading to the leaf nodes in the decision tree.
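One convenient way to obtain such rules is to print every root-to-leaf path of a fitted tree, for example with scikit-learn's export_text. The toy age/income data below is invented, so the printed thresholds will not match the two example rules verbatim.

from sklearn.tree import DecisionTreeClassifier, export_text

# Invented toy data: [Age, Income]; 1 = "Buy Product", 0 = "Don't Buy Product".
X = [[25, 60000], [45, 40000], [30, 52000], [50, 90000], [23, 20000]]
y = [1, 0, 1, 0, 0]

clf = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)

# Each printed root-to-leaf path corresponds to one if-then decision rule.
print(export_text(clf, feature_names=["Age", "Income"]))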
Example of Decision Tree for Classification:
Let’s consider a small example to illustrate how a decision tree works for classification:
Problem:
Classify whether a person will play tennis based on the weather conditions (Outlook, Temperature,
Humidity, Wind).
Attributes: Outlook (Sunny, Overcast, Rain), Temperature (Hot, Mild, Cool), Humidity (High,
Low), Wind (Weak, Strong)
Target/Label: PlayTennis (Yes, No)
Dataset:
Outlook    Temperature  Humidity  Wind    PlayTennis
Sunny      Hot          High      Weak    No
Sunny      Hot          High      Strong  No
Overcast   Hot          High      Weak    Yes
Rain       Mild         High      Weak    Yes
Rain       Cool         Low       Weak    Yes
Rain       Cool         Low       Strong  No
Overcast   Cool         Low       Strong  Yes
Sunny      Mild         High      Weak    No
Sunny      Cool         Low       Weak    Yes
Rain       Mild         Low       Weak    Yes
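For readers who want to follow along in code, the same table can be written down directly. The sketch below assumes pandas, used here only as a convenient container for the rows.

import pandas as pd

df = pd.DataFrame(
    [["Sunny", "Hot", "High", "Weak", "No"],
     ["Sunny", "Hot", "High", "Strong", "No"],
     ["Overcast", "Hot", "High", "Weak", "Yes"],
     ["Rain", "Mild", "High", "Weak", "Yes"],
     ["Rain", "Cool", "Low", "Weak", "Yes"],
     ["Rain", "Cool", "Low", "Strong", "No"],
     ["Overcast", "Cool", "Low", "Strong", "Yes"],
     ["Sunny", "Mild", "High", "Weak", "No"],
     ["Sunny", "Cool", "Low", "Weak", "Yes"],
     ["Rain", "Mild", "Low", "Weak", "Yes"]],
    columns=["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"],
)

# How the label splits across Outlook values (relevant when choosing the root below).
print(df.groupby("Outlook")["PlayTennis"].value_counts())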
Building the Decision Tree:
1. Step 1: Select the Root Node: The root node is selected using the feature that best splits the
data, judged by a criterion such as Information Gain. After calculating the Information Gain for
each attribute, we find that Outlook is the best feature to split on, as it has the highest
Information Gain (a small calculation sketch follows this list).
2. Step 2: Split Data: The tree branches into three based on the possible values of Outlook (Sunny,
Overcast, Rain).
3. Step 3: Continue Splitting: Now, for each of these branches, we further split based on the next
best feature (say, Humidity or Wind).
o For Sunny, the tree might split based on Humidity: If Humidity = High, predict "No"
(Leaf node), otherwise "Yes".
o For Rain, the tree might split based on Wind: If Wind = Weak, predict "Yes" (Leaf node),
otherwise "No".
4. Step 4: Reach Leaf Nodes: The decision tree will keep splitting until it reaches leaf nodes with a
predicted label.
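The Information Gain calculation mentioned in Step 1 can be sketched in plain Python, using only the Outlook and PlayTennis columns of the 10-row table above (no libraries assumed).

from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

outlook = ["Sunny", "Sunny", "Overcast", "Rain", "Rain",
           "Rain", "Overcast", "Sunny", "Sunny", "Rain"]
play    = ["No", "No", "Yes", "Yes", "Yes",
           "No", "Yes", "No", "Yes", "Yes"]

base = entropy(play)
remainder = sum(
    (outlook.count(v) / len(play))
    * entropy([p for o, p in zip(outlook, play) if o == v])
    for v in set(outlook)
)
print("Information Gain(Outlook) =", base - remainder)

For this 10-row table the printed gain comes out to roughly 0.32, which is larger than the gain obtained for Temperature, Humidity, or Wind, and that is why Outlook ends up at the root.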
Decision Tree Diagram:
Below is a simplified decision tree for the above example.
                    Outlook
                  /    |    \
             Sunny  Overcast  Rain
               |        |       |
           Humidity    Yes     Wind
            /    \             /    \
         High    Low        Weak   Strong
           |      |           |       |
           No    Yes         Yes      No
Explanation of the Tree:
1. Root Node: The first decision is based on Outlook.
o If Outlook is Overcast, predict Yes (PlayTennis).
o If Outlook is Sunny, we move to the next test: Humidity.
If Humidity is High, predict No.
If Humidity is Low, predict Yes.
o If Outlook is Rain, the next test is Wind.
If Wind is Weak, predict Yes.
If Wind is Strong, predict No.
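The same tree can be written out as plain if-then code. The sketch below is only a restatement of the diagram above, not a general algorithm.

def play_tennis(outlook, humidity, wind):
    # Root test: Outlook.
    if outlook == "Overcast":
        return "Yes"
    if outlook == "Sunny":                      # next test: Humidity
        return "Yes" if humidity == "Low" else "No"
    if outlook == "Rain":                       # next test: Wind
        return "Yes" if wind == "Weak" else "No"
    raise ValueError("unknown Outlook value")

print(play_tennis("Sunny", "High", "Weak"))    # No  (matches row 1 of the table)
print(play_tennis("Rain",  "Low",  "Strong"))  # No  (matches row 6 of the table)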
Advantages:
Easy to Interpret: The model is visual and intuitive, making it easy to explain to non-experts.
Minimal Data Preparation: Little preprocessing is required (e.g., no normalization or
scaling is needed).
Disadvantages:
Overfitting: Decision trees can easily overfit to training data, especially with deep trees.
Instability: Small changes in the data can result in a completely different tree.
Bias toward Dominant Classes: Decision trees can be biased if the dataset is imbalanced.
2. Generating Decision Trees
To construct a Decision Tree:
1. Choose the Best Attribute:
o Use measures like Information Gain or the Gini Index to identify the attribute that
splits the data most effectively.
2. Recursively Split Data:
o Apply the splitting process to each subset until the stopping criteria are met.
3. Assign Labels or Predictions:
o At the leaf nodes, assign the majority class label (for classification) or the average
value (for regression).
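A minimal sketch of this recursive procedure, in the style of ID3 (categorical attributes, Information Gain, majority label at the leaves), is shown below. It is written for clarity rather than efficiency and is one possible realisation of the three steps above, not the only one.

from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    n = len(labels)
    remainder = sum(
        (sum(1 for r in rows if r[attr] == v) / n)
        * entropy([lab for r, lab in zip(rows, labels) if r[attr] == v])
        for v in {r[attr] for r in rows}
    )
    return entropy(labels) - remainder

def build_tree(rows, labels, attrs):
    # Stopping criteria: pure node, or no attributes left -> leaf with majority label.
    if len(set(labels)) == 1 or not attrs:
        return Counter(labels).most_common(1)[0][0]
    # Step 1: choose the attribute with the highest Information Gain.
    best = max(attrs, key=lambda a: information_gain(rows, labels, a))
    # Step 2: split the data on each value of that attribute and recurse.
    tree = {best: {}}
    for v in {r[best] for r in rows}:
        idx = [i for i, r in enumerate(rows) if r[best] == v]
        tree[best][v] = build_tree([rows[i] for i in idx],
                                   [labels[i] for i in idx],
                                   [a for a in attrs if a != best])
    return tree

# Applied to the PlayTennis table from Section 1 (Temperature dropped purely
# for brevity), this reproduces the hand-drawn tree: Outlook at the root,
# Humidity under Sunny, Wind under Rain (key order in the printed dict may vary).
cols = ("Outlook", "Humidity", "Wind")
raw = [("Sunny", "High", "Weak", "No"),     ("Sunny", "High", "Strong", "No"),
       ("Overcast", "High", "Weak", "Yes"), ("Rain", "High", "Weak", "Yes"),
       ("Rain", "Low", "Weak", "Yes"),      ("Rain", "Low", "Strong", "No"),
       ("Overcast", "Low", "Strong", "Yes"),("Sunny", "High", "Weak", "No"),
       ("Sunny", "Low", "Weak", "Yes"),     ("Rain", "Low", "Weak", "Yes")]
rows = [dict(zip(cols, r[:3])) for r in raw]
labels = [r[3] for r in raw]
print(build_tree(rows, labels, list(cols)))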
3. Pruning Decision Trees
Pruning is the process of reducing the size of a decision tree to prevent overfitting and improve
generalization.
Pre-Pruning: Stop the tree's growth early based on conditions like maximum depth or
minimum data at a node.
Post-Pruning: Grow the entire tree and then remove branches that do not improve
performance on a validation set.
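Both styles can be tried with scikit-learn. The sketch below is illustrative: the bundled breast-cancer dataset is only a stand-in, max_depth and min_samples_leaf serve as pre-pruning, and cost-complexity pruning (ccp_alpha) selected on a validation split serves as post-pruning.

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

# Pre-pruning: limit growth up front with depth / leaf-size constraints.
pre = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5, random_state=0).fit(X_tr, y_tr)

# Post-pruning: grow a full tree, compute the cost-complexity pruning path,
# then keep the pruned tree that scores best on the validation split.
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(X_tr, y_tr)
candidates = [
    DecisionTreeClassifier(random_state=0, ccp_alpha=a).fit(X_tr, y_tr)
    for a in path.ccp_alphas if a >= 0.0
]
post = max(candidates, key=lambda t: t.score(X_val, y_val))

print("pre-pruned depth :", pre.get_depth(),  "val accuracy:", pre.score(X_val, y_val))
print("post-pruned depth:", post.get_depth(), "val accuracy:", post.score(X_val, y_val))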
4. Decision Rules
Decision Rules are IF-THEN conditions derived from Decision Trees. For example, a rule might
look like:
IF age > 30 AND income > 50K THEN approve loan.
These rules provide a straightforward way to represent the tree’s logic, offering interpretability
and flexibility in practical applications.
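The example rule above maps directly onto ordinary code, which is part of what makes rule representations convenient. The function and field names below are illustrative only.

# The rule "IF age > 30 AND income > 50K THEN approve loan" as a predicate.
def approve_loan(age, income):
    return age > 30 and income > 50_000

print(approve_loan(45, 80_000))  # True  -> approve
print(approve_loan(25, 80_000))  # False -> do not approve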
5. Limitations of Decision Trees and Rules
1. Overfitting:
o Decision Trees can grow excessively, capturing noise in the training data.
o Pruning helps mitigate this but may lead to underfitting if over-pruned.
2. Bias Towards Dominant Features:
o Trees can favor features with many levels (e.g., ID numbers) or numeric features
with high variance.
3. Instability:
o Small changes in the training data can lead to entirely different tree structures (a short demonstration follows this list).
4. Performance on Complex Relationships:
o Decision Trees struggle with datasets where features interact in complex, non-linear ways.
5. Scalability:
o For very large datasets, tree construction can become computationally expensive.
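The instability point (item 3) is easy to observe empirically: trees fit on two bootstrap resamples of the same data usually differ in structure. The sketch below is only a demonstration, with iris as a stand-in dataset.

import numpy as np
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
rng = np.random.default_rng(0)

# Two bootstrap resamples of the same dataset.
idx_a = rng.choice(len(X), size=len(X), replace=True)
idx_b = rng.choice(len(X), size=len(X), replace=True)

tree_a = DecisionTreeClassifier(random_state=0).fit(X[idx_a], y[idx_a])
tree_b = DecisionTreeClassifier(random_state=0).fit(X[idx_b], y[idx_b])

# The two learned structures typically differ, even though both samples
# come from the same underlying data.
print(export_text(tree_a) == export_text(tree_b))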