
Decision Tree

A decision tree is a supervised machine learning algorithm used for both classification
and regression tasks. It works by splitting the data into subsets based on the value of
input features, creating a tree-like structure of decisions. Each internal node represents a
decision based on a feature, each branch represents the outcome of that decision, and
each leaf node represents a final outcome (class label or continuous value).
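
To make this structure concrete, here is a minimal sketch of how such a tree could be represented in Python. The Node class, its field names, and the predict helper are illustrative choices, not part of any particular library:

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

@dataclass
class Node:
    """One node of a decision tree (illustrative sketch)."""
    feature: Optional[str] = None        # feature tested at this node (None for a leaf)
    threshold: Optional[float] = None    # threshold for numerical splits, e.g. Age < 25
    branches: Dict[Any, "Node"] = field(default_factory=dict)  # outcome -> child node
    prediction: Optional[Any] = None     # class label or value stored at a leaf

def predict(node: Node, sample: Dict[str, Any]) -> Any:
    """Traverse the tree from the root to a leaf for one sample."""
    while node.prediction is None:                 # still at an internal node
        value = sample[node.feature]
        if node.threshold is not None:             # numerical split: branch on True/False
            value = value < node.threshold
        node = node.branches[value]                # follow the branch for this outcome
    return node.prediction                         # leaf node holds the final outcome
```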

Key Components of a Decision Tree:

1. Root Node: The topmost node that represents the entire dataset.
2. Internal Nodes: Nodes that split the data based on a feature.
3. Leaf Nodes: Terminal nodes that provide the final decision or prediction.
4. Branches: Paths from the root to the leaves, representing decision rules.

How a Decision Tree Works:

1. Feature Selection: The algorithm selects the best feature to split the data based
on criteria like Gini impurity, information gain, or variance reduction.
2. Splitting: The dataset is divided into subsets based on the selected feature.
3. Recursion: The process is repeated for each subset until a stopping condition is
met (e.g., maximum depth, minimum samples per leaf).
4. Prediction: For a new data point, the tree is traversed from the root to a leaf
node to make a prediction (a minimal scikit-learn sketch follows this list).
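
In practice these steps are handled by a library. The following is a minimal sketch, assuming scikit-learn is installed, using its built-in iris dataset:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

# Load a built-in toy dataset and hold out part of it for evaluation.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# criterion="entropy" (information gain) or "gini" matches the criteria described above.
clf = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=0)
clf.fit(X_train, y_train)            # feature selection, splitting, and recursion happen here

print(clf.score(X_test, y_test))     # accuracy on unseen data
print(export_text(clf, feature_names=load_iris().feature_names))  # readable decision rules
```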

1. How Decision Trees are Built

Decision trees are constructed using algorithms that aim to find the most informative feature to
split the data at each node. Two common approaches are described below; both splitting measures
are sketched in code after the list:

 ID3 (Iterative Dichotomiser 3): This algorithm uses entropy and information gain to
select the best feature for splitting.
o Entropy: Measures the impurity or randomness of a set of data. A set with equal
proportions of different classes has maximum entropy, while a set containing only
one class has zero entropy.
o Information Gain: Measures the reduction in entropy achieved by splitting the
data on a particular feature. The feature with the highest information gain is
chosen for the split.
 CART (Classification and Regression Trees): This algorithm uses the Gini index to
select the best feature for splitting.
o Gini Index: Measures the impurity of a set of data, similar to entropy. A lower
Gini index indicates higher purity.
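
Both splitting measures can be written in a few lines. The following is a minimal NumPy sketch; the function names are illustrative:

```python
import numpy as np

def entropy(labels):
    """Entropy: -sum(p * log2(p)) over the classes present in `labels`."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def gini(labels):
    """Gini index: 1 - sum(p^2) over the classes present in `labels`."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def information_gain(labels, feature_values):
    """Entropy reduction obtained by splitting `labels` on one feature's values."""
    weighted = 0.0
    for v in np.unique(feature_values):
        subset = labels[feature_values == v]
        weighted += (len(subset) / len(labels)) * entropy(subset)
    return entropy(labels) - weighted

labels = np.array(["Yes"] * 9 + ["No"] * 5)      # 9 Yes / 5 No, as in the example later on
print(round(float(entropy(labels)), 2))          # about 0.94
print(round(float(gini(labels)), 2))             # about 0.46
```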

2. Splitting Criteria

 Numerical Features: For numerical features, the splitting condition usually involves a
threshold. For example, "Age < 25?" splits the data into two groups: those younger than
25 and those 25 or older.
 Categorical Features: For categorical features, the splitting condition can be based on
the values of the feature. For example, "Favorite Genre = Action?" splits the data into
groups based on their favorite genre (both split types are sketched in code after this list).
o Categorical data is a type of data that consists of labels or categories rather than
numerical values. It represents qualitative characteristics of an object or event.

Example:

o Colors: {Red, Blue, Green}
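
As a small illustration of the two split types above, here is a pandas sketch; the toy data and column names simply mirror the examples:

```python
import pandas as pd

# Toy data mirroring the examples above ("Age < 25?" and "Favorite Genre = Action?").
df = pd.DataFrame({
    "Age": [19, 31, 24, 45, 22],
    "Favorite Genre": ["Action", "Drama", "Action", "Comedy", "Drama"],
})

# Numerical split: a threshold divides the rows into two groups.
younger = df[df["Age"] < 25]
older = df[df["Age"] >= 25]

# Categorical split: rows are grouped by the feature's values.
action = df[df["Favorite Genre"] == "Action"]
by_genre = {genre: group for genre, group in df.groupby("Favorite Genre")}

print(len(younger), len(older), len(action), list(by_genre))
```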

3. Overfitting and Pruning

 Overfitting: Decision trees can become very complex and capture noise in the data,
leading to poor performance on unseen data. This is called overfitting.
 Pruning: To avoid overfitting, we can prune the tree by removing branches or nodes that
do not contribute significantly to the prediction accuracy. Pruning can be done by
limiting the depth of the tree, setting a minimum number of samples required at a node,
or using statistical measures to evaluate the importance of branches (a scikit-learn
sketch of these limits follows this list).
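
In practice these limits are usually set through the tree's hyperparameters. A minimal scikit-learn sketch (the parameter values are arbitrary):

```python
from sklearn.tree import DecisionTreeClassifier

# Pre-pruning: stop growing the tree early by limiting its size.
pre_pruned = DecisionTreeClassifier(
    max_depth=4,            # limit the depth of the tree
    min_samples_leaf=5,     # minimum number of samples required at a leaf
    min_samples_split=10,   # minimum number of samples required to split a node
)

# Post-pruning: grow a full tree, then cut it back with cost-complexity pruning.
post_pruned = DecisionTreeClassifier(ccp_alpha=0.01)
```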

4. Handling Different Data Types

 Categorical Data: Decision trees can handle categorical data directly by creating
branches for each category.
 Numerical Data: Numerical data can be used directly or discretized into categories. For
example, age can be divided into age groups like "young," "middle-aged," and "old"
(a short discretization/encoding sketch follows this list).
 Missing Values: Decision trees can handle missing values by assigning them to the most
likely branch or creating a separate branch for missing values.
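
A short pandas sketch of the discretization and encoding mentioned above; the bin edges, labels, and column names are arbitrary, and treating missing values as their own category is just one option:

```python
import pandas as pd

df = pd.DataFrame({
    "Age": [19, 34, 52, 41, 67],
    "Color": ["Red", "Blue", "Green", "Red", None],   # categorical, with one missing value
})

# Discretize a numerical feature into categories.
df["AgeGroup"] = pd.cut(df["Age"], bins=[0, 30, 55, 120],
                        labels=["young", "middle-aged", "old"])

# Encode a categorical feature for libraries that expect numerical input.
encoded = pd.get_dummies(df["Color"], prefix="Color", dummy_na=True)

# One simple treatment of missing values: give them their own category.
df["Color"] = df["Color"].fillna("Missing")

print(df)
print(encoded)
```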

5. Advantages and Disadvantages (Expanded)

 Advantages:
o Interpretability: Decision trees are easy to understand and visualize, making
them useful for explaining decisions.
o Versatility: They can handle both classification and regression tasks, as well as
different data types.
o Minimal Data Preprocessing: Decision trees require less data preprocessing
compared to some other machine learning algorithms.
 Disadvantages:
o Overfitting: Decision trees are prone to overfitting, especially when they are very
complex.
o Instability: Small changes in the data can lead to significant changes in the tree
structure.
o Bias: Decision trees can be biased towards features with more levels or
categories.

6. Applications (Expanded)

 Customer Relationship Management (CRM): Predicting customer churn, identifying
potential customers, and personalizing marketing campaigns.
 Risk Assessment: Assessing credit risk, predicting loan defaults, and evaluating
insurance applications.
 Medical Diagnosis: Diagnosing diseases based on symptoms and medical history,
predicting patient outcomes, and personalizing treatment plans.
 Fraud Detection: Identifying fraudulent transactions in financial systems and detecting
suspicious activities on online platforms.

7. Ensemble Methods

To improve the performance and robustness of decision trees, ensemble methods can be used.
These methods combine multiple decision trees to make predictions. Two popular ensemble
methods are listed below, with a short scikit-learn sketch after the list:

 Random Forests: Create multiple decision trees on different subsets of the data and
combine their predictions through averaging or voting.
 Gradient Boosting: Build trees sequentially, where each tree tries to correct the errors of
the previous trees.
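
A minimal scikit-learn sketch of both methods on one of its built-in datasets (the hyperparameter values are arbitrary):

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Random Forest: many trees on bootstrap samples, predictions combined by voting.
rf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)

# Gradient Boosting: trees built sequentially, each correcting the previous ones' errors.
gb = GradientBoostingClassifier(n_estimators=200, learning_rate=0.1,
                                random_state=0).fit(X_train, y_train)

print("Random Forest:", rf.score(X_test, y_test))
print("Gradient Boosting:", gb.score(X_test, y_test))
```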

Types of Decision Trees:

 Classification Trees: Predict categories (e.g., "likes movie" or "dislikes movie").
 Regression Trees: Predict continuous values (e.g., the price of a house).

ID3 Algorithm (Iterative Dichotomiser 3) - Step by Step

The ID3 (Iterative Dichotomiser 3) algorithm is a decision tree learning algorithm developed
by Ross Quinlan and used for classification tasks. It builds a tree by selecting the attribute
with the highest Information Gain at each step, using Entropy and Information Gain to determine
the best attribute to split the data. Information Gain measures how much a feature reduces the
uncertainty (entropy) in the dataset.

Note: ID3 is a foundational algorithm. More advanced decision tree algorithms such as C4.5 and
CART address some of its limitations (for example, handling numerical attributes and controlling
overfitting through pruning).

Step-by-Step Explanation of ID3 Algorithm

1. Start with the Entire Dataset:
   o Begin with the complete dataset and all available features.
2. Calculate the Entropy of the Target Attribute:
   o Entropy measures the impurity or uncertainty in the dataset. For a binary
     classification problem with a proportion p+ of positive and p- of negative
     examples, entropy is calculated as:

     Entropy(S) = -(p+) log2(p+) - (p-) log2(p-)

3. Calculate Information Gain for Each Feature:
   o Information gain measures how much a feature reduces the entropy. It is
     calculated as:

     IG(S, A) = Entropy(S) - Σ_v (|S_v| / |S|) × Entropy(S_v)

     where S_v is the subset of S in which attribute A takes value v.
4. Select the Feature with the Highest Information Gain:
   o Choose the feature that maximizes information gain as the splitting criterion.
5. Split the Dataset:
   o Split the dataset into subsets based on the selected feature's values.
6. Repeat the Process Recursively:
   o Repeat steps 2–5 for each subset until:
      All instances in a subset belong to the same class (no further splitting needed).
      No more features are left to split on.
      A predefined stopping condition is met (e.g., maximum tree depth).
7. Create the Decision Tree:
   o The splits form the internal nodes of the tree, and the leaf nodes represent
     the class labels.

A compact Python sketch of this procedure follows.
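
The sketch below implements the procedure for purely categorical data, assuming each example is a dictionary mapping attribute names to values. The names id3, entropy, and information_gain are illustrative; a production implementation would add pruning and other refinements:

```python
import math
from collections import Counter

def entropy(rows, target):
    """Entropy of the target attribute over a list of example dictionaries."""
    counts = Counter(row[target] for row in rows)
    n = len(rows)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def information_gain(rows, attr, target):
    """Reduction in entropy obtained by splitting `rows` on attribute `attr`."""
    remainder = 0.0
    for value in {row[attr] for row in rows}:
        subset = [row for row in rows if row[attr] == value]
        remainder += (len(subset) / len(rows)) * entropy(subset, target)
    return entropy(rows, target) - remainder

def id3(rows, attributes, target):
    """Recursively build a decision tree as nested dictionaries."""
    classes = {row[target] for row in rows}
    if len(classes) == 1:                    # all instances share one class -> leaf
        return classes.pop()
    if not attributes:                       # no features left -> majority-class leaf
        return Counter(row[target] for row in rows).most_common(1)[0][0]
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    tree = {best: {}}
    for value in {row[best] for row in rows}:              # split on the chosen attribute
        subset = [row for row in rows if row[best] == value]
        remaining = [a for a in attributes if a != best]
        tree[best][value] = id3(subset, remaining, target)  # recurse on each subset
    return tree
```

Calling id3 on the example dataset in the next section should return a nested dictionary whose keys are the chosen attributes and whose leaves are the class labels.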

Example Problem
We will use the ID3 Algorithm to build a decision tree to determine if a person will play tennis
based on these features:

Outlook   Temperature  Humidity  Windy  Play Tennis?
Sunny     Hot          High      False  No
Sunny     Hot          High      True   No
Overcast  Hot          High      False  Yes
Rainy     Mild         High      False  Yes
Rainy     Cool         Normal    False  Yes
Rainy     Cool         Normal    True   No
Overcast  Cool         Normal    True   Yes
Sunny     Mild         High      False  No
Sunny     Cool         Normal    False  Yes
Rainy     Mild         Normal    False  Yes
Sunny     Mild         Normal    True   Yes
Overcast  Mild         High      True   Yes
Overcast  Hot          Normal    False  Yes
Rainy     Mild         High      True   No

Compute Entropy for the Target Variable (Play Tennis?)

The dataset contains 9 "Yes" and 5 "No" examples, so:

Entropy(S) = -(9/14) log2(9/14) - (5/14) log2(5/14) ≈ 0.94

So, the entropy of the dataset is 0.94.

Compute Information Gain for Each Attribute


We now calculate the Information Gain for Outlook, Temperature, Humidity, and Windy.

Information Gain for "Outlook"

Outlook   Total  Play Tennis: Yes  Play Tennis: No
Sunny     5      2                 3
Overcast  4      4                 0
Rainy     5      3                 2

First, compute the entropy for each value of Outlook:

Entropy(Sunny) = -(2/5) log2(2/5) - (3/5) log2(3/5) ≈ 0.97
Entropy(Overcast) = 0 (all four examples are "Yes")
Entropy(Rainy) = -(3/5) log2(3/5) - (2/5) log2(2/5) ≈ 0.97

Now, compute the Information Gain (IG) for Outlook:

IG(Outlook) = 0.94 - ((5/14 × 0.97) + (4/14 × 0) + (5/14 × 0.97))
IG(Outlook) = 0.94 - (0.35 + 0 + 0.35)
IG(Outlook) = 0.24

Similarly, we compute the IG for Temperature, Humidity, and Windy and choose the highest one
(a short code sketch that reproduces these values follows the list).

Information Gains (IGs)

 Outlook: 0.247
 Temperature: 0.029
 Humidity: 0.151
 Windy: 0.048
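
These values can be checked with a short, self-contained script. The following is a sketch; the printed numbers should match the ones above up to rounding:

```python
import math
from collections import Counter

# The Play Tennis dataset from the table above.
data = [
    ("Sunny", "Hot", "High", False, "No"),      ("Sunny", "Hot", "High", True, "No"),
    ("Overcast", "Hot", "High", False, "Yes"),  ("Rainy", "Mild", "High", False, "Yes"),
    ("Rainy", "Cool", "Normal", False, "Yes"),  ("Rainy", "Cool", "Normal", True, "No"),
    ("Overcast", "Cool", "Normal", True, "Yes"),("Sunny", "Mild", "High", False, "No"),
    ("Sunny", "Cool", "Normal", False, "Yes"),  ("Rainy", "Mild", "Normal", False, "Yes"),
    ("Sunny", "Mild", "Normal", True, "Yes"),   ("Overcast", "Mild", "High", True, "Yes"),
    ("Overcast", "Hot", "Normal", False, "Yes"),("Rainy", "Mild", "High", True, "No"),
]
columns = ["Outlook", "Temperature", "Humidity", "Windy"]

def entropy(labels):
    """Entropy of a list of class labels."""
    counts = Counter(labels)
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

target = [row[-1] for row in data]
print("Entropy(S) =", round(entropy(target), 3))       # expected: about 0.94

for i, name in enumerate(columns):
    remainder = 0.0
    for value in {row[i] for row in data}:             # weighted entropy of each subset
        subset = [row[-1] for row in data if row[i] == value]
        remainder += (len(subset) / len(data)) * entropy(subset)
    print(f"IG({name}) =", round(entropy(target) - remainder, 3))
```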

Select the Feature with the Highest Information Gain:

Since Outlook has the highest Information Gain, we split on it.

Recursively Build the Decision Tree:

 For the Overcast subset (all "Yes"), create a leaf node labeled "Yes".
 For the Sunny and Rainy subsets, repeat the process to find the best feature to
split on.

Final Decision Tree

 If Overcast, Play Tennis = Yes.
 If Sunny, check Humidity:
o If High, Play Tennis = No.
o If Normal, Play Tennis = Yes.
 If Rainy, check Windy:
o If False, Play Tennis = Yes.
o If True, Play Tennis = No.

Making Predictions

Now we can use the tree to classify new data (a small Python version of these rules follows
the examples).

Example:

 Outlook = Rainy, Windy = False → Play Tennis = Yes
 Outlook = Sunny, Humidity = High → Play Tennis = No
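
The final tree can also be written directly as a small Python function. This is a sketch; predict_play_tennis is an illustrative name:

```python
def predict_play_tennis(outlook, humidity=None, windy=None):
    """Apply the decision tree derived above to one example."""
    if outlook == "Overcast":
        return "Yes"
    if outlook == "Sunny":
        return "No" if humidity == "High" else "Yes"
    if outlook == "Rainy":
        return "No" if windy else "Yes"
    raise ValueError(f"Unknown outlook: {outlook}")

print(predict_play_tennis("Rainy", windy=False))        # Yes
print(predict_play_tennis("Sunny", humidity="High"))    # No
```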

Conclusion
The ID3 algorithm:

1. Calculates Entropy for the dataset.
2. Finds Information Gain for each attribute.
3. Selects the attribute with the highest IG as the root node.
4. Recursively splits the dataset until all nodes are pure or other stopping criteria are met.
