Decision trees are a popular machine learning algorithm for both classification and regression tasks.
They
work by recursively splitting the dataset into subsets based on feature values, creating a tree-like
structure of decisions that leads to predictions. Here’s an overview of decision trees and some
commonly used algorithms:
1. Basic Concept of Decision Trees
• Nodes: Each node represents a feature (or attribute) in the dataset.
• Edges: Each branch from a node represents a decision based on that feature’s value.
• Leaf Nodes: Represent the final output (class or value) after all decisions have been made.
• Root Node: The topmost node in a tree, representing the initial feature or question.
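To make the structure concrete, here is a minimal sketch of how such a tree could be represented; the `Node` class and `predict` helper are illustrative names chosen for this example, not part of any particular library.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

@dataclass
class Node:
    """A decision tree node: internal nodes test a feature, leaf nodes hold a prediction."""
    feature: Optional[str] = None                              # feature tested here (None at a leaf)
    children: Dict[Any, "Node"] = field(default_factory=dict)  # edge: feature value -> child subtree
    prediction: Any = None                                      # class label (or value) at a leaf

def predict(node: Node, sample: Dict[str, Any]) -> Any:
    """Walk from the root, following the edge that matches the sample's feature value."""
    while node.feature is not None:
        node = node.children[sample[node.feature]]
    return node.prediction

# A tiny hand-built tree: the root tests "raining", its two edges lead to leaves.
root = Node(feature="raining", children={
    "yes": Node(prediction="stay home"),
    "no":  Node(prediction="go outside"),
})
print(predict(root, {"raining": "yes"}))  # -> stay home
```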
2. Decision Tree Algorithms
a) ID3 (Iterative Dichotomiser 3)
• Developed by Ross Quinlan, ID3 is one of the earliest algorithms.
• Criterion: Uses information gain to decide which feature to split on, favoring splits that yield the greatest reduction in entropy (a short sketch of this calculation follows this list).
• Limitations: Prone to overfitting and cannot handle continuous numeric features directly without modification.
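The information-gain criterion can be computed directly: the entropy of a label set is -Σ p_i · log2(p_i) over the class proportions, and the gain of a split is the parent's entropy minus the weighted entropy of the resulting subsets. The sketch below is illustrative; the function names and toy data are not from any particular library.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (base 2) of a collection of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(labels, feature_values):
    """Reduction in entropy achieved by splitting the labels on a feature's values."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    weighted = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - weighted

# Toy example: how much does knowing "windy" tell us about the class?
labels = ["yes", "yes", "no", "no", "yes", "no"]
windy  = ["false", "true", "true", "true", "false", "true"]
print(information_gain(labels, windy))  # ~0.46 bits
```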
b) C4.5
• An extension of ID3, also developed by Quinlan.
• Criterion: Uses gain ratio (information gain normalized by split information) as the splitting criterion, and handles both continuous and categorical features better than ID3 (see the sketch after this list).
• Pruning: Implements pruning to reduce overfitting.
• Handling of Missing Values: C4.5 can handle datasets with missing values more effectively than
ID3.
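The gain ratio divides information gain by the "split information", i.e. the entropy of the branch proportions, which penalizes features that fragment the data into many small branches. A minimal sketch, reusing an information-gain value like the one computed above (names and numbers are illustrative):

```python
import math
from collections import Counter

def split_information(feature_values):
    """Entropy of the branch proportions produced by splitting on a feature."""
    n = len(feature_values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(feature_values).values())

def gain_ratio(info_gain, feature_values):
    """C4.5-style gain ratio: information gain normalized by split information."""
    si = split_information(feature_values)
    return info_gain / si if si > 0 else 0.0

# A feature with many distinct values has high split information,
# so its gain ratio is lower even when its raw information gain looks large.
print(gain_ratio(0.46, ["false", "true", "true", "true", "false", "true"]))  # ~0.50
```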
c) CART (Classification and Regression Trees)
• Developed by Leo Breiman, CART is widely used in both classification and regression.
• Criterion: For classification, CART uses Gini impurity as the splitting criterion; for regression, it uses mean squared error (MSE). A runnable example follows this list.
• Binary Splits Only: CART splits the data into exactly two branches at each node, creating binary
trees.
• Pruning: Prunes trees based on a cost-complexity parameter to manage overfitting.
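For a runnable illustration, scikit-learn's DecisionTreeClassifier implements an optimized CART-style learner; the dataset and hyperparameter values below are arbitrary choices for the sketch.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Gini impurity is the default split criterion; ccp_alpha > 0 enables
# cost-complexity pruning (larger values prune more aggressively).
clf = DecisionTreeClassifier(criterion="gini", ccp_alpha=0.01, random_state=0)
clf.fit(X, y)

print(clf.get_depth())     # depth of the (possibly pruned) tree
print(clf.predict(X[:3]))  # predicted classes for the first three samples
```

For regression, DecisionTreeRegressor plays the analogous role with a squared-error criterion.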
d) CHAID (Chi-Square Automatic Interaction Detector)
• CHAID is used for categorical data and is based on the chi-square test.
• Criterion: Uses statistical significance tests (chi-square for classification, ANOVA F-tests for regression) to determine splits, as sketched after this list.
• Multifurcating Splits: Unlike CART, CHAID can create branches with multiple splits from a single
node.
• Use Cases: Often used for market research and survey analysis.
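The statistical-test idea can be sketched with scipy: for one candidate categorical split, form the contingency table of feature value versus class and test independence. The tiny dataset below is made up for illustration, and a full CHAID implementation additionally merges similar categories and applies multiple-testing corrections.

```python
import pandas as pd
from scipy.stats import chi2_contingency

# Made-up survey-style data: does "region" help predict "churn"?
df = pd.DataFrame({
    "region": ["north", "north", "south", "south", "east", "east", "east"],
    "churn":  ["yes",   "no",    "no",    "no",    "yes",  "yes",  "no"],
})

# CHAID-style check of one candidate split: a small p-value from the
# chi-square test of independence favors splitting on "region".
table = pd.crosstab(df["region"], df["churn"])
chi2, p_value, dof, _ = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, p = {p_value:.3f}, dof = {dof}")
```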
3. Advantages of Decision Trees
• Interpretability: Easy to understand and visualize, even for non-experts.
• Non-linearity: Can model non-linear relationships.
• Little Data Preprocessing: Typically requires minimal data preparation; normalization and scaling are usually unnecessary.
4. Limitations of Decision Trees
• Overfitting: Decision trees can easily overfit, especially when grown very deep (a common mitigation is sketched after this list).
• Instability: Sensitive to small changes in the data, which can lead to vastly different trees (high variance).
• Preference for Certain Features: Splits based on information gain tend to favor features with many distinct levels.
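As a sketch of the overfitting mitigation mentioned above (hyperparameter values are arbitrary), tree growth can be constrained by capping depth and requiring a minimum number of samples per leaf:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Constraining growth (maximum depth, minimum leaf size) trades a little
# training accuracy for better generalization than a fully grown tree.
shallow = DecisionTreeClassifier(max_depth=4, min_samples_leaf=5, random_state=0)
print(cross_val_score(shallow, X, y, cv=5).mean())
```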
5. Applications of Decision Trees
• Classification tasks (e.g., spam detection, customer churn prediction)
• Regression tasks (e.g., predicting housing prices)
• Feature selection
To build a decision tree by hand, we calculate entropy and information gain on a small dataset, use them to choose the splits, and then make a prediction for a given input.
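The full walkthrough is in the article linked below; as a stand-in, the following is a minimal ID3-style sketch (the dataset, feature names, and helper functions are made up for illustration): compute the information gain of each feature, split on the best one, recurse, and finally classify a new input.

```python
import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(rows, labels, feature):
    """Entropy reduction from splitting the rows on one feature."""
    n = len(labels)
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[feature], []).append(label)
    return entropy(labels) - sum(len(g) / n * entropy(g) for g in groups.values())

def build_tree(rows, labels, features):
    """Recursive ID3-style construction: returns a nested dict, or a class label at a leaf."""
    if len(set(labels)) == 1 or not features:
        return Counter(labels).most_common(1)[0][0]  # leaf: majority class
    best = max(features, key=lambda f: info_gain(rows, labels, f))
    tree = {best: {}}
    for value in {row[best] for row in rows}:
        subset = [(r, l) for r, l in zip(rows, labels) if r[best] == value]
        sub_rows, sub_labels = zip(*subset)
        remaining = [f for f in features if f != best]
        tree[best][value] = build_tree(list(sub_rows), list(sub_labels), remaining)
    return tree

def classify(tree, sample):
    """Follow the nested dicts using the sample's feature values until a label is reached."""
    while isinstance(tree, dict):
        feature = next(iter(tree))
        tree = tree[feature][sample[feature]]
    return tree

# Toy weather-style dataset (made up for illustration).
rows = [
    {"outlook": "sunny",    "windy": "false"},
    {"outlook": "sunny",    "windy": "true"},
    {"outlook": "rainy",    "windy": "true"},
    {"outlook": "overcast", "windy": "false"},
    {"outlook": "rainy",    "windy": "false"},
]
labels = ["no", "no", "no", "yes", "yes"]

tree = build_tree(rows, labels, ["outlook", "windy"])
print(tree)
print(classify(tree, {"outlook": "sunny", "windy": "false"}))  # -> no
```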
Details:
https://towardsdatascience.com/decision-tree-in-machine-learning-e380942a4c96