ISOM3360 Data Mining for Business Analytics
Decision Tree Learning
Instructor: Yi Yang
Department of ISOM
Spring 2023
q Last lecture
q Data preparation
q This Lecture
q Decision tree
2
Data Mining Process
3
Supervised Learning
q Classification is used to predict which class
(discrete value) a data point belongs to
q Fraud detection, customer churn prediction
q Regression is used to predict a continuous value.
q Stock price prediction, housing price prediction
4
Training vs. Testing
1. Learn the model on all data, evaluate on parts of data
2. Split data into two parts, learn the model on one part, and
evaluate on the other part.
Recall that an ML model is used to make predictions on
unseen data.
5
Training vs. Testing
1. Learn the model on all data, evaluate on parts of data
2. Split data into two parts, learn the model on one part, and
evaluate on the other part.
Supervised learning rule of thumb:
Never ever use testing data to learn your model.
6
Supervised Learning data split
q training set—a subset to train a model.
q test set—a subset to test the trained model.
q Large enough to yield statistically meaningful results.
q Representative of the data set as a whole. In other words,
don't pick a test set with different characteristics than the
training set.
7
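A minimal sketch of such a split (assuming Python with scikit-learn and a toy dataset; the slides do not prescribe a tool, ratio, or data):

```python
# Minimal sketch of a train/test split with scikit-learn on a toy dataset
# (the tool, the 70/30 ratio, and the data are illustrative choices, not
# prescribed by the slides).
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(20).reshape(10, 2)                  # toy feature matrix: 10 examples, 2 features
y = np.array([0, 1, 0, 1, 0, 1, 0, 1, 0, 1])      # toy binary labels

X_train, X_test, y_train, y_test = train_test_split(
    X, y,
    test_size=0.3,     # hold out 30% of the data for evaluation
    random_state=42,   # fixed seed so the split is reproducible
    stratify=y,        # keep class proportions similar in both subsets
)
# Rule of thumb from the slides: learn the model on (X_train, y_train) only;
# use (X_test, y_test) solely to evaluate the trained model.
```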
Decision Tree
q Decision trees are one of the most popular data mining
tools.
q Decision Trees are easy to understand, implement and
use, and computationally cheap.
q “It is probably the machine learning workhorse most
widely used in practice to date.”
q Model comprehensibility is important for communicating
with non-DM-savvy stakeholders.
8
Decision Tree
Employed (categorical)   Balance (continuous)   Age   Default
Yes                      123,000                50    No
No                       51,100                 40    Yes
No                       68,000                 55    No
Yes                      34,000                 46    Yes
Yes                      50,000                 44    No
No                       100,000                50    Yes
Yes                      70,000                 35    ?????
Predicting customers who will default on loan payments
9
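For illustration, a sketch of fitting a decision tree to this table and scoring the unlabeled customer. Caveat: scikit-learn's DecisionTreeClassifier implements CART (binary splits, Gini impurity by default), not the ID3 procedure covered later in this lecture, so its tree may differ from the one built by hand; the 1/0 encoding of Employed is my own choice.

```python
# Sketch: fit a decision tree to the loan-default table above and score the
# unlabeled customer. scikit-learn's DecisionTreeClassifier implements CART,
# not ID3, so the learned tree may differ from the lecture's hand-built one.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

train = pd.DataFrame({
    "Employed": [1, 0, 0, 1, 1, 0],     # Yes=1, No=0 (this encoding is my own choice)
    "Balance":  [123_000, 51_100, 68_000, 34_000, 50_000, 100_000],
    "Age":      [50, 40, 55, 46, 44, 50],
    "Default":  ["No", "Yes", "No", "Yes", "No", "Yes"],
})

tree = DecisionTreeClassifier(criterion="entropy", random_state=0)
tree.fit(train[["Employed", "Balance", "Age"]], train["Default"])

# The '?????' row from the table: Employed=Yes, Balance=70,000, Age=35.
new_customer = pd.DataFrame({"Employed": [1], "Balance": [70_000], "Age": [35]})
print(tree.predict(new_customer))   # the tree's prediction for that customer
```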
Decision tree
v An upside-down if-else tree; start with all the training data at the root node.
v Each node has an if-else condition about one feature. (Which feature?)
v The dataset is split into subsets based on the condition.
v The root node contains all training examples.
v A leaf node contains a subset of the training examples.
v (Optional) Numerical features are discretized.
[Figure: example tree with root, internal, and leaf nodes labeled. The root node tests Employed (Yes / No); one branch tests Balance (>=50K / <50K) and then Age (>=45 / <45); each leaf node predicts Class = Default or Class = Not Default.]
10
The essence of Decision Tree
q The essence of supervised learning (prediction) is to find
features that are informative and have high predictive power.
q Decision tree methods iteratively select a feature so that,
after splitting, the resulting subsets become more
pure/homogeneous. In other words, the selected feature
has high predictive power.
q Information gain is one way to measure informativeness.
11
Entropy
q Entropy H(S) is a measure of the amount of
uncertainty/impurity in the dataset S (i.e. entropy
characterizes the dataset S). It measures chaos.
H(S) = - Σ_x p(x) log2 p(x), where p(x) is the proportion of class x
in the data S.
q E.g. a dataset is composed of 16 cases of class
“Positive” and 14 cases of class “Negative”:
Entropy (dataset) = -(16/30) log2(16/30) - (14/30) log2(14/30) ≈ 0.997
12
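A minimal Python sketch (my own) of this entropy calculation, reproducing the 0.997 value for the 16-Positive / 14-Negative dataset:

```python
# Entropy of a dataset given its per-class counts, as on the slide above.
from math import log2

def entropy(counts):
    """H(S) = -sum_x p(x) * log2 p(x); zero-count classes are skipped (0 log2 0 = 0)."""
    total = sum(counts)
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)

print(round(entropy([16, 14]), 3))  # 0.997 -- matches the slide's example
print(entropy([15, 15]))            # 1.0  -- a 50/50 dataset is maximally impure (two classes)
print(entropy([30, 0]))             # -0.0, i.e. zero -- a pure dataset has no impurity
```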
Entropy Exercise
Tip: 0 log2 0 = 0
13
Let’s play a game. I have someone in mind, and your job is
to guess this person. You can only ask yes/no questions.
This person is a HK celebrity.
Go!
14
Information Gain
q The information gain is based on the decrease in entropy
after a dataset is split on a feature.
IG(S, A) = H(S) - Σ_{t ∈ T} p(t) H(t)
(the subtracted term is the weighted average of subset entropy)
• H(S) – Entropy of set S
• T – The subsets created from splitting set S by feature A
• p(t) – The proportion of subset t to set S
• H(t) – Entropy of subset t
[Figure: dataset S split on feature A ("has credit card??") into Yes and No subsets]
15
Information Gain Example
Entropy before splitting = 0.997
A1: has credit card??
  Split into Yes / No subsets; weighted subset entropy after the split = 0.615
  IG = 0.997 - 0.615 = 0.382
A2: is student??
  Split into Yes / No subsets; weighted subset entropy after the split = 0.779
  IG = 0.997 - 0.779 = 0.218
(Individual subset entropies shown in the figure: 0.837 and 0.722.)
16
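The slide does not list the class counts inside each Yes/No subset, so its exact numbers are not reproduced here; instead, an illustrative sketch applying the same information-gain formula to the six labeled customers from the earlier loan-default table, split on Employed:

```python
# Sketch of IG(S, A) = H(S) - sum over subsets t of p(t) * H(t), applied to the
# six labeled customers from the earlier loan-default table, split on Employed.
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(labels, feature_values):
    """Information gain from splitting `labels` by a categorical feature."""
    n = len(labels)
    after = 0.0
    for v in set(feature_values):
        subset = [y for y, x in zip(labels, feature_values) if x == v]
        after += (len(subset) / n) * entropy(subset)   # weighted average of subset entropy
    return entropy(labels) - after

default  = ["No", "Yes", "No", "Yes", "No", "Yes"]    # class label, one per customer
employed = ["Yes", "No", "No", "Yes", "Yes", "No"]    # candidate split feature

print(round(information_gain(default, employed), 3))  # 0.082 on this tiny table
```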
Information Gain Exercise
[Figure: the same dataset split by two candidate features, A1 (left) and A2 (right), each into Yes / No subsets, shown before and after the split.]
Without calculation, which split (left or right) gives the highest
information gain? Which feature (A1 or A2) do we prefer?
17
Decision Tree: ID3
ID3: Only for classification, only handles
categorical features.
q Step 1: Calculate the information gain of every feature
q Step 2: Split the set S into subsets using the feature for
which the information gain is maximum
q Step 3: Make a decision tree node containing that feature,
divide the dataset by its branches and repeat the same
process on every branch.
q Step 4a: A branch with entropy of 0 is a leaf node.
q Step 4b: A branch with entropy more than 0 needs further
splitting.
18
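A compact, illustrative Python sketch of steps 1-4 for categorical features (my own toy implementation, not course code; the example data at the bottom is hypothetical):

```python
# Illustrative ID3 sketch for categorical features only, following steps 1-4
# above. A teaching toy, not production code.
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, feature):
    n = len(labels)
    after = 0.0
    for value in {row[feature] for row in rows}:
        subset = [y for row, y in zip(rows, labels) if row[feature] == value]
        after += (len(subset) / n) * entropy(subset)   # weighted subset entropy
    return entropy(labels) - after                     # Step 1: IG of this feature

def id3(rows, labels, features):
    # Step 4a: a branch with entropy 0 (all labels identical) becomes a leaf;
    # if no features are left, fall back to the majority class.
    if entropy(labels) == 0 or not features:
        return Counter(labels).most_common(1)[0][0]
    # Step 2: split on the feature with maximum information gain.
    best = max(features, key=lambda f: information_gain(rows, labels, f))
    # Step 3: make a node for that feature and repeat on every branch.
    tree = {best: {}}
    for value in {row[best] for row in rows}:
        keep = [i for i, row in enumerate(rows) if row[best] == value]
        # Step 4b: branches with entropy > 0 are split further (recursion).
        tree[best][value] = id3([rows[i] for i in keep],
                                [labels[i] for i in keep],
                                [f for f in features if f != best])
    return tree

# Tiny usage example on hypothetical categorical data (not from the slides).
rows = [
    {"Outlook": "Sunny", "Wind": "Weak"},
    {"Outlook": "Sunny", "Wind": "Strong"},
    {"Outlook": "Rain",  "Wind": "Weak"},
    {"Outlook": "Rain",  "Wind": "Strong"},
]
labels = ["No", "No", "Yes", "No"]
print(id3(rows, labels, ["Outlook", "Wind"]))
```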
Outlook: Sunny, Overcast, Rain
Temp: Hot, Mild, Cool
Humidity: High, Normal
Wind: Weak, Strong
Decision: Yes (9), No (5)
19
q On board demonstration
20
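As a starting point for the demonstration: with the class counts from the previous slide (9 Yes, 5 No, 14 examples in total), the entropy of the whole dataset before any split is
Entropy(S) = -(9/14) log2(9/14) - (5/14) log2(5/14) ≈ 0.940,
and the information gain of each candidate root feature (Outlook, Temp, Humidity, Wind) is measured against this baseline.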
Recap: The essence of Decision Tree
q The essence of supervised learning (prediction) is to find
features that are informative and have high predictive power.
q Decision tree methods select a feature so that, after
splitting, the resulting subsets become more homogeneous. In
other words, the selected feature has high predictive power.
q Information gain is one way to measure informativeness.
21