CH 5 Classification and regression
Supervised learning
Data: inputs (features) and outputs (labels), $\{\boldsymbol{x}_i, y_i\}_{i=1}^{N}$
Model: a function $f(\boldsymbol{x}; \boldsymbol{w})$ with parameters $\boldsymbol{w}$ that maps input to output
Cost function: a dissimilarity measure $d(y, f(\boldsymbol{x}; \boldsymbol{w}))$ between observation and prediction, used to determine whether a model is good or bad
Types of supervised learning: Regression (continuous $y$), Classification (discrete $y$)
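A minimal sketch of these three components on a toy regression problem (NumPy; the names `f` and `d` are only illustrative, not from the notes):

```python
import numpy as np

# Data: features x_i and labels y_i
X = np.array([[1.0], [2.0], [3.0], [4.0]])   # inputs (features)
y = np.array([2.1, 3.9, 6.2, 8.1])           # outputs (labels); continuous -> regression

# Model: f(x; w) maps input to output using parameters w
def f(x, w):
    return x @ w

# Cost function: d(y, f(x; w)) measures dissimilarity between observation and prediction
def d(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)   # e.g. mean squared error

w = np.array([2.0])
print(d(y, f(X, w)))   # a small cost means this w describes the data well
```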
Classification
Logistic regression
$\boldsymbol{\pi} = \dfrac{\exp(X\boldsymbol{w})}{1 + \exp(X\boldsymbol{w})} = \dfrac{1}{1 + \exp(-X\boldsymbol{w})}$ (elementwise), where $\pi_i = P(y_i = 1 \mid \boldsymbol{x}_i)$
$\boldsymbol{w}$ can be found using the MLE approach: maximize the likelihood function $\prod_i \pi_i^{y_i}(1 - \pi_i)^{1 - y_i}$, or equivalently its logarithm
$\max_{\boldsymbol{w}} \sum_i y_i \ln \pi_i + (1 - y_i)\ln(1 - \pi_i)$
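The maximization has no closed form; a minimal sketch of finding $\boldsymbol{w}$ by gradient ascent on the log-likelihood (NumPy assumed; the design matrix's first column is all ones for the intercept, and `fit_logistic` is an illustrative name):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, lr=0.1, n_iter=5000):
    """Maximize sum_i y_i ln(pi_i) + (1 - y_i) ln(1 - pi_i) by gradient ascent."""
    w = np.zeros(X.shape[1])
    for _ in range(n_iter):
        pi = sigmoid(X @ w)          # pi_i = P(y_i = 1 | x_i)
        grad = X.T @ (y - pi)        # gradient of the log-likelihood w.r.t. w
        w += lr * grad / len(y)
    return w

# Toy data: one feature plus an intercept column
X = np.column_stack([np.ones(6), [1, 2, 3, 4, 5, 6]])
y = np.array([0, 0, 0, 1, 1, 1])
w = fit_logistic(X, y)
print(sigmoid(X @ w))                # predicted probabilities P(y_i = 1 | x_i)
```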
Classification trees
If the stopping criterion is met (e.g. the node contains only one type of element): add a leaf node that assigns every observation to the most prevalent class.
If the criterion is not met: partition the data into subsets. Ask a number of questions, partition the data accordingly, and select the question with the greatest purity gain.
Purity gain
A binary split creates 3 partitions: the root $r$ and the left and right branches $v_1$, $v_2$.
For each partition, the impurity $I(r)$, $I(v_1)$, $I(v_2)$ is computed. The impurity measure can be one of the following:
$\mathrm{Entropy}(v) = -\sum_{c=1}^{C} p(c|v)\log_2 p(c|v)$
$\mathrm{Gini}(v) = 1 - \sum_{c=1}^{C} p(c|v)^2$
$\mathrm{ClassError}(v) = 1 - \max_c\, p(c|v)$
where $p(c|v) = \dfrac{\text{no. of observations of class } c \text{ in branch } v}{N(v)}$
Purity gain is the weighted reduction in impurity
$\Delta = I(r) - \sum_k \dfrac{N(v_k)}{N(r)} I(v_k)$
Example
                 v1       v2       Root
P(Mammal)        0.2      0.6      0.333
P(Non-mammal)    0.8      0.4      0.667

                 v1       v2       Root
Entropy          0.7219   0.9710   0.9183
Gini             0.3200   0.4800   0.4444
Class Error      0.2000   0.4000   0.3333

The root class distribution implies $N(v_1) = 2N(v_2)$, so the branch weights are $N(v_1)/N(r) = 2/3$ and $N(v_2)/N(r) = 1/3$.

                 Entropy  Gini     Class Error
Purity gain Δ    0.1134   0.0711   0.0667
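A short numerical check of these tables (a minimal sketch using NumPy; the branch weights 2/3 and 1/3 follow from the root distribution as noted above):

```python
import numpy as np

p_v1, p_v2, p_root = [0.2, 0.8], [0.6, 0.4], [1/3, 2/3]
w1, w2 = 2/3, 1/3                      # N(v1)/N(r) and N(v2)/N(r)

def entropy(p): return -sum(q * np.log2(q) for q in p if q > 0)
def gini(p):    return 1 - sum(q ** 2 for q in p)
def cerr(p):    return 1 - max(p)

for name, I in [("Entropy", entropy), ("Gini", gini), ("Class error", cerr)]:
    delta = I(p_root) - w1 * I(p_v1) - w2 * I(p_v2)   # purity gain
    print(f"{name}: I(v1)={I(p_v1):.4f}  I(v2)={I(p_v2):.4f}  "
          f"I(r)={I(p_root):.4f}  gain={delta:.4f}")
```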
Controlling tree complexity
Stop splitting when a branch contains fewer than a specified number of observations.
Stop splitting if a certain depth of the tree is reached.
Stop splitting if purity gain ∆ for the best split is below a certain value.
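A minimal sketch of how tree growing and these stopping criteria fit together (assuming NumPy, integer class labels, Gini impurity, and single-feature threshold splits as the candidate questions; `grow_tree` and its parameter names are illustrative, not a library API):

```python
import numpy as np
from collections import Counter

def gini(y):
    """Gini impurity of a vector of integer class labels."""
    p = np.bincount(y) / len(y)
    return 1.0 - np.sum(p ** 2)

def grow_tree(X, y, depth=0, max_depth=3, min_samples=5, min_gain=1e-3):
    # Stopping criteria: pure node, too few observations, or maximum depth reached
    if len(np.unique(y)) == 1 or len(y) < min_samples or depth >= max_depth:
        return {"leaf": True, "label": Counter(y.tolist()).most_common(1)[0][0]}

    # Try every "question" (feature j, threshold t) and keep the best purity gain
    best, I_r = None, gini(y)
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j]):
            left = X[:, j] <= t
            if left.all() or not left.any():
                continue
            gain = I_r - (left.mean() * gini(y[left]) + (~left).mean() * gini(y[~left]))
            if best is None or gain > best[0]:
                best = (gain, j, t, left)

    # Stopping criterion: best split's purity gain is below min_gain
    if best is None or best[0] < min_gain:
        return {"leaf": True, "label": Counter(y.tolist()).most_common(1)[0][0]}

    gain, j, t, left = best
    return {"leaf": False, "feature": j, "threshold": t,
            "left":  grow_tree(X[left],  y[left],  depth + 1, max_depth, min_samples, min_gain),
            "right": grow_tree(X[~left], y[~left], depth + 1, max_depth, min_samples, min_gain)}

# Toy usage: one feature, two classes
X = np.array([[2.0], [3.0], [10.0], [11.0], [12.0], [1.0]])
y = np.array([0, 0, 1, 1, 1, 0])
print(grow_tree(X, y, min_samples=2))
```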
Model evaluation
Confusion matrix
                    Predicted positive      Predicted negative
Actually positive   True positive (TP)      False negative (FN)
Actually negative   False positive (FP)     True negative (TN)
$\text{Accuracy} = \dfrac{TP + TN}{N}$, $\text{error rate} = \dfrac{FP + FN}{N} = 1 - \text{Accuracy}$
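A minimal sketch of filling the confusion matrix and computing accuracy from predicted and actual labels (NumPy; binary 0/1 labels and toy values assumed):

```python
import numpy as np

y_true = np.array([1, 1, 1, 0, 0, 0, 0, 1])   # actual labels
y_pred = np.array([1, 0, 1, 0, 0, 1, 0, 1])   # predicted labels

tp = np.sum((y_true == 1) & (y_pred == 1))    # true positives
fn = np.sum((y_true == 1) & (y_pred == 0))    # false negatives
fp = np.sum((y_true == 0) & (y_pred == 1))    # false positives
tn = np.sum((y_true == 0) & (y_pred == 0))    # true negatives

accuracy = (tp + tn) / len(y_true)
error_rate = (fp + fn) / len(y_true)          # equals 1 - accuracy
print(tp, fn, fp, tn, accuracy, error_rate)
```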
K-nearest neighbor
Choose the number of neighbors K and a measure of distance.
When performing classification: 1) compute the distance to all other data objects → 2) find the K nearest data objects → 3) classify according to the majority class of the neighbors.
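A minimal sketch of these steps (NumPy, with Euclidean distance and a majority vote assumed; `knn_classify` is an illustrative name):

```python
import numpy as np
from collections import Counter

def knn_classify(x_new, X_train, y_train, k=3):
    # 1) Compute the distance from x_new to all other data objects
    dists = np.linalg.norm(X_train - x_new, axis=1)
    # 2) Find the K nearest data objects
    nearest = np.argsort(dists)[:k]
    # 3) Classify according to the majority of the neighbors
    return Counter(y_train[nearest].tolist()).most_common(1)[0][0]

X_train = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.1], [1.1, 0.9]])
y_train = np.array([0, 0, 1, 1, 1])
print(knn_classify(np.array([0.8, 0.8]), X_train, y_train, k=3))   # -> 1
```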
Nearest neighbor decision surface
Regression
Linear model for regression
$\vec{Y} = X\vec{w} + \vec{\varepsilon}$, where the least-squares estimate is $\hat{\vec{w}} = (X^T X)^{-1} X^T \vec{y}$
Figures: regression line in a 1-dimensional feature space; regression plane in a 2-dimensional feature space.
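A minimal sketch of fitting the linear model with the closed-form estimate $\hat{\vec{w}} = (X^T X)^{-1} X^T \vec{y}$ (NumPy; a column of ones is prepended for the intercept, and `np.linalg.solve` is used instead of forming the inverse explicitly):

```python
import numpy as np

x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = np.array([1.1, 2.9, 5.2, 7.1, 8.8])

X = np.column_stack([np.ones_like(x), x])        # design matrix with intercept column
w_hat = np.linalg.solve(X.T @ X, X.T @ y)        # solves (X^T X) w = X^T y
print(w_hat)                                     # [intercept, slope]
print(X @ w_hat)                                 # fitted values Y = X w
```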
Linear model after feature transformation
The features can be transformed into different forms to provide a more accurate output without affecting the linearity of the model in the parameters.
$\vec{Y} = \phi(\boldsymbol{x})^T \vec{w} + \vec{\varepsilon}$, where $\phi(\boldsymbol{x})$ is a vector of functions of the features.
Consider a model $y = w_0 + w_1 x_1 + w_2 x_2 + w_3 x_3$. If $x_1^2$, $\cos(x_2)$ or $\ln x_3$ are used instead, the model becomes $y = w_0 + w_1 x_1^2 + w_2 \cos(x_2) + w_3 \ln(x_3)$, i.e. $\vec{Y} = \phi(\boldsymbol{x})^T \vec{w} + \vec{\varepsilon}$ with $\phi(\boldsymbol{x}) = [1, x_1^2, \cos(x_2), \ln(x_3)]$.
Figures: regression with $y = w_0 + w_1 x + w_2 x^2 + w_3 x^3$; regression with $y = w_0 + w_1 \cos(x) + w_2 \sin(2x)$.
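A minimal sketch of the same least-squares fit after a feature transformation, e.g. the polynomial model $y = w_0 + w_1 x + w_2 x^2 + w_3 x^3$ from the left-hand figure (NumPy; `phi` is an illustrative basis-function map and the data are toy values, not from the notes):

```python
import numpy as np

def phi(x):
    """Map a 1-D feature to the basis [1, x, x^2, x^3]; the model stays linear in w."""
    return np.column_stack([np.ones_like(x), x, x ** 2, x ** 3])

np.random.seed(0)
x = np.linspace(-2, 2, 30)
y = 0.5 - 1.0 * x + 0.3 * x ** 2 + 0.8 * x ** 3 + 0.1 * np.random.randn(30)

Phi = phi(x)                                     # transformed design matrix
w_hat = np.linalg.solve(Phi.T @ Phi, Phi.T @ y)  # same normal equations as before
print(w_hat)                                     # estimates of w_0 .. w_3
```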