Machine Learning Diploma
Level3: Machine Learning
Session 5
Agenda
➔ Standard Deviation and Coefficient of Variation (CV)
➔ Decision Tree Regression
➔ Tuning Trees
➔ Sklearn and Decision Trees
1. Standard Deviation and CV
Definition:
Standard Deviation
The Standard Deviation is a measure of how spread out numbers are.
Its symbol is σ (the Greek letter sigma).
The formula is simple: it is the square root of the Variance.
Variance
The Variance is defined as:
The average of the squared differences from the Mean.
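As a quick illustration, here is a minimal sketch in Python using NumPy (the sample values are made up):

```python
import numpy as np

# Hypothetical sample values, for illustration only
x = np.array([4.0, 8.0, 6.0, 5.0, 3.0])

mean = x.mean()
variance = np.mean((x - mean) ** 2)  # average of the squared differences from the mean
std = np.sqrt(variance)              # standard deviation = square root of the variance

print(mean, variance, std)           # matches x.var() and x.std()
```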
Definition:
Coefficient of Variation (CV)
The coefficient of variation (CV) is a statistical measure of the dispersion
of data points in a data series around the mean. The coefficient of
variation represents the ratio of the standard deviation to the mean.
Coefficient of Variation (CV)
CV = σ / mean = 147 / 394 = 37.3%
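The same calculation in code, using the σ = 147 and mean = 394 from the example above:

```python
sigma = 147.0  # standard deviation from the example above
mean = 394.0   # mean from the example above

cv = sigma / mean
print(f"CV = {cv:.1%}")  # CV = 37.3%
```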
2. Decision Tree Regression
Definition:
➔ A decision tree builds regression or classification models in the form of a
tree structure.
➔ It breaks down a dataset into smaller and smaller subsets while at the
same time an associated decision tree is incrementally developed.
➔ The final result is a tree with Decision Nodes and Leaf Nodes.
Tree Nodes:
➔ The Root Node is the initial node; it represents the entire sample and may
be split into further nodes.
➔ The Interior Nodes represent the features of a data set, and the branches
represent the decision rules.
➔ Finally, the Leaf Nodes represent the outcome.
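As a rough sketch of this structure (a hypothetical node type of our own, not sklearn's internal representation):

```python
from dataclasses import dataclass
from typing import Dict, Optional

@dataclass
class Node:
    feature: Optional[str] = None                 # root/interior node: feature to split on
    children: Optional[Dict[str, "Node"]] = None  # branch value -> child subtree
    prediction: Optional[float] = None            # leaf node: the outcome

    def is_leaf(self) -> bool:
        return self.prediction is not None
```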
Decision Tree Structure:
Decision Tree Example:
3. Tuning Trees
Building Decision Tree:
➔ A decision tree is built using a top-down greedy search through the space of
possible branches.
➔ We use Standard Deviation to measure the homogeneity of a numerical
sample: if the sample is completely homogeneous, its standard deviation is
zero (see the quick check below).
➔ A decision tree is built top-down from a root node and involves partitioning
the data into subsets that contain instances with similar values
(homogeneous).
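For instance, a quick check with NumPy:

```python
import numpy as np

# A completely homogeneous sample has zero standard deviation...
print(np.std([40.0, 40.0, 40.0]))  # 0.0
# ...while a more spread-out sample has a larger one
print(np.std([25.0, 40.0, 55.0]))  # ~12.25
```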
Building Decision Tree:
➔ The tree is constructed using Standard Deviation Reduction (SDR).
➔ Standard Deviation Reduction is based on the decrease in standard
deviation after a dataset is split on an attribute.
➔ Constructing a decision tree is all about finding the attribute that returns
the highest standard deviation reduction (i.e., the most homogeneous
branches); a sketch follows.
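A minimal sketch of SDR in Python, assuming pandas and a categorical split attribute (the helper name sdr is ours):

```python
import pandas as pd

def sdr(df: pd.DataFrame, attribute: str, target: str) -> float:
    """Standard deviation reduction from splitting df on a categorical attribute."""
    total_std = df[target].std(ddof=0)       # S(T): std of the target before the split
    weighted_std = 0.0
    for _, subset in df.groupby(attribute):  # one branch per attribute value
        weight = len(subset) / len(df)       # fraction of instances in this branch
        weighted_std += weight * subset[target].std(ddof=0)
    return total_std - weighted_std          # SDR = S(T) - S(T, X)
```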
Tuning Tree:
➔ Step 1: The standard deviation of the target is calculated.
Tuning Tree:
➔ Step 2: The dataset is then split on the different attributes. The standard
deviation for each branch is calculated, and the branch values are combined
as a weighted average (weighted by branch size). The resulting standard
deviation is subtracted from the standard deviation before the split; the
result is the standard deviation reduction.
Tuning Tree:
➔ Step 3: The attribute with the largest standard deviation reduction is
chosen for the decision node.
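Continuing the sketch, step 3 is just an argmax over the candidate attributes (the weather-style rows below are made-up values echoing the slide example):

```python
import pandas as pd

# Hypothetical dataset, for illustration only
df = pd.DataFrame({
    "Outlook": ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy", "Overcast"],
    "Windy":   [False, True, False, False, True, True],
    "Hours":   [26.0, 30.0, 46.0, 45.0, 52.0, 44.0],
})

candidates = ["Outlook", "Windy"]
best = max(candidates, key=lambda attr: sdr(df, attr, "Hours"))  # sdr() from above
print(best)  # the attribute with the largest standard deviation reduction
```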
Tuning Tree:
➔ Step 4-a: The dataset is divided based on the values of the selected
attribute. This process is run recursively on the non-leaf branches, until
all data is processed.
Tuning Tree:
➔ Step 4-a: We need some termination criteria. For example, stop when the
coefficient of variation (CV) for a branch becomes smaller than a certain
threshold (e.g., 10%) and/or when too few instances (n) remain in the
branch (e.g., n ≤ 3); a sketch of such a check follows.
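A sketch of such a termination check (the parameter names cv_threshold and min_samples are ours):

```python
import pandas as pd

def should_stop(subset: pd.DataFrame, target: str,
                cv_threshold: float = 0.10, min_samples: int = 3) -> bool:
    """Stop splitting when a branch is homogeneous enough or too small."""
    mean = subset[target].mean()
    cv = subset[target].std(ddof=0) / mean if mean != 0 else 0.0
    return cv < cv_threshold or len(subset) <= min_samples
```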
Tuning Tree:
➔ Step 4-b: "Overcast" subset does not need any further splitting because
its CV (8%) is less than the threshold (10%). The related leaf node gets
the average of the "Overcast" subset.
Tuning Tree:
➔ Step 4-c: However, the "Sunny" branch has a CV (28%) greater than the
threshold (10%), so it needs further splitting. We select "Windy" as the
best node after "Outlook" because it has the largest SDR.
Tuning Tree:
➔ Step 4-c: Because the number of data points in both branches (FALSE
and TRUE) is less than or equal to 3, we stop further branching and assign
the average of each branch to the related leaf node.
Tuning Tree:
➔ Step 4-d: Moreover, the "Rainy" branch has a CV (22%) greater than the
threshold (10%), so this branch needs further splitting. We select
"Temp" as the best node because it has the largest SDR.
Tuning Tree:
➔ Step 4-d: Because the number of data points in all three branches
(Cool, Hot, and Mild) is less than or equal to 3, we stop further branching
and assign the average of each branch to the related leaf node.
Decision Tree Algorithm Fitting:
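Putting the pieces together, a minimal recursive fitting sketch that reuses the Node, sdr, and should_stop helpers defined earlier (our own illustration, not sklearn's algorithm):

```python
def build_tree(df, attributes, target):
    """Recursively grow a regression tree using standard deviation reduction."""
    if should_stop(df, target) or not attributes:
        return Node(prediction=df[target].mean())  # leaf: average of the branch
    best = max(attributes, key=lambda a: sdr(df, a, target))
    remaining = [a for a in attributes if a != best]
    children = {value: build_tree(subset, remaining, target)
                for value, subset in df.groupby(best)}
    return Node(feature=best, children=children)
```

For example, build_tree(df, ["Outlook", "Windy"], "Hours") on the toy data above returns the root Node of a small regression tree.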
Decision Tree Advantages and Disadvantages:
Advantages:
1. It can be used for both classification and regression problems
2. Easy to understand, interpret, and visualize
3. Useful in data exploration
4. Requires less data preparation
5. Can capture nonlinear relationships
Disadvantages:
1. Prone to overfitting
2. Not well suited for continuous variables
3. Decision trees can be unstable: small changes in the data can produce a very different tree
4. Sklearn and Decision Trees
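A minimal fitting sketch with scikit-learn's DecisionTreeRegressor (the data is synthetic; min_samples_leaf=3 mirrors the n ≤ 3 stopping rule used above):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor, export_text

# Synthetic data for illustration: a noisy sine curve
rng = np.random.RandomState(0)
X = np.sort(5 * rng.rand(80, 1), axis=0)
y = np.sin(X).ravel() + 0.1 * rng.randn(80)

# max_depth and min_samples_leaf limit tree growth to reduce overfitting
tree = DecisionTreeRegressor(max_depth=3, min_samples_leaf=3)
tree.fit(X, y)

print(export_text(tree, feature_names=["x"]))  # text view of the learned splits
print(tree.predict([[2.5]]))                   # prediction for a new point
```

Note that sklearn's regressor greedily minimizes squared error (i.e., variance) at each split rather than standard deviation, but the top-down idea is the same as the SDR procedure above.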
THANK YOU!