Class 10 – Logistic Regression
Prof. Pedram Jahangiry
Road map
• ML Algorithms
  • Supervised
    • Regression: Linear / Polynomial regression, Penalized regression, KNN, SVR, CART, Random Forest
    • Classification: Logistic regression, KNN, SVC, CART, Random Forest
  • Unsupervised
    • Clustering: K-Means, Hierarchical
    • Dimensionality Reduction: Principal Component Analysis (PCA)
Topics
Part I
1. Linear probability model (LPM) vs Logistic regression
2. Sigmoid function
3. Logistic regression
Part II
1. Classification performance metrics
a) Accuracy
b) Precision
c) Recall
d) F1 score
e) ROC and AUC
Classification
• Qualitative variables can be either nominal or ordinal.
• Qualitative variables are often referred to as categorical.
• Classification is the process of predicting categorical variables.
• Classification problems are quite common, perhaps even more common than regression problems.
• Examples:
• Financial instrument tranches (investment grade or junk)
• Online transactions (fraudulent or not)
• Loan application (approved or denied)
• Credit card default (default or not)
• Car insurance customers (high, medium, low risk)
Credit card default example
Goal: Build a classifier that performs well on both the training and the test set.
Part I
Logistic Regression
Linear Probability Model (LPM) vs Logistic Regression
Starting with a simple LPM: $y = \beta_0 + \beta_1\, bal + \epsilon$, where $Y = 1$ for default and $0$ otherwise. Then
$$E[Y \mid bal] = \Pr(Y = 1 \mid bal) = p(x) = \beta_0 + \beta_1\, bal$$
• It seems that simple linear regression is perfect for this task.
• But what are the caveats? One is illustrated in the sketch below.
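A minimal sketch of the main caveat, assuming synthetic balance/default data (all names and numbers below are illustrative): a linear probability model happily produces fitted "probabilities" below 0 and above 1.

import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data: card balance and a 0/1 default indicator
rng = np.random.default_rng(0)
bal = rng.uniform(0, 3000, size=500)
p_true = 1 / (1 + np.exp(-(bal - 2000) / 300))
y = (rng.uniform(size=500) < p_true).astype(int)

lpm = LinearRegression().fit(bal.reshape(-1, 1), y)

# Fitted values at extreme balances escape the [0, 1] interval
print(lpm.predict(np.array([[0.0], [5000.0]])))  # e.g. a negative value and a value above 1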
Sigmoid Function
• We need a monotone mapping function that takes values in (0, 1); the sigmoid, sketched below, is the standard choice.
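A minimal NumPy sketch of the sigmoid (logistic) function:

import numpy as np

def sigmoid(z):
    # Monotone map from the real line into (0, 1); sigmoid(0) = 0.5
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(np.array([-10.0, 0.0, 10.0])))  # ≈ [0.0000454, 0.5, 0.9999546]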
Logistic Regression (Model)
• The model:
$$f_{w,b}(X) = \frac{1}{1 + e^{-(WX + b)}}$$
• In the case of two classes, $f_{w,b}(X) = \Pr(Y = 1 \mid x) = p(x)$.
• A bit of rearrangement gives
$$\log\left(\frac{p(x)}{1 - p(x)}\right) = WX + b$$
• This monotone transformation is called the log odds or logit transformation of 𝑝(𝑥).
• Logistic regression ensures that our estimates always lie between 0 and 1.
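The rearrangement, step by step, writing $z = WX + b$:

$$p(x) = \frac{1}{1 + e^{-z}} \;\Rightarrow\; 1 - p(x) = \frac{e^{-z}}{1 + e^{-z}} \;\Rightarrow\; \frac{p(x)}{1 - p(x)} = e^{z} \;\Rightarrow\; \log\frac{p(x)}{1 - p(x)} = WX + b$$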
Logistic regression fit (Decision boundary)
• Depending on how we define $WX + b$ (e.g., raw features vs. polynomial features), the logistic regression classifier can produce linear or nonlinear decision boundaries; see the sketch below.
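A scikit-learn sketch of this idea on synthetic two-moons data (the dataset and polynomial degree are illustrative choices): with raw features the boundary $WX + b = 0$ is a straight line, while polynomial features bend it.

import numpy as np
from sklearn.datasets import make_moons
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

X, y = make_moons(noise=0.2, random_state=0)

linear = LogisticRegression().fit(X, y)  # linear decision boundary
curved = make_pipeline(PolynomialFeatures(degree=3),
                       LogisticRegression(max_iter=1000)).fit(X, y)  # nonlinear boundary

print(linear.score(X, y), curved.score(X, y))  # the curved boundary typically fits better here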
Logistic Regression (Maximum Likelihood)
• In logistic regression, instead of minimizing the average loss, we maximize the likelihood
of the training data according to our model. This is called maximum likelihood estimation.
• A fantastic visualization!
• Can you do the same visualization with the S curve?
$$L_{w,b} = \prod_i f_{w,b}(x_i)^{\,y_i}\left(1 - f_{w,b}(x_i)\right)^{1 - y_i}$$
Logistic Regression (Objective function)
• Maximizing the likelihood function:
$$\max_{w,b}\; L_{w,b} = \prod_i f_{w,b}(x_i)^{\,y_i}\left(1 - f_{w,b}(x_i)\right)^{1 - y_i}$$
• Solution: In practice, it is more convenient to maximize the log-likelihood function
$$\log L_{w,b} = \sum_i \left[\, y_i \log f_{w,b}(x_i) + (1 - y_i)\log\left(1 - f_{w,b}(x_i)\right) \right]$$
Maximizing it gives us $w^*$ and $b^*$. There is no closed-form solution to this optimization problem, so we use gradient descent (sketched below).
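A minimal NumPy sketch of that gradient step, ascending the log-likelihood (the learning rate and iteration count are illustrative):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, lr=0.1, n_iter=5000):
    # X: (n, d) feature matrix; y: (n,) labels in {0, 1}
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(n_iter):
        p = sigmoid(X @ w + b)          # current predicted probabilities
        w += lr * (X.T @ (y - p) / n)   # d(log-likelihood)/dw, averaged over the sample
        b += lr * np.mean(y - p)        # d(log-likelihood)/db
    return w, b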
• We are now ready to make predictions.
$$f_{w^*,\,b^*}(X) = \frac{1}{1 + e^{-(W^*X + b^*)}}$$
• Depending on the probability threshold we choose, we can classify the observations. In practice, the appropriate threshold depends on the problem.
Logistic regression output for credit card default example
$$P(default \mid bal, inc) = \frac{1}{1 + e^{-(b + w_1\, bal + w_2\, inc)}}$$
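A sketch of fitting this model with scikit-learn on stand-in data (the DataFrame, column names, and generating numbers below are hypothetical, not the textbook Default dataset):

import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
df = pd.DataFrame({"bal": rng.uniform(0, 2700, 1000),
                   "inc": rng.uniform(10_000, 75_000, 1000)})
df["default"] = (rng.uniform(size=1000) < 1 / (1 + np.exp(-(df["bal"] - 1900) / 250))).astype(int)

clf = LogisticRegression(max_iter=1000).fit(df[["bal", "inc"]], df["default"])
probs = clf.predict_proba(df[["bal", "inc"]])[:, 1]  # estimated P(default | bal, inc)
labels = (probs >= 0.5).astype(int)                  # 0.5 threshold; adjust per problem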
Part II
Classification Performance Metrics
Confusion Matrix
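A minimal scikit-learn sketch showing how the four cells are laid out (toy labels):

from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 1, 0, 1, 0]
y_pred = [0, 1, 1, 1, 0, 0, 1, 0]

# scikit-learn layout for binary labels:
# [[TN, FP],
#  [FN, TP]]  (rows = actual, columns = predicted)
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tn, fp, fn, tp)  # 3 1 1 3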
Accuracy, Precision, Recall, and F1 Score
While recall expresses the ability to find all relevant instances in a dataset, precision expresses the proportion of the data points our model flags as relevant that are actually relevant.
F1 uses the harmonic mean instead of a simple average because it
punishes extreme values.
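In confusion-matrix terms:

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \qquad \text{Precision} = \frac{TP}{TP + FP}, \qquad \text{Recall} = \frac{TP}{TP + FN}$$

$$F1 = 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}$$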
ROC (Receiver Operating Characteristic)
(Figure: the ROC curve plots TPR against FPR as the probability threshold ρ varies; lowering ρ reduces false negatives (FN) at the cost of more false positives (FP).)
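A scikit-learn sketch of tracing the curve and computing AUC (toy labels and scores):

from sklearn.metrics import roc_curve, roc_auc_score

y_true = [0, 0, 1, 1, 1, 0, 1, 0]
scores = [0.1, 0.6, 0.8, 0.7, 0.3, 0.2, 0.9, 0.4]  # predicted P(Y = 1)

fpr, tpr, thresholds = roc_curve(y_true, scores)  # one (FPR, TPR) point per threshold
print(roc_auc_score(y_true, scores))              # 0.875 for these toy numbers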
AUC (Area Under the ROC Curve)
• AUC summarizes the entire ROC curve in a single number: 0.5 corresponds to random guessing, 1.0 to a perfect classifier.
Some other classification metrics
Students’ questions
1) Are we treating (classifying) $\hat{y} = 0.51$ and $\hat{y} = 0.99$ the same?
2) Does it make sense to have non-linear decision boundaries in logistic regression?
3) Is logistic regression useful for anything beyond probability prediction?
4) What do ROC and AUC tell us about our predictions?