Logistic Regression
• A supervised classification algorithm.
• Predicted output is discrete – only specific values or classes
are allowed, e.g., Pass/Fail, Spam/No spam, Malignant or
Benign tumor, etc.
• That is, the output y (y = f (x)), predicted from x (inputs or
features), takes only discrete values/classes.
• Types of logistic regression:
(a) Binary logistic regression: y takes two discrete values (for
two classes).
Logistic Regression (contd.)
(b) Multi-class logistic regression: y takes more than two
discrete values (for multiple classes). For example, in digit
classification, there are 10 classes (0 – 9).
(c) Ordinal logistic regression: Multiple classes in some order
(e.g., Low, Medium, High).
• Predicts class based on probability.
Binary Logistic Regression
• Two-class classification: y = 0 or 1.
• y can be predicted based on single variable or feature (x):
y = f (x) = β0 + β1 x
• Based on multiple features (x1 , x2 , · · · , xk ):
y = f (x1 , x2 , · · · , xk ) = β0 + β1 x1 + β2 x2 + · · · + βk xk
• An example on student dataset:
Hours studied   Hours slept   Result (Pass (1)/Fail (0))
4.85            9.63          1
8.62            3.23          0
5.43            8.23          1
9.21            6.34          0
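For concreteness, this toy dataset could be written as NumPy arrays, as in the minimal Python sketch below (the variable names are illustrative):

    import numpy as np

    # Each row holds one student's features: [hours studied, hours slept].
    X = np.array([[4.85, 9.63],
                  [8.62, 3.23],
                  [5.43, 8.23],
                  [9.21, 6.34]])

    # Labels: 1 = Pass, 0 = Fail.
    Y = np.array([1, 0, 1, 0])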
Binary Logistic Regression (contd.)
• f should generate probabilities (in the range, 0 to 1), to
predict classes.
• Choose a threshold P (say, 0.5) such that, if probability is less
than threshold P, predict y as Class 0. Otherwise, predict y
as Class 1.
• The above function, f , is the one used for linear regression.
• As shown in the figure below, it is not useful for logistic
regression, since the predicted output should be in the range, [0, 1].
Binary Logistic Regression (contd.)
• A function that has the range, [0, 1], is required.
• Sigmoid function maps any real value to the range, [0, 1].
σ(z) = 1 / (1 + e^−z )
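As a quick Python sketch (the function name sigmoid is illustrative):

    import numpy as np

    def sigmoid(z):
        """Map any real value z to a value strictly between 0 and 1."""
        return 1.0 / (1.0 + np.exp(-z))

    print(sigmoid(0.0))    # 0.5
    print(sigmoid(10.0))   # close to 1
    print(sigmoid(-10.0))  # close to 0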
Binary Logistic Regression (contd.)
• If z = β0 + β1 x1 + β2 x2 + · · · + βk xk , σ(z) gives values in the
range, [0, 1].
Binary Logistic Regression (contd.)
• Logistic regression is based on the Sigmoid function, which is
also called the Logistic function, given by:
y = f (x1 , x2 , · · · , xk ) = 1 / (1 + e^−(β0 + β1 x1 + β2 x2 + · · · + βk xk ) )
• Similar to linear regression, coefficients (β0 , β1 , etc.) are
learnt from training data, based on gradient descent on the
error or cost function – Training process.
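A sketch of this hypothesis in Python, assuming beta[0] holds the intercept β0 :

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def f(x, beta):
        """Logistic model: sigmoid of beta0 + beta1*x1 + ... + betak*xk.
        x: feature vector of length k; beta: coefficients of length k + 1."""
        z = beta[0] + np.dot(beta[1:], x)
        return sigmoid(z)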
Cost function
• Consider n training points or samples, (Xi , Yi ), where each Xi
is a k-dimensional feature vector (k features) and Yi = 0 or 1.
• For logistic regression, the Sigmoid function (with its
exponential term in the denominator) makes MSE non-convex in
the coefficients, so the following cost function, called
Log-Loss or Cross Entropy, is used (instead of the MSE used
for linear regression):
J(θ) = J(β0 , β1 , · · · , βk ) = −(1/n) Σ_{i=1}^{n} [Yi log(f (Xi )) + (1 − Yi ) log(1 − f (Xi ))]
• Similar to MSE, the above cost function which quantifies the
difference between actual (Yi ) and predicted (f (Xi )) outputs,
has to be minimized.
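A minimal Python sketch of the Log-Loss (here f_X stands for the predicted probabilities f (Xi ); the probe values are illustrative):

    import numpy as np

    def log_loss(Y, f_X):
        """Cross entropy between actual labels Y (0/1) and
        predicted probabilities f_X, averaged over the n samples."""
        return -np.mean(Y * np.log(f_X) + (1 - Y) * np.log(1 - f_X))

    Y   = np.array([1, 0, 1, 0])
    f_X = np.array([0.9, 0.2, 0.8, 0.3])
    print(log_loss(Y, f_X))  # small, since the predictions match the labels well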
Cost function (contd.)
• If the actual output Yi = 1, the loss term for that sample is
−log (f (Xi )). If f (Xi ) is low, the error is high. As f (Xi )
approaches 1, the loss moves towards 0.
• If the actual output Yi = 0, the loss term is −log (1 − f (Xi )).
If f (Xi ) is high, the error is high. As f (Xi ) approaches 0,
the loss moves towards 0.
• That means, if the actual and predicted outputs, Yi and
f (Xi ), match, the error is 0.
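A quick numeric check of the two cases (the probe probabilities are illustrative):

    import numpy as np

    # Yi = 1: loss is -log(f(Xi)); it shrinks as f(Xi) approaches 1.
    for p in [0.1, 0.5, 0.9, 0.99]:
        print(f"f(Xi)={p}: loss={-np.log(p):.3f}")

    # Yi = 0: loss is -log(1 - f(Xi)); it shrinks as f(Xi) approaches 0.
    for p in [0.9, 0.5, 0.1, 0.01]:
        print(f"f(Xi)={p}: loss={-np.log(1 - p):.3f}")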
Cost function (contd.)
• Illustrated in the plots below: J(θ) vs f (X ), for Y = 1 and for Y = 0.
Gradient descent for training
• Initialize each βj , j = 0, 1, · · · , k, to some random value.
• For one or more epochs, or until some minimum error
threshold (say, ϵ < 0.001) is reached, do the following:
For each j = 0, 1, · · · , k,
(i) δβj = −η ∂J(θ)/∂βj
(ii) βj = βj + δβj
• Training process can be monitored by plotting cost, J(θ) vs
number of training epochs.
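A batch gradient descent sketch in Python, assuming the standard log-loss gradient (1/n) Σ (f (Xi ) − Yi ) Xij ; names like train, eta, and epochs are illustrative:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def train(X, Y, eta=0.1, epochs=1000):
        """Learn beta by batch gradient descent on the log-loss.
        X: (n, k) feature matrix; Y: (n,) labels in {0, 1}."""
        n, k = X.shape
        Xb = np.hstack([np.ones((n, 1)), X])   # prepend 1s so beta[0] is the intercept
        beta = np.random.default_rng(0).normal(scale=0.01, size=k + 1)
        for _ in range(epochs):
            f_X = sigmoid(Xb @ beta)           # predicted probabilities f(Xi)
            grad = Xb.T @ (f_X - Y) / n        # dJ/dbeta for the log-loss
            beta += -eta * grad                # steps (i) and (ii) above
        return beta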
Gradient descent for training (contd.)
[Plot: cost, J(θ), vs number of training epochs]
Testing
• Once the weights or coefficients (β0 , β1 , · · · , βk ) are learnt
from training, the function,
y = f (x1 , x2 , · · · , xk ) = 1 / (1 + e^−(β0 + β1 x1 + β2 x2 + · · · + βk xk ) )
can be used to predict the output, y , for any new input,
(x1 , x2 , · · · , xk ).
• Choose a threshold, P, say 0.5.
• If y ≥ P, output Class 1.
• If y < P, output Class 0.
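A sketch of this prediction rule (predict is an illustrative name; beta is assumed to come from training, e.g. the train sketch above):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def predict(x, beta, P=0.5):
        """Output Class 1 if the predicted probability y >= P, else Class 0."""
        y = sigmoid(beta[0] + np.dot(beta[1:], x))
        return 1 if y >= P else 0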
Multi-class logistic regression
• Suppose there are c classes.
• Divide the problem into c binary classification problems, one
for each class (the one-vs-rest approach) – generate c binary
classifiers.
• To train binary classifier i, training samples from class i have
actual output, y = 1. For the remaining samples, y = 0.
• After training c binary classifiers, each classifier i can be used
to determine if a sample belongs to class i or not (binary).
• For testing, compute probabilities from the c binary classifiers
corresponding to the c classes, and then output the class
which has the maximum probability.
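A one-vs-rest sketch, reusing the binary pieces above (train_one_vs_rest and predict_multiclass are illustrative names; train_binary can be any binary trainer such as the train sketch above):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def train_one_vs_rest(X, Y, c, train_binary):
        """Train c binary classifiers; classifier i treats class i as 1, the rest as 0."""
        return [train_binary(X, (Y == i).astype(float)) for i in range(c)]

    def predict_multiclass(x, betas):
        """Output the class whose binary classifier gives the maximum probability."""
        probs = [sigmoid(b[0] + np.dot(b[1:], x)) for b in betas]
        return int(np.argmax(probs))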
Thank You