BINARY LOGISTIC REGRESSION
Linear Regression is defined by the statement:

$$Y_i \sim N(\beta_1 + \beta_2 X_{i2} + \dots + \beta_k X_{ik},\ \sigma^2)$$

or

$$Y_i \sim N\Big(\sum_{j=1}^{k} \beta_j X_{ij},\ \sigma^2\Big), \quad i = 1,2,\dots,n, \quad j = 1,2,\dots,k, \quad X_{i1} = 1\ \forall\, i.$$
In BINARY LOGISTIC REGRESSION, Y assumes the values 0 and 1 and is therefore a Bernoulli random variable; the explanatory variables can be discrete or continuous but are treated as fixed.
The basic form of Logistic regression can be derived using Bayes’ rule. Assume that k = 2, so that there is one non-trivial explanatory variable X and a constant term. Then

$$
\begin{aligned}
P(Y=0 \mid X) &= \frac{P(Y=0)\,P(X \mid Y=0)}{P(Y=0)\,P(X \mid Y=0) + P(Y=1)\,P(X \mid Y=1)} \\
&= \frac{1}{1 + \dfrac{P(Y=1)\,P(X \mid Y=1)}{P(Y=0)\,P(X \mid Y=0)}} \\
&= \frac{1}{1 + \exp\left( \log \dfrac{P(Y=1)\,P(X \mid Y=1)}{P(Y=0)\,P(X \mid Y=0)} \right)} \\
&= \frac{1}{1 + \exp\left( \log \dfrac{P(Y=1)}{P(Y=0)} + \log \dfrac{P(X \mid Y=1)}{P(X \mid Y=0)} \right)} \\
&= \frac{1}{1 + \exp(\beta_1 + \beta_2 X)}, \qquad (1)
\end{aligned}
$$

where

$$\beta_1 = \log \frac{P(Y=1)}{P(Y=0)}$$

and

$$\beta_2 X = \log \frac{P(X \mid Y=1)}{P(X \mid Y=0)},$$

if X is discrete. Also, (1) implies

$$P(Y=1 \mid X) = \frac{\exp(\beta_1 + \beta_2 X)}{1 + \exp(\beta_1 + \beta_2 X)}.$$
If X is continuous, (1) holds with the density f(·) in place of P. In other words,

$$P(Y_i = 1 \mid X_{i1}) = F(\beta_1 + \beta_2 X_{i1}),$$

where

$$F(x) = \frac{\exp(x)}{1 + \exp(x)}.$$

The conditional probability function is:

$$
f(y \mid X_{i1}) = P(Y_i = y \mid X_{i1})
= \big(F(\beta_1 + \beta_2 X_{i1})\big)^{y}\,\big(1 - F(\beta_1 + \beta_2 X_{i1})\big)^{1-y}
= \begin{cases} F(\beta_1 + \beta_2 X_{i1}) & \text{if } y = 1 \\ 1 - F(\beta_1 + \beta_2 X_{i1}) & \text{if } y = 0. \end{cases}
$$
Thus, the logistic regression model is:

$$Y_i \mid X_{i1} \sim \text{Bernoulli}\left( \frac{\exp(\beta_1 + \beta_2 X_{i1})}{1 + \exp(\beta_1 + \beta_2 X_{i1})} \right)$$

or

$$\pi_i = P(Y_i = 1 \mid X_{i1}) = \frac{\exp(\beta_1 + \beta_2 X_{i1})}{1 + \exp(\beta_1 + \beta_2 X_{i1})}$$

or

$$\log \frac{\pi_i}{1 - \pi_i} = \beta_1 + \beta_2 X_{i1}$$

or

$$\operatorname{logit}(\pi_i) = \beta_1 + \beta_2 X_{i1}.$$

The term Logistic Regression derives from the fact that the function $F(x) = \dfrac{\exp(x)}{1 + \exp(x)}$ is known as the Logistic Function.
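To make the model concrete, here is a minimal Python sketch (NumPy assumed; the coefficient values and covariates are made up purely for illustration) that evaluates the logistic function F and the implied success probabilities:

```python
import numpy as np

def logistic(x):
    """Logistic function F(x) = exp(x) / (1 + exp(x))."""
    return 1.0 / (1.0 + np.exp(-x))  # algebraically equivalent, numerically safer form

# Hypothetical coefficients and covariate values (illustrative only)
beta1, beta2 = -1.0, 0.5
x = np.array([0.0, 1.0, 2.0, 3.0])

pi = logistic(beta1 + beta2 * x)  # P(Y_i = 1 | X_i1 = x_i)
print(pi)
```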
ASSUMPTIONS
▪ The data $Y_1, Y_2, \dots, Y_n$ are independently distributed, i.e., cases are independent.
▪ Binary logistic regression model assumes Bernoulli distribution of the response.
▪ It does NOT assume a linear relationship between the dependent variable and the independent variables, but it does assume a linear relationship between the logit of the response and the explanatory variables: $\operatorname{logit}(\pi_i) = \beta_1 + \beta_2 X_{i1}$.
▪ The independent (explanatory) variables may even be power terms or other nonlinear transformations of the original independent variables.
▪ The homogeneity of variance does NOT need to be satisfied. In fact, it is not even possible in
many cases given the model structure.
▪ Errors need to be independent but NOT normally distributed.
▪ It uses maximum likelihood estimation (MLE) rather than ordinary least squares (OLS) to
estimate the parameters, and thus relies on large-sample approximations.
For modelling, Logistic Regression is often used to estimate probabilities as a function of the explanatory variables X and the parameters β. Often these probabilities are used to find odds, odds ratios and relative risks.
ODDS AND ODDS RATIOS
The odds is the ratio of the probability that something is true to the probability that it is not true. Thus,

$$\operatorname{Odd}(X) = \frac{P(Y_i = 1 \mid X_{i1})}{P(Y_i = 0 \mid X_{i1})} = \exp(\beta_1 + \beta_2 X_{i1}).$$
The odds ratio is the ratio of two odds for different values of $X_{i1}$, say $X_{i1} = x$ and $X_{i1} = x + \Delta x$:

$$\frac{\operatorname{Odd}(x + \Delta x)}{\operatorname{Odd}(x)} = \frac{\exp(\beta_1 + \beta_2 (x + \Delta x))}{\exp(\beta_1 + \beta_2 x)} = \exp(\beta_2 \Delta x),$$

where $\Delta x$ is a small change in x.
Then,

$$
\begin{aligned}
\lim_{\Delta x \to 0} \frac{1}{\Delta x}\,\frac{\operatorname{Odd}(x + \Delta x) - \operatorname{Odd}(x)}{\operatorname{Odd}(x)}
&= \lim_{\Delta x \to 0} \frac{\exp(\beta_2 \Delta x) - 1}{\Delta x} \\
&= \beta_2 \lim_{\Delta x \to 0} \frac{\exp(\beta_2 \Delta x) - 1}{\beta_2 \Delta x} \\
&= \beta_2 \left. \frac{d \exp(u)}{du} \right|_{u=0} \\
&= \beta_2 \exp(0) \\
&= \beta_2 .
\end{aligned}
$$

Thus, $\beta_2$ may be interpreted as the relative change in the odds due to a small change $\Delta x$ in $X_{i1}$:

$$\frac{\operatorname{Odd}(x + \Delta x) - \operatorname{Odd}(x)}{\operatorname{Odd}(x)} = \frac{\operatorname{Odd}(x + \Delta x)}{\operatorname{Odd}(x)} - 1 \approx \beta_2 \Delta x.$$
If $X_{i1}$ is a binary variable itself, $X_{i1} = 0$ or $X_{i1} = 1$, then the only reasonable choices for $x + \Delta x$ and $x$ are 1 and 0, respectively, so that

$$\frac{\operatorname{Odd}(1)}{\operatorname{Odd}(0)} - 1 = \frac{\operatorname{Odd}(1) - \operatorname{Odd}(0)}{\operatorname{Odd}(0)} = \exp(\beta_2) - 1.$$

Only if $\beta_2$ is small may we use the approximation $\exp(\beta_2) - 1 \approx \beta_2$. If not, one has to interpret $\beta_2$ in terms of the log of the odds ratio involved:

$$\log \frac{\operatorname{Odd}(1)}{\operatorname{Odd}(0)} = \beta_2 .$$
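A minimal numerical sketch of these odds and odds-ratio calculations in Python (the fitted coefficients below are assumed values, purely for illustration):

```python
import numpy as np

# Hypothetical fitted coefficients (illustrative only)
beta1, beta2 = -0.8, 0.4

def odds(x):
    """Odds of Y = 1 at covariate value x: exp(beta1 + beta2 * x)."""
    return np.exp(beta1 + beta2 * x)

# Odds ratio for a unit change (e.g., binary X: x = 0 versus x = 1)
print(odds(1.0) / odds(0.0))              # equals exp(beta2)

# Relative change in the odds; close to beta2 only when beta2 is small
print(odds(1.0) / odds(0.0) - 1.0, beta2) # exp(beta2) - 1 vs. beta2
```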
GENERALIZATION
If $k > 2$ and the $X_{ij}$ are independent,

$$\log \frac{P(X \mid Y=1)}{P(X \mid Y=0)} = \sum_{j=2}^{k} \log \frac{P(X_{ij} \mid Y_i = 1)}{P(X_{ij} \mid Y_i = 0)}.$$

Setting

$$\beta_j X_{ij} = \log \frac{P(X_{ij} \mid Y_i = 1)}{P(X_{ij} \mid Y_i = 0)},$$

one can extend the model and obtain the general logistic regression model

$$Y_i \mid X_{ij} \sim \text{Bernoulli}\left( \frac{\exp\big(\beta_1 + \sum_{j=2}^{k} \beta_j X_{ij}\big)}{1 + \exp\big(\beta_1 + \sum_{j=2}^{k} \beta_j X_{ij}\big)} \right).$$
Regardless of whether the X's are dichotomous, polychotomous or continuous, Logistic Regression is a way to identify the distribution of Y as a function of X and of the parameter β, just as linear regression is a way to identify the distribution of Y as a function of X and of a (different) parameter β.
The interpretation of the coefficients $\beta_j$, $j = 2,3,\dots,k$, in the logistic model is given by

$$\frac{\operatorname{Odd}\big(X_{i1}, \dots, X_{i,j-1},\ X_{ij} + \Delta X_{ij},\ X_{i,j+1}, \dots, X_{ik}\big)}{\operatorname{Odd}\big(X_{i1}, \dots, X_{i,j-1},\ X_{ij},\ X_{i,j+1}, \dots, X_{ik}\big)} - 1 \approx \beta_j \Delta X_{ij},$$

if $\Delta X_{ij}$ is small. That is, $\beta_j \Delta X_{ij}$ may be interpreted as the relative change in the odds due to a small change $\Delta X_{ij}$ in $X_{ij}$, the other explanatory variables being held fixed.
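A short vectorized sketch of the general model (NumPy assumed; the design matrix and coefficient vector are made-up illustrative values):

```python
import numpy as np

# Design matrix with a leading column of ones (X_i1 = 1 for all i);
# the remaining columns are hypothetical covariates.
X = np.array([[1.0, 0.5, 2.0],
              [1.0, 1.5, 0.3],
              [1.0, 2.5, 1.1]])
beta = np.array([-0.5, 0.8, -0.2])  # (beta_1, ..., beta_k), assumed values

eta = X @ beta                      # linear predictor: beta_1 + sum_j beta_j X_ij
pi = 1.0 / (1.0 + np.exp(-eta))     # P(Y_i = 1 | X_i)
print(pi)
```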
ESTIMATION OF PARAMETERS
Let k = 2. The parameters $\beta_1$ and $\beta_2$ are estimated using the method of maximum likelihood. The log of the likelihood function $L(\beta_1, \beta_2)$ is given as:

$$
\begin{aligned}
\log L(\beta_1, \beta_2) &= \sum_{i=1}^{n} \log f(y_i \mid X_{i1}, \beta_1, \beta_2) \\
&= \sum_{i=1}^{n} y_i \log F(\beta_1 + \beta_2 X_{i1}) + \sum_{i=1}^{n} (1 - y_i) \log\big(1 - F(\beta_1 + \beta_2 X_{i1})\big) \\
&= \sum_{i=1}^{n} y_i \log \frac{F(\beta_1 + \beta_2 X_{i1})}{1 - F(\beta_1 + \beta_2 X_{i1})} + \sum_{i=1}^{n} \log\big(1 - F(\beta_1 + \beta_2 X_{i1})\big) \\
&= \sum_{i=1}^{n} y_i (\beta_1 + \beta_2 X_{i1}) - \sum_{i=1}^{n} \log\big(1 + \exp(\beta_1 + \beta_2 X_{i1})\big).
\end{aligned}
$$

Then

$$\frac{\partial \log L(\beta_1, \beta_2)}{\partial \beta_1} = \sum_{i=1}^{n} y_i - \sum_{i=1}^{n} \frac{\exp(\beta_1 + \beta_2 X_{i1})}{1 + \exp(\beta_1 + \beta_2 X_{i1})} = \sum_{i=1}^{n} (y_i - \pi_i)$$

and

$$\frac{\partial \log L(\beta_1, \beta_2)}{\partial \beta_2} = \sum_{i=1}^{n} y_i X_{i1} - \sum_{i=1}^{n} \frac{X_{i1} \exp(\beta_1 + \beta_2 X_{i1})}{1 + \exp(\beta_1 + \beta_2 X_{i1})} = \sum_{i=1}^{n} (y_i - \pi_i) X_{i1}.$$
Setting these derivatives to zero gives transcendental equations, so it is not possible to obtain closed-form solutions for $\hat\beta_1$ and $\hat\beta_2$. Newton-Raphson can be used to obtain $\hat\beta = (\hat\beta_1, \hat\beta_2)'$:

• Guess an initial value of $\hat\beta = (\hat\beta_1, \hat\beta_2)'$, say $\hat\beta^{(0)} = (\hat\beta_1^{(0)}, \hat\beta_2^{(0)})'$.

• Use

$$\hat\beta^{(t+1)} = \hat\beta^{(t)} + (-H)^{-1} \begin{pmatrix} \partial \log L(\beta_1, \beta_2) / \partial \beta_1 \\ \partial \log L(\beta_1, \beta_2) / \partial \beta_2 \end{pmatrix}_{\beta = \hat\beta^{(t)}},$$

where H is the Hessian matrix given as

$$H = \begin{pmatrix} \dfrac{\partial^2 \log L(\beta_1, \beta_2)}{\partial \beta_1^2} & \dfrac{\partial^2 \log L(\beta_1, \beta_2)}{\partial \beta_1 \partial \beta_2} \\[2ex] \dfrac{\partial^2 \log L(\beta_1, \beta_2)}{\partial \beta_1 \partial \beta_2} & \dfrac{\partial^2 \log L(\beta_1, \beta_2)}{\partial \beta_2^2} \end{pmatrix},$$

iteratively, till two consecutive values of $\hat\beta$ are approximately equal.

The estimated variance-covariance matrix of $\hat\beta$ is $(-H)^{-1}$. The diagonal elements of this matrix give the estimated variances, and their square roots the estimated standard errors, of the estimates of $\beta_1$ and $\beta_2$.

For k > 2, the result can be generalized.
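The scheme above can be sketched in a few lines of Python with NumPy. This is a minimal illustration, not production code: the data are simulated, and the starting value and tolerance are arbitrary choices:

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated data (true coefficients chosen arbitrarily for illustration)
n = 500
X = np.column_stack([np.ones(n), rng.normal(size=n)])   # X_i1 = 1, one covariate
y = rng.binomial(1, 1 / (1 + np.exp(-(X @ np.array([-0.5, 1.2])))))

beta = np.zeros(2)                        # initial guess beta_hat^(0)
for _ in range(25):
    pi = 1 / (1 + np.exp(-X @ beta))      # pi_i at the current iterate
    grad = X.T @ (y - pi)                 # score vector d log L / d beta
    H = -(X.T * (pi * (1 - pi))) @ X      # Hessian of log L
    step = np.linalg.solve(-H, grad)      # (-H)^(-1) times the score
    beta = beta + step
    if np.max(np.abs(step)) < 1e-10:      # consecutive values approximately equal
        break

cov = np.linalg.inv(-H)                   # estimated variance-covariance matrix
se = np.sqrt(np.diag(cov))                # estimated standard errors
print(beta, se)
```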
TESTING OF HYPOTHESES
I. Testing the significance of a single regression coefficient
If the sample size is large, then under $H_0 : \beta_j = \beta_{j0}$,

$$\frac{\sqrt{n}\,(\hat\beta_j - \beta_{j0})}{\hat s_{\hat\beta_j}} \sim N(0, 1), \quad j = 1,2,\dots,k.$$

This result can be used to test whether the coefficient $\beta_j$ is zero or not, $j = 2,3,\dots,k$. The null hypothesis $H_0 : \beta_j = 0$, $j = 2,\dots,k$, is of interest since this hypothesis implies that the conditional probability $P(Y_i = 1 \mid X_{ij})$ does not depend on $X_{ij}$, $j = 2,3,\dots,k$. Under $H_0 : \beta_j = 0$,

$$\frac{\sqrt{n}\,\hat\beta_j}{\hat s_{\hat\beta_j}} \sim N(0, 1), \quad j = 2,\dots,k.$$

This statistic is called a pseudo t-value, as it is used in the same way as the t-value in linear regression, and $\hat s_{\hat\beta_j}$ is called the standard error of $\hat\beta_j$. The test statistic is also called Wald’s statistic and the corresponding test Wald’s test.
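Continuing the Newton-Raphson sketch above, a Wald test of $H_0 : \beta_j = 0$ might look as follows (SciPy's normal distribution assumed available; the `beta` and `se` values are placeholders standing in for the estimates computed earlier, whose standard errors from $(-H)^{-1}$ already carry the $1/\sqrt{n}$ factor, so no explicit $\sqrt{n}$ appears):

```python
import numpy as np
from scipy.stats import norm

# beta and se as computed in the Newton-Raphson sketch above;
# placeholder values here so the snippet runs stand-alone
beta = np.array([-0.48, 1.17])
se = np.array([0.11, 0.13])

z = beta / se                      # Wald (pseudo t) statistics for H0: beta_j = 0
p_values = 2 * norm.sf(np.abs(z))  # two-sided p-values
print(z, p_values)
```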
II. Testing the joint significance of all predictors
We are interested in testing $H_0 : \beta_2 = \beta_3 = \dots = \beta_m = 0$ ($m \le k$) against the alternative hypothesis that at least one of $\beta_2, \beta_3, \dots, \beta_m$ is not equal to zero. For this we proceed as follows:

Re-estimate the logit model using

$$\log L\big(0, 0, \dots, 0, \hat\beta_{m+1}, \hat\beta_{m+2}, \dots, \hat\beta_k\big) = \max_{\beta_{m+1}, \beta_{m+2}, \dots, \beta_k} \log L\big(0, 0, \dots, 0, \beta_{m+1}, \beta_{m+2}, \dots, \beta_k\big).$$

Then, under $H_0$,

$$LR_m = -2 \log \frac{L\big(0, 0, \dots, 0, \hat\beta_{m+1}, \hat\beta_{m+2}, \dots, \hat\beta_k\big)}{L\big(\hat\beta_1, \hat\beta_2, \dots, \hat\beta_k\big)} \sim \chi^2_{m-1}.$$

This is the LIKELIHOOD RATIO test, which is right-sided.
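A self-contained sketch of the likelihood-ratio computation (simulated data; the helper `fit_logit` just repeats the Newton-Raphson loop sketched above, and the iteration count is an arbitrary choice):

```python
import numpy as np
from scipy.stats import chi2

def fit_logit(X, y, iters=25):
    """Newton-Raphson MLE for logistic regression, as sketched above."""
    beta = np.zeros(X.shape[1])
    for _ in range(iters):
        pi = 1 / (1 + np.exp(-X @ beta))
        beta += np.linalg.solve((X.T * (pi * (1 - pi))) @ X, X.T @ (y - pi))
    return beta

def log_lik(X, y, beta):
    """log L = sum_i [ y_i * eta_i - log(1 + exp(eta_i)) ]."""
    eta = X @ beta
    return np.sum(y * eta - np.log1p(np.exp(eta)))

# Simulated data: intercept plus two covariates (k = 3), coefficients assumed
rng = np.random.default_rng(1)
n = 500
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
y = rng.binomial(1, 1 / (1 + np.exp(-(X @ np.array([-0.3, 0.9, 0.0])))))

m = 3                             # test H0: beta_2 = beta_3 = 0 (intercept stays free)
X0 = X[:, :1]                     # restricted design: intercept only
LR = -2 * (log_lik(X0, y, fit_logit(X0, y)) - log_lik(X, y, fit_logit(X, y)))
print(LR, chi2.sf(LR, df=m - 1))  # right-sided chi-square p-value
```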
PREDICTION WITH LOGISTIC REGRESSION
From a prediction point of view, logistic regression can be used for classification, with zero and one taken as the class labels.

Suppose data of the form $(Y_i, X_{i1})$, $i = 1,2,\dots,n$, are available and estimates of the parameters have been obtained. These estimators are consistent and asymptotically normally distributed. The objective is to estimate the conditional probability of an event such as $Y_{n+1} = 1$ given $X_{n+1,1}$. This is given as:

$$\text{Est.}\,P(Y_{n+1} = 1 \mid X_{n+1,1}) = \frac{\exp(\hat\beta_1 + \hat\beta_2 X_{n+1,1})}{1 + \exp(\hat\beta_1 + \hat\beta_2 X_{n+1,1})}.$$

If the above probability is greater than one half, one is led to predict $Y_{n+1} = 1$; otherwise $Y_{n+1} = 0$ for the given $X_{n+1,1}$.
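A minimal sketch of this prediction rule (the fitted coefficients are placeholders; in practice they would come from the Newton-Raphson fit above):

```python
import numpy as np

beta1_hat, beta2_hat = -0.5, 1.2   # assumed fitted values (illustrative)
x_new = 0.7                        # new covariate value X_{n+1,1}

p_hat = 1 / (1 + np.exp(-(beta1_hat + beta2_hat * x_new)))
y_pred = 1 if p_hat > 0.5 else 0   # predict Y_{n+1} = 1 iff estimated P > 1/2
print(p_hat, y_pred)
```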