Logistic Regression in Biostatistics

1. Logistic regression is used to model binary outcome variables, such as disease presence/absence, and extends linear regression to non-normally distributed outcomes.
2. The logistic regression model relates the log-odds of the outcome (logit) to the predictor variables. It allows estimation of odds ratios to quantify the effect of predictors on the outcome.
3. An example uses logistic regression to predict lymph node metastasis in prostate cancer patients based on age, serum acid phosphatase level, X-ray results, tumour size, and tumour grade. The odds ratios estimated from the model quantify the effect of each predictor on the odds of metastasis.

Uploaded by

IuliaOpris


05‐11‐2020

Logistic regression

1 Applied Methods in Biostatistics - Week 2 2019

Generalization of the linear regression model

In many practical situations the linear regression model is inadequate.

For example, when:
• the outcome has two possible responses (binary data), or
• the outcome represents count data (non-negative integers),
it makes no sense to model the outcome as normally distributed.
Generalized linear models (GLMs) are an extension of linear regression: regression models for non-normally distributed outcome variables.


Binary outcome variable

In many studies the outcome of interest is the presence or absence of some condition.
Examples:
• smoking status
• responding to a treatment
• presence or absence of cancer
• survival status of a subject after surgery: dead or alive
• having a myocardial infarction or CHD: yes/no
'Success' ('yes'/1) and 'failure' ('no'/0) are often used as generic terms for the two possible responses.
The interest is in quantifying the risk or odds of success, i.e. of the occurrence of some event of interest.


Example: Prostatic cancer

A study of 53 prostate cancer patients. Before surgery, two continuous exposure variables (age, serum acid phosphatase) and three categorical (binary) exposure variables (X-ray, tumour size, tumour grade) were measured. The patients then had surgery (laparotomy) to determine whether there was nodal involvement, i.e. lymph node metastases (NI = 1) or not (NI = 0), in order to adapt the treatment regimen for the patient.

Pat  NI  Age  Acid  Xray   Size     Grade
                    (pos)  (large)  (serious)
1    0   66   0.48  0      0        0
2    0   68   0.56  0      0        0
…
52   1   64   0.89  1      1        0
53   1   68   1.26  1      1        1
Brown, B.W. (1980)

Risk outcome: odds

Studies
• Case-control / Cross sectional
• Cohort: cumulative incidence rate

Simple (exploratory) inference

• Confidence intervals & hypothesis tests
• comparing risks between exposed/unexposed groups
• Test for association (two or more groups)
• Chi-square tests / Fisher's exact test


Logistic regression model

The model is based on:

• Relationship:
  logit(p) = log(p/(1-p)) = log-odds(p) = β0 + β1x1 + β2x2 + … + βkxk
• I.e. p is not linear in the βs, but logit(p) is
• Data from a binomial distribution

Inference similar to linear model

• Allows many categorical & numerical indep. variables

Estimation & inference: computer
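
As a minimal sketch of the relationship above, the following Python snippet shows that equal steps in x move logit(p) by a constant amount, while p itself changes non-linearly. The function names `logit` and `inv_logit` and the coefficient values are illustrative assumptions, not part of the course material.

```python
import math

def logit(p):
    """Log-odds of a probability p in (0, 1)."""
    return math.log(p / (1.0 - p))

def inv_logit(eta):
    """Inverse of the logit: maps a linear predictor back to a probability."""
    return 1.0 / (1.0 + math.exp(-eta))

# Hypothetical coefficients, chosen only for illustration.
beta0, beta1 = -2.0, 1.0

xs = [0, 1, 2, 3, 4]
etas = [beta0 + beta1 * x for x in xs]      # linear in x
probs = [inv_logit(eta) for eta in etas]    # S-shaped in x

for x, eta, p in zip(xs, etas, probs):
    print(f"x={x}  logit(p)={eta:+.2f}  p={p:.3f}")
```

The printed logits rise by exactly beta1 per unit of x, whereas the increments in p are largest near p = 0.5 and shrink towards 0 and 1.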


Purposes of logistic regression

Effect estimation
• exp(β1) = OR1 = effect of variable x1, expressed as an odds ratio
• Stata: logistic reports the effect estimates exp(β) directly

Prediction:
• Best model for predicting risk p of disease for new cases
• Stata: logit reports the parameter estimates of β0, β1, β2, …
• Rule of thumb: at least 10 cases and 10 controls for each independent variable in the model


Estimation:
Interpretation of the coefficients
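Interpretation of the coefficients is similar to linear regression. However, since it is the logit that is linear in the predictors, the coefficients have an analogous interpretation on the logit (log-odds) scale.
Logit (πNI(Xray, Size, Age)) = β0 + β1Xray + β2 Size + β3 Age

Binary exposure (comparing X-ray examination: 1 = positive finding, 0 = negative finding, with Size and Age fixed)

$$\mathrm{OR}_{Xray} = \frac{\mathrm{odds}(NI = 1 \mid Xray = 1, Size, Age)}{\mathrm{odds}(NI = 1 \mid Xray = 0, Size, Age)} = \frac{e^{\beta_0 + \beta_1 + \beta_2 Size + \beta_3 Age}}{e^{\beta_0 + \beta_2 Size + \beta_3 Age}} = e^{\beta_1}$$

Continuous exposure variable (comparing Age + 1 with Age, with Xray and Size fixed)

$$\mathrm{OR}_{Age} = \frac{\mathrm{odds}(NI = 1 \mid Xray, Size, Age + 1)}{\mathrm{odds}(NI = 1 \mid Xray, Size, Age)} = \frac{e^{\beta_0 + \beta_1 Xray + \beta_2 Size + \beta_3 (Age + 1)}}{e^{\beta_0 + \beta_1 Xray + \beta_2 Size + \beta_3 Age}} = e^{\beta_3}$$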


Estimation Example (1):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age
. logit NI Xray Size Age

Iteration 0:   log likelihood = -35.126076
Iteration 1:   log likelihood = -26.176433
Iteration 2:   log likelihood = -26.042916
Iteration 3:   log likelihood =  -26.04263
Iteration 4:   log likelihood =  -26.04263

Logistic regression                               Number of obs   =         53
                                                  LR chi2(3)      =      18.17
                                                  Prob > chi2     =     0.0004
Log likelihood = -26.04263                        Pseudo R2       =     0.2586

------------------------------------------------------------------------------
          NI |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
        Xray |   2.175658   .7644116     2.85   0.004     .6774385    3.673877
        Size |   1.596897   .7079243     2.26   0.024     .2093913    2.984403
         Age |  -.0604558    .054447    -1.11   0.267      -.16717    .0462584
       _cons |   1.518419    3.22939     0.47   0.638    -4.811069    7.847908
------------------------------------------------------------------------------


Estimation Example (2):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age

Logistic regression                               Number of obs   =         53
                                                  LR chi2(3)      =      18.17
                                                  Prob > chi2     =     0.0004
Log likelihood = -26.04263                        Pseudo R2       =     0.2586

------------------------------------------------------------------------------
          NI | Odds Ratio   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
        Xray |   8.807976   6.732919     2.85   0.004     1.968828    39.40437
        Size |   4.937689    3.49551     2.26   0.024     1.232927     19.7747
         Age |   .9413353   .0512529    -1.11   0.267     .8460557    1.047345
------------------------------------------------------------------------------
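
The link between the two Stata outputs can be checked by hand: exponentiating a coefficient and the limits of its confidence interval from the logit output should give the odds ratio and its interval. The sketch below assumes the interval is the usual Wald interval (estimate ± 1.96 standard errors on the log-odds scale); the helper name `odds_ratio_with_ci` is hypothetical.

```python
import math

# Coefficients and standard errors copied from the Stata `logit` output.
coef = {"Xray": 2.175658, "Size": 1.596897, "Age": -0.0604558}
se = {"Xray": 0.7644116, "Size": 0.7079243, "Age": 0.054447}

Z975 = 1.959964  # 97.5% quantile of the standard normal distribution

def odds_ratio_with_ci(b, s, z=Z975):
    """Return (OR, lower, upper): exponentiate the coefficient and its Wald CI."""
    return math.exp(b), math.exp(b - z * s), math.exp(b + z * s)

results = {name: odds_ratio_with_ci(coef[name], se[name]) for name in coef}
for name, (or_, lo, hi) in results.items():
    print(f"{name:5s} OR={or_:.4f}  95% CI=({lo:.4f}, {hi:.4f})")

# For a continuous exposure the OR refers to a one-unit increase;
# a 10-year increase in age multiplies the odds by exp(10 * beta_Age).
or_age_10 = math.exp(10 * coef["Age"])
print(f"Age, per 10 years: OR={or_age_10:.4f}")
```

Note that the interval is symmetric around the coefficient on the log-odds scale but asymmetric around the odds ratio.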


Inferences - Testing overall regression

Hypotheses: H0: β1 = β2 = … = βn = 0
(e.g., Xray, Size and Age have no predictive value for nodal involvement in prostatic cancer)

Likelihood ratio (LR) statistic compares two models

1. minimal model = logistic regression model under H0 (intercept only)
2. full model = logistic regression model accounting for (all) the exposure variables of interest
For each model the maximized likelihood L is calculated:
1. Lm := L(β̂0) for the minimal model
2. Lf := L(β̂0, β̂1, β̂2, …, β̂n) for the full model

Likelihood ratio statistic

LR = 2{log(Lf) − log(Lm)} = 2 log(Lf/Lm), which under H0 is approximately chi-square distributed with n degrees of freedom
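
As a worked check, the LR statistic for the prostate cancer example can be computed from the two log-likelihoods in the Stata iteration log, where iteration 0 is the intercept-only (minimal) model and the final iteration is the full model. The sketch below uses only the Python standard library; the function `chi2_sf_3df` is a hypothetical helper that uses a closed form valid only for 3 degrees of freedom.

```python
import math

# Log-likelihoods from the Stata iteration log of `logit NI Xray Size Age`.
loglik_minimal = -35.126076   # iteration 0: intercept-only model
loglik_full = -26.04263       # final iteration: model with Xray, Size, Age
df = 3                        # coefficients set to zero under H0

lr = 2.0 * (loglik_full - loglik_minimal)

def chi2_sf_3df(x):
    """Upper-tail probability of a chi-square distribution with 3 df (closed form)."""
    return (math.erfc(math.sqrt(x / 2.0))
            + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0))

p_value = chi2_sf_3df(lr)
print(f"LR chi2({df}) = {lr:.2f}, p = {p_value:.4f}")
```

The result should agree with the `LR chi2(3)` and `Prob > chi2` lines of the Stata output, up to rounding.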


Estimation Example (overall test):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age


Inferences - Wald test
Which factors had a significant effect on the dependent variable, adjusted for all the other independent variables?

• Hypotheses: $H_0: \beta_i = 0$ vs. $H_1: \beta_i \neq 0$

• Test statistic:

$$Z = \frac{\hat{\beta}_i}{\mathrm{se}(\hat{\beta}_i)} \quad \text{approximately } N(0,1)\text{-distributed under } H_0$$

with $Z^2 \sim \chi^2$-distributed, degrees of freedom = 1
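
The z and P>|z| columns of the Stata coefficient table are Wald tests of this kind. A minimal sketch, using the coefficients and standard errors from the `logit` output of the prostate cancer example (the helper name `wald_test` is hypothetical):

```python
import math

# (coefficient, standard error) pairs copied from the Stata `logit` output.
estimates = {
    "Xray": (2.175658, 0.7644116),
    "Size": (1.596897, 0.7079243),
    "Age": (-0.0604558, 0.054447),
}

def wald_test(beta_hat, se):
    """Return (z, two-sided p-value) for H0: beta = 0."""
    z = beta_hat / se
    # For a standard normal Z, P(|Z| > |z|) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p

wald = {name: wald_test(b, s) for name, (b, s) in estimates.items()}
for name, (z, p) in wald.items():
    print(f"{name:5s} z={z:6.2f}  P>|z|={p:.3f}")
```

At the 5% level, Xray and Size have Wald p-values below 0.05 while Age does not, in line with the confidence interval for Age containing zero.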

Estimation Example (Wald test):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age


Maximum likelihood estimation

The idea: determine the parameter values that maximize the probability (likelihood) of the observed sample data.
From a statistical point of view, the method of maximum likelihood is widely applicable and yields estimators with good statistical properties.
It also provides an efficient method for quantifying uncertainty through confidence bounds.
Although the methodology of maximum likelihood estimation is simple, the implementation is computationally intensive. With today's computing power, however, this complexity is not a big obstacle.
Maximizing the likelihood function L(ϑ) is equivalent to maximizing the log-likelihood function l(ϑ) = log L(ϑ).
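
To make the idea concrete, here is a minimal sketch of maximum likelihood for the simplest logistic model, the intercept-only (minimal) model, fitted by Newton-Raphson iterations on the log-likelihood. It assumes that 20 of the 53 patients had nodal involvement; that count is not stated in these slides and is an assumption made for illustration. Under that assumption, the maximized log-likelihood comes out very close to the iteration 0 value in the Stata log for the prostate cancer example.

```python
import math

n = 53   # number of patients (from the slides)
k = 20   # ASSUMPTION: number of patients with NI = 1 (not stated in the slides)

def log_likelihood(beta0):
    """Binomial log-likelihood of an intercept-only logistic model."""
    return k * beta0 - n * math.log(1.0 + math.exp(beta0))

beta0 = 0.0  # starting value
for iteration in range(25):
    p = 1.0 / (1.0 + math.exp(-beta0))
    score = k - n * p                 # first derivative of the log-likelihood
    information = n * p * (1.0 - p)   # minus the second derivative
    step = score / information
    beta0 += step                     # Newton-Raphson update
    if abs(step) < 1e-10:
        break

p_hat = 1.0 / (1.0 + math.exp(-beta0))
print(f"beta0_hat={beta0:.6f}  p_hat={p_hat:.6f}  logL={log_likelihood(beta0):.6f}")
```

For this model the maximum has a closed form (the estimated probability equals the observed proportion k/n), so the iterations mainly illustrate how software arrives at the estimate when no closed form exists.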


Estimation Example (ML estimation):

Model: Logit (πNI(Xray, Size, Age)) = Log odds (NI=1|Xray, Size,Age)
= β0 + β1Xray + β2 Size + β3 Age

. logistic NI Xray Size Age


Prediction
The logistic regression approach is suitable for predicting the success probability, or outcome risk, for new cases as a function of their exposures.
Example: Prostatic cancer

$$\mathrm{logit}\,\hat{\pi}_{NI}(Xray, Size, Age) = \hat{\beta}_0 + \hat{\beta}_1 Xray + \hat{\beta}_2 Size + \hat{\beta}_3 Age = 1.52 + 2.18\,Xray + 1.60\,Size - 0.06\,Age$$

Xray  Size  Age  logit(π_NI)  π_NI = P(NI = 1)
0     0     68   -2.56        0.072
1     0     68   -0.38        0.406
0     1     51    0.06        0.515
1     1     57    1.88        0.868
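
The prediction step can be sketched in a few lines: compute the linear predictor from the rounded coefficient estimates, then apply the inverse logit, p = 1 / (1 + exp(-logit)). The function name `predict_ni` is a hypothetical helper; the four covariate patterns are those of the table.

```python
import math

# Rounded coefficient estimates as used on the slide.
B0, B_XRAY, B_SIZE, B_AGE = 1.52, 2.18, 1.60, -0.06

def predict_ni(xray, size, age):
    """Return (logit, probability) of nodal involvement for one patient."""
    eta = B0 + B_XRAY * xray + B_SIZE * size + B_AGE * age
    return eta, 1.0 / (1.0 + math.exp(-eta))

patients = [(0, 0, 68), (1, 0, 68), (0, 1, 51), (1, 1, 57)]
predictions = [predict_ni(*row) for row in patients]
for (xray, size, age), (eta, p) in zip(patients, predictions):
    print(f"Xray={xray} Size={size} Age={age}  logit={eta:+.2f}  P(NI=1)={p:.3f}")
```

A negative logit always corresponds to a probability below 0.5 and a positive logit to a probability above 0.5, which is a quick sanity check for any such table.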
