
Introduction to Instrumental Variables Methods
Dismas Alex
The Institute of Finance Management
Introduction
Motivation
• We often use non-experimental data to conduct empirical investigations
• The simple regression model would be:

  y = β0 + β1·T + u

• For example, the return to education, the effect of class size on student achievement, etc.
• What would be the problem with this simple model?
Motivation
• Suppose that we extend the model to include covariates/control variables:

  y = β0 + β1·T + β2·X + u

• What would be the problem with this model?
• Omitted variable bias: T is then endogenous (this is true for many variables we use)
• We need to think about the best way to mitigate the issue
Motivation
• The problem of omitted variable bias or unobserved heterogeneity can be quite extensive
• Often, important personal variables cannot be observed
• The unobservables are correlated with the explanatory variable of interest, T
• Thus T is endogenous
The consequence of an endogenous T
• Recall the key assumption: Cov(T, u) = 0

• If T is endogenous, then Cov(T, u) ≠ 0
• Thus, the estimated coefficient β̂1 is biased

• Instrumental variables (IV) offers one approach to estimating β1 (when instruments are available…)
What are the solutions to OVB and unobserved heterogeneity?
• Ignore the problem – biased and inconsistent estimates of the coefficients
• Find a suitable proxy variable for the unobserved variable, e.g. an IQ test for ability
• Assume that the unobserved variable does not change over time and obtain panel data:
– Fixed effects or
– First-differencing model
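As a small illustration of the panel-data route, the sketch below simulates two periods with a time-invariant unobservable and shows that first-differencing removes it. All variable names and parameter values are invented for this example, not taken from the lecture's data:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 500                               # individuals observed in two periods
a = rng.normal(size=n)                # unobserved, time-invariant "ability"
t1 = 0.5 * a + rng.normal(size=n)     # treatment correlated with ability
t2 = 0.5 * a + rng.normal(size=n)
beta = 2.0                            # true effect (chosen for the example)
y1 = beta * t1 + a + rng.normal(size=n)
y2 = beta * t2 + a + rng.normal(size=n)

# Pooled OLS slope is biased because t is correlated with a
pooled_t = np.concatenate([t1, t2])
pooled_y = np.concatenate([y1, y2])
b_ols = np.cov(pooled_t, pooled_y)[0, 1] / np.var(pooled_t, ddof=1)

# First-differencing wipes out a: (y2 - y1) = beta*(t2 - t1) + (e2 - e1)
dt, dy = t2 - t1, y2 - y1
b_fd = (dt @ dy) / (dt @ dt)

print(round(b_ols, 2), round(b_fd, 2))  # OLS biased upward; FD close to 2.0
```

The same logic underlies fixed effects: anything constant within an individual drops out of the within-person comparison.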
Example 1: The Case of Job Training and Earnings
• Suppose we want to measure the impact of job training on earnings. We observe earnings data for people who have and have not completed job training.

• We compare two groups: those who got trained and those who didn't.

• We want to infer the causal effect of job training on earnings.

• What if people who are more "motivated" are more likely to get training and, on average, earn more than less "motivated" people?
‒ The difference in average earnings between the trained and untrained confounds the effects of motivation and training
‒ Omitted variable bias: we would like to control for unobserved (and unobservable?) motivation
Example 1: The Case of Job Training and Earnings
• In this scenario, "motivation" acts as a potential confounding
variable, as it influences both whether someone receives job
training and their final earnings.
• This can indeed bias the observed relationship between
training and earnings, making it difficult to infer a causal
effect.
• Selection bias:
– If motivated individuals are more likely to pursue training, simply
comparing the earnings of those who trained vs. those who didn't will
be misleading.
– The trained group might have inherently higher earning potential due
to their motivation, not necessarily the training itself.
How to address this?
• Randomized controlled trials (RCTs):
– The gold standard for causal inference! If you randomly assign individuals to
receive training or not, any differences in earnings can be attributed to the
training, not pre-existing differences like motivation.

• Control variables:
– You can incorporate variables related to motivation (e.g., education level, prior
work experience) into your analysis. This statistically "controls" for their
influence, allowing you to isolate the effect of training while accounting for
motivation differences.

• Instrumental variables (IVs):


– Find a variable that influences the decision to train but not earnings directly.
This "instrument" can help identify the true causal effect of training by
separating it from the confounding influence of motivation.
Instrumental variables (IVs)

• A solution to the endogeneity problem is to find an instrumental variable (IV)
• An instrument z for the endogenous variable T must satisfy two conditions:
– Relevance: z is correlated with T, i.e. Cov(z, T) ≠ 0
– Exogeneity (exclusion restriction): z is uncorrelated with the error term, i.e. Cov(z, u) = 0, so z affects y only through T
IV Estimation in Multiple Regression
Two-Stage Least Squares (2SLS) Estimation
• Stage 1: regress the endogenous variable T on the instrument(s) z and the exogenous covariates; save the fitted values T̂
• Stage 2: regress y on T̂ and the exogenous covariates; the coefficient on T̂ is the 2SLS estimate of β1
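The two stages can be sketched with simulated data. This is a minimal illustration with invented parameter values; in practice the second-stage standard errors must be corrected for the generated regressor, which ivreg/ivregress do automatically:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 2000
u = rng.normal(size=n)                      # unobserved confounder ("motivation")
z = rng.binomial(1, 0.5, n).astype(float)   # instrument: randomized offer
t = 1.0 * z + 0.8 * u + rng.normal(size=n)  # endogenous treatment intensity
y = 1.5 * t + 2.0 * u + rng.normal(size=n)  # true effect of t on y is 1.5

Z = np.column_stack([np.ones(n), z])
X = np.column_stack([np.ones(n), t])

# Stage 1: regress t on the instrument, keep fitted values t_hat
pi = np.linalg.lstsq(Z, t, rcond=None)[0]
t_hat = Z @ pi

# Stage 2: regress y on t_hat; the slope is the 2SLS estimate
X_hat = np.column_stack([np.ones(n), t_hat])
b_2sls = np.linalg.lstsq(X_hat, y, rcond=None)[0][1]

b_ols = np.linalg.lstsq(X, y, rcond=None)[0][1]
print(round(b_ols, 2), round(b_2sls, 2))  # OLS overstates the effect; 2SLS is near 1.5
```

Because z is independent of u, the fitted values t_hat carry only the exogenous variation in t, which is what identifies β1.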
Example: Job Training
• OLS Results (from Stata):

regress earnings train x1-x13 , robust

Linear regression Number of obs = 5102


F( 14, 5087) = 38.35
Prob > F = 0.0000
R-squared = 0.0909
Root MSE = 18659

------------------------------------------------------------------------------
| Robust
earnings | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
train | 3753.362 536.3832 7.00 0.000 2701.82 4804.904
.
.
.

If intuition about the source of endogeneity is correct, this should be an overestimate of the effect of training.
Example: Job Training
• First-Stage Results (from Stata):

regress train offer x1-x13 , robust

Linear regression Number of obs = 5102


F( 14, 5087) = 390.75
Prob > F = 0.0000
R-squared = 0.3570
Root MSE = .39619

------------------------------------------------------------------------------
| Robust
train | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
offer | .6088885 .0087478 69.60 0.000 .591739 .6260379
.
.
.
Strong evidence that E[zixi] ≠ 0
Example: Job Training
• Reduced-Form Results (from Stata):

regress earnings offer x1-x13 , robust

Linear regression Number of obs = 5102


F( 14, 5087) = 34.19
Prob > F = 0.0000
R-squared = 0.0826
Root MSE = 18744

------------------------------------------------------------------------------
| Robust
earnings | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
offer | 970.043 545.6179 1.78 0.075 -99.60296 2039.689
.
.
.
Moderate evidence of a non-zero treatment effect
(maintaining exclusion restriction)
Example: Job Training
• IV Results (from Stata):
  Note: Some software reports R² after IV regression. This object is NOT meaningful and should not be used.

ivreg earnings (train = offer) x1-x13 , robust

Instrumental variables (2SLS) regression Number of obs = 5102


F( 14, 5087) = 34.38
Prob > F = 0.0000
R-squared = 0.0879
Root MSE = 18689

------------------------------------------------------------------------------
| Robust
earnings | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
train | 1593.137 894.7528 1.78 0.075 -160.9632 3347.238
.
.
.
Moderate evidence of a positive treatment effect (maintaining
exclusion restriction). Substantially attenuated relative to OLS,
consistent with intuition.
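With one instrument and one endogenous regressor, the IV estimate equals the reduced-form coefficient divided by the first-stage coefficient (indirect least squares). The coefficients printed in the three Stata outputs above line up exactly this way:

```python
# Coefficients taken from the Stata output above
first_stage = 0.6088885    # offer coefficient in: regress train offer x1-x13
reduced_form = 970.043     # offer coefficient in: regress earnings offer x1-x13

iv_estimate = reduced_form / first_stage
print(round(iv_estimate, 1))  # matches the ivreg coefficient on train (1593.137)
```

This ratio form also explains why a weak first stage is dangerous: a small denominator blows up both the estimate and its sampling variability.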
Example: Returns to Schooling
• Structural Equation:

  lwage = β0 + β1·educ + year-of-birth and state-of-birth controls + u

• First-Stage Equation:

  educ = π0 + π1,1·qob2 + π1,2·qob3 + π1,3·qob4 + controls + v

  – Note: E[zixi] ≠ 0 => π1,1 ≠ 0 or π1,2 ≠ 0 or π1,3 ≠ 0

• Reduced-Form Equation:

  lwage = δ0 + δ1,1·qob2 + δ1,2·qob3 + δ1,3·qob4 + controls + e

Example: Returns to Schooling
• OLS Results (from Stata):

xi: reg lwage educ i.yob i.sob , robust


i.yob _Iyob_30-39 (naturally coded; _Iyob_30 omitted)
i.sob _Isob_1-56 (naturally coded; _Isob_1 omitted)

Linear regression Number of obs = 329509


F( 60,329448) = 649.29
Prob > F = 0.0000
R-squared = 0.1288
Root MSE = .63366

------------------------------------------------------------------------------
| Robust
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .067339 .0003883 173.40 0.000 .0665778 .0681001
.
.
.
If intuition about the source of endogeneity is correct, this should be an overestimate of the effect of schooling.
Example: Returns to Schooling
• First-Stage Results (from Stata):
xi: regress educ i.qob i.sob i.yob , robust
Linear regression Number of obs = 329509
F( 62,329446) = 292.87
Prob > F = 0.0000
R-squared = 0.0572
Root MSE = 3.1863

------------------------------------------------------------------------------
| Robust
educ | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
_Iqob_2 | .0455652 .015977 2.85 0.004 .0142508 .0768797
_Iqob_3 | .1060082 .0155308 6.83 0.000 .0755683 .136448
_Iqob_4 | .1525798 .0157993 9.66 0.000 .1216137 .1835459

.
.
.
testparm _Iqob*

( 1) _Iqob_2 = 0
( 2) _Iqob_3 = 0
( 3) _Iqob_4 = 0 First-stage F-statistic.
F( 3,329446) = 36.06
Prob > F = 0.0000
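The first-stage F-statistic reported by testparm is the standard joint test of the excluded instruments, computed from the restricted and unrestricted sums of squared residuals. A self-contained sketch on simulated data (the coefficient sizes and sample size are invented, not the actual Census extract):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5000
qob = rng.integers(0, 4, size=n)       # quarter of birth, coded 0..3
Q = np.eye(4)[qob][:, 1:]              # dummies for quarters 2-4
# Hypothetical first stage: later quarters get slightly more schooling
educ = 12 + Q @ np.array([0.2, 0.4, 0.6]) + rng.normal(scale=3.2, size=n)

def ssr(X, y):
    """Sum of squared residuals from an OLS fit of y on X."""
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ b
    return e @ e

X_u = np.column_stack([np.ones(n), Q])  # unrestricted: intercept + qob dummies
X_r = np.ones((n, 1))                   # restricted: intercept only
q = 3                                   # restrictions: the three qob coefficients

F = ((ssr(X_r, educ) - ssr(X_u, educ)) / q) / (ssr(X_u, educ) / (n - X_u.shape[1]))
print(round(F, 1))
```

A common rule of thumb treats a first-stage F below about 10 as a sign of weak instruments; the value of 36.06 above comfortably clears that bar.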
Example: Returns to Schooling
• Reduced-Form Results (from Stata):
xi: regress lwage i.qob i.sob i.yob , robust

Linear regression Number of obs = 329509


F( 62,329446) = 147.83
Prob > F = 0.0000
R-squared = 0.0290
Root MSE = .66899

------------------------------------------------------------------------------
| Robust
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
_Iqob_2 | .0028362 .0033445 0.85 0.396 -.0037188 .0093912
_Iqob_3 | .0141472 .0032519 4.35 0.000 .0077736 .0205207
_Iqob_4 | .0144615 .0033236 4.35 0.000 .0079472 .0209757
.
.

testparm _Iqob*

( 1) _Iqob_2 = 0
( 2) _Iqob_3 = 0
( 3) _Iqob_4 = 0

F( 3,329446) = 10.43
Prob > F = 0.0000
Example: Returns to Schooling
• 2SLS Results (from Stata):
xi: ivregress 2sls lwage (educ = i.qob) i.yob i.sob , robust

Instrumental variables (2SLS) regression Number of obs = 329509


Wald chi2(60) = 9996.12
Prob > chi2 = 0.0000
R-squared = 0.0929
Root MSE = .64652

------------------------------------------------------------------------------
| Robust
lwage | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .1076937 .0195571 5.51 0.000 .0693624 .146025
.
.
.

Bigger than OLS? Two common explanations: measurement error in reported schooling attenuates OLS toward zero, and with heterogeneous returns 2SLS estimates the return for those whose schooling responds to quarter of birth (a local average treatment effect).
