Key expressions & concepts
Bad control problem
o Bias in the OLS estimator that arises when a control variable Z, added because it is correlated with X, is itself an outcome of X
o Z (X2): regressors that could themselves be outcomes
    E.g. earnings = α + β college degree + ui
    We add a seemingly relevant regressor Z that is
        Correlated with X (so omitting it would make the OLS estimator biased & inconsistent)
        Able to explain Y
    Problem: Y as well as Z can be outcomes of X -> controlling for Z then biases the estimated causal effect of X
Causal inference
o Usually we cannot give the OLS estimate a causal interpretation
o Why not?
    Omitted variables / omitted individual characteristics that could cause Y
    Reverse causality – Y causes X
o Both problems violate the zero conditional mean assumption E(ui|Xi) = 0, which a causal interpretation requires
Central limit theorem
o When n is large, the distribution of averages of i.i.d. random variables is approximately normal (see the simulation sketch below)
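A minimal simulation sketch (not part of the original notes; all numbers are made up): sample averages of draws from a skewed distribution behave approximately like a normal with standard deviation σ/√n.

```python
# Minimal sketch: sample means of i.i.d. draws are approximately normal for large n.
import numpy as np

rng = np.random.default_rng(0)
n, reps = 500, 10_000          # observations per sample, number of samples

# Draw from a heavily skewed (exponential) population with mean = 1, sd = 1
samples = rng.exponential(scale=1.0, size=(reps, n))
means = samples.mean(axis=1)   # one sample average per replication

# The CLT says: means ~ approx. N(mu, sigma^2 / n)
print("mean of sample averages:", means.mean())   # close to 1
print("sd of sample averages:  ", means.std())    # close to 1/sqrt(n) ~= 0.0447
print("theoretical sd:         ", 1 / np.sqrt(n))
```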
Cross-sectional data
o Data on different entities (e.g. workers, consumers, firms, etc.) for a single
time period
o E.g. data on test scores in California -> data for 420 entities (school districts) for a single time period (1999)
Errors-in-variables bias in the OLS estimator
o When an independent variable (X) is measured imprecisely
o This bias persists even in large samples (the OLS estimator is inconsistent); see the formula below
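For reference, the standard classical measurement error result (not derived in these notes): if the observed regressor is X plus noise w that is uncorrelated with X and with u, the OLS slope is attenuated toward zero even as n grows:

```latex
\hat{\beta}_1 \;\xrightarrow{\;p\;}\; \frac{\sigma_X^2}{\sigma_X^2 + \sigma_w^2}\,\beta_1
```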
Error term ui
o All factors, other than Xi, that are determinants of Yi
Endogeneity
o X is correlated with the error term (e.g. Y = income, X = education, u contains skill -> X is correlated with u)
Exogeneity
o X is not correlated with the error term
o X is determined by other factors outside the model
Hausman Test
o Test for endogeneity of regressor X
Why? Using an instrument is only necessary if Xi is endogenous
(correlated with u)
o Test: H0: E[u|X] = 0
o Under H0, both the TSLS (FE) & OLS (RE) estimators are consistent, but OLS (RE) is more efficient
o Under H1, only the TSLS (FE) estimator is consistent
o If H-statistic > critical value (e.g. at the 5% level), then H0 is rejected -> X is endogenous (correlated with u) – see the sketch below
o If we reject H0: we prefer the FE model
o Panel-data version of the test:
    H0: RE is appropriate
    H1: FE is appropriate
    Result: Prob>chi2 = 0 -> reject H0 -> use FE
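A minimal sketch of the endogeneity-test idea, using the regression-based (Durbin–Wu–Hausman) variant on simulated data; all variable names and numbers are made up. (The RE-vs-FE comparison quoted above is usually run with canned routines, e.g. Stata's hausman command.)

```python
# Sketch of a regression-based Hausman (Durbin-Wu-Hausman) endogeneity test
# on simulated data; variable names are made up for illustration.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 2_000
z = rng.normal(size=n)                      # instrument
e = rng.normal(size=n)                      # common shock -> makes x endogenous
x = 0.8 * z + e + rng.normal(size=n)        # endogenous regressor
y = 1.0 + 2.0 * x + e + rng.normal(size=n)  # structural equation, u = e + noise

# 1st stage: regress x on the instrument, keep the residuals v_hat
first = sm.OLS(x, sm.add_constant(z)).fit()
v_hat = first.resid

# Augmented regression: y on x and v_hat.
# H0 (x exogenous): coefficient on v_hat = 0.  A small p-value rejects H0.
aug = sm.OLS(y, sm.add_constant(np.column_stack([x, v_hat]))).fit()
print("t-stat on v_hat:", aug.tvalues[2], " p-value:", aug.pvalues[2])
```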
Homoskedasticity
o The error variance measures how far the observations scatter around the regression line
o Var(ui|Xi) = constant (the error variance does not vary systematically with X)
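In symbols (standard notation, not spelled out in the notes):

```latex
\operatorname{Var}(u_i \mid X_i) = \sigma_u^2 \ \text{ for all } i
\qquad \text{vs. heteroskedasticity: } \operatorname{Var}(u_i \mid X_i) \text{ depends on } X_i
```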
i.i.d. independently & identically distributed
o sample is randomly drawn from the population (independent)
o all observations in sample are drawn from same distribution (identically
distributed)
Multicollinearity
o high intercorrelations among two or more independent variables in a
multiple regression model
o one of the 4 assumptions in multiple regression: “no perfect multicollinearity”
    if one of the regressors is a perfect linear function of the other regressors
        e.g. you want to estimate the coefficient on STR in a regression of TestScorei on STRi and PctELi, but you make a typo and accidentally type in STRi a second time instead of PctELi -> now you regress TestScorei on STRi and STRi -> perfect multicollinearity (see the sketch below)
    then: it is impossible to compute the OLS estimator
    solution: usually just modify the regressors to eliminate the problem
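A minimal sketch (simulated data, made-up numbers) of why the typo example above breaks OLS: with a duplicated regressor, X'X is rank deficient, so the inverse in the OLS formula does not exist.

```python
# Sketch: duplicating a regressor makes X'X singular, so (X'X)^(-1) does not
# exist and the OLS formula cannot be computed.  Data are simulated.
import numpy as np

rng = np.random.default_rng(2)
n = 100
str_ = rng.normal(20, 2, size=n)                   # stand-in for STR
X = np.column_stack([np.ones(n), str_, str_])      # constant, STR, STR again (the "typo")

XtX = X.T @ X
print("rank of X'X:", np.linalg.matrix_rank(XtX), "of", XtX.shape[0])  # rank 2 < 3

try:
    np.linalg.inv(XtX)                             # exactly singular: duplicated column
except np.linalg.LinAlgError as err:
    print("cannot invert X'X:", err)               # perfect multicollinearity
```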
Multiple regression
o A method that can eliminate omitted variable bias
How? If we have data on the omitted variables, then we can include
them as additional regressors and thereby estimate the causal effect of
one regressor while holding constant the other variables
o Also a method for making better predictions than a single-regressor model, by using multiple variables as predictors
OLS estimator
o A method to estimate the unknown parameters in a linear regression model
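For reference, the standard single-regressor formulas (not written out in the notes): the OLS estimators minimize the sum of squared residuals Σ(Yi − b0 − b1Xi)², giving

```latex
\hat{\beta}_1 = \frac{\sum_{i=1}^{n}(X_i-\bar{X})(Y_i-\bar{Y})}{\sum_{i=1}^{n}(X_i-\bar{X})^2},
\qquad
\hat{\beta}_0 = \bar{Y} - \hat{\beta}_1 \bar{X}
```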
Omitted variables
o Variables that are left out of the regression
Omitted variables bias
o If the regressor (X) is correlated with a variable that has been omitted from
the analysis (variable in u) and that determines, in part, the dependent
variable (Y), then the OLS estimator will have omitted variable bias
o (1) the omitted variable is correlated with the regressor X (so X and u are correlated)
o (2) the omitted variable is also a determinant of Y
o Then the first least squares assumption E(ui|Xi) = 0 does not hold -> OLS estimator is biased & inconsistent (see the simulation sketch below)
o Solutions
use of instrumental variables regressions (IV)
panel data estimation
use of randomized controlled experiments
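A minimal simulation sketch (not from the notes; data and coefficients are made up) showing the bias, and how including the omitted variable as an additional regressor (multiple regression) removes it.

```python
# Sketch: omitted variable bias on simulated data, and the multiple-regression fix.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
n = 5_000
z = rng.normal(size=n)                             # omitted variable (e.g. "ability")
x = 0.7 * z + rng.normal(size=n)                   # x is correlated with z
y = 1.0 + 2.0 * x + 1.5 * z + rng.normal(size=n)   # z also determines y; true beta_x = 2

short = sm.OLS(y, sm.add_constant(x)).fit()                        # z omitted -> biased
long_ = sm.OLS(y, sm.add_constant(np.column_stack([x, z]))).fit()  # z included -> unbiased

print("beta_x, z omitted:  ", round(short.params[1], 3))   # noticeably above 2
print("beta_x, z included: ", round(long_.params[1], 3))   # close to 2
```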
Overfitting
o When a model is too complex, it begins to describe the random error in the data rather than the relationships between the variables
o Result: misleading R2 values, regression coefficients and p-values
Panel Structure
o Allows us to control for unobserved heterogeneity
o Mitigates omitted variable bias
Randomized controlled experiment
o Controlled: there are both a control group that receives no treatment and a
treatment group that receives treatment
o Randomized: the treatment is assigned randomly ; randomly pick who gets
the treatment
Sargan Test (J-Test)
o Tests exogeneity of instruments
o If we have more instruments (m) than endogenous regressors (k), i.e. m > k, the coefficients are overidentified
o In case of overidentification (m > k): we can test instrument exogeneity with a J-Test
o H0: all instruments are exogenous
o Results of J-Test (see the sketch below)
    J-statistic > 5% critical value: reject H0 -> at least one instrument is endogenous
    If the TSLS estimates obtained with the different instruments are close to each other -> all tested instruments are plausibly exogenous
    If one instrument produces very different estimates -> one or both instruments are probably not exogenous
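A minimal sketch of the J-test logic on simulated data with two instruments for one endogenous regressor; everything here (names, numbers, the manual 2SLS) is illustrative, and the nR² (Sargan) form of the statistic is used.

```python
# Sketch of an overidentification (Sargan/J) test on simulated data with
# m = 2 instruments for k = 1 endogenous regressor.  Names and data are made up.
import numpy as np
import statsmodels.api as sm
from scipy import stats

rng = np.random.default_rng(4)
n = 5_000
z1, z2 = rng.normal(size=n), rng.normal(size=n)   # two (valid) instruments
e = rng.normal(size=n)                            # shock that makes x endogenous
x = 0.6 * z1 + 0.4 * z2 + e + rng.normal(size=n)
y = 1.0 + 2.0 * x + e + rng.normal(size=n)

Z = sm.add_constant(np.column_stack([z1, z2]))    # instruments (+ constant)

# Manual 2SLS: 1st-stage fitted values, then 2nd stage
x_hat = sm.OLS(x, Z).fit().fittedvalues
b = sm.OLS(y, sm.add_constant(x_hat)).fit().params

# 2SLS residuals must be computed with the ORIGINAL x, not x_hat
u_hat = y - (b[0] + b[1] * x)

# Sargan statistic: n * R^2 from regressing the 2SLS residuals on the instruments;
# under H0 (all instruments exogenous) it is approx. chi2 with m - k = 1 d.o.f.
aux = sm.OLS(u_hat, Z).fit()
J = n * aux.rsquared
print("J =", round(J, 3), " p-value =", round(1 - stats.chi2.cdf(J, df=1), 3))
```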
Two stage least squares (TSLS)
o If the instrument Z satisfies the conditions of instrument relevance and
exogeneity, the coefficient β1 can be estimated using an IV estimator (TSLS)
o 1st stage
    Decompose X into two components: a problematic component that may be correlated with the regression error and another, problem-free component that is uncorrelated with the error
o 2nd stage
    Use the problem-free component to estimate β1 (see the sketch below)
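A minimal "by hand" sketch of the two stages on simulated data (names and numbers are made up). Running the stages manually reproduces the TSLS coefficient but not the correct standard errors; dedicated TSLS routines adjust them.

```python
# Sketch of TSLS "by hand" on simulated data.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
n = 5_000
z = rng.normal(size=n)                      # instrument: relevant & exogenous
e = rng.normal(size=n)                      # shock in u, correlated with x
x = 0.8 * z + e + rng.normal(size=n)        # endogenous regressor
y = 1.0 + 2.0 * x + e + rng.normal(size=n)  # true beta_1 = 2

ols = sm.OLS(y, sm.add_constant(x)).fit()                 # biased (x is endogenous)

# 1st stage: isolate the "problem-free" part of x predicted by z
x_hat = sm.OLS(x, sm.add_constant(z)).fit().fittedvalues
# 2nd stage: regress y on the problem-free component
tsls = sm.OLS(y, sm.add_constant(x_hat)).fit()

print("OLS  beta_1:", round(ols.params[1], 3))    # away from 2
print("TSLS beta_1:", round(tsls.params[1], 3))   # close to 2
```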
Validity – internal
o Statistical inferences about causal effects are valid for the population being
studied
o Conditions
(1) OLS estimator needs to be unbiased and consistent
(2) Hypothesis tests should have the desired significance level, and confidence intervals should have the desired confidence level (computed from the standard error – SEs should be consistent)
Validity – external
o Statistical inferences about causal effects can be generalized from the
population and setting studied to other populations and settings
Weak instruments
o Instrumental variables that have low predictive power for the endogenous regressor X
o Valid instruments (Z) should be
(1) relevant – Z highly correlated with X
(2) exogenous – Z is correlated with Y solely through its correlation with
X; so Z is uncorrelated with the error term u
o Test for instrument relevance
    Investigate the first-stage F-statistic (we want: at least one Z has a coefficient ≠ 0 in the 1st stage – then the instrument is not weak)
    If F > 10, then the instruments are not weak, i.e. relevant (rule of thumb) – see the sketch below
o Test for exogeneity
    Difficult to test directly; with overidentification, use the J-Test (see above)
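A minimal sketch of the relevance check on simulated data (made-up names and numbers): the first-stage F-statistic tests whether all instrument coefficients are zero in the first stage.

```python
# Sketch: first-stage F-statistic for instrument relevance, on simulated data.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(6)
n = 2_000
z1, z2 = rng.normal(size=n), rng.normal(size=n)   # instruments
x = 0.5 * z1 + 0.3 * z2 + rng.normal(size=n)      # endogenous regressor

# 1st stage: regress x on the instruments (plus a constant).  With no other
# exogenous regressors, the overall regression F is the first-stage F.
X1 = sm.add_constant(np.column_stack([z1, z2]))
first = sm.OLS(x, X1).fit()

print("first-stage F:", round(first.fvalue, 2))   # rule of thumb: want F > 10
```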
Within estimator
o Exploits the within-individual variation (over time), i.e. deviations from each individual's mean (see the sketch below)
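A minimal sketch (simulated panel, made-up names) of the within transformation: demean y and x by individual, then run OLS on the demeaned data.

```python
# Sketch of the within (fixed effects) transformation on a simulated panel.
import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(7)
n_id, n_t = 300, 5
ids = np.repeat(np.arange(n_id), n_t)
alpha = rng.normal(size=n_id)[ids]               # unobserved individual effect
x = 0.5 * alpha + rng.normal(size=n_id * n_t)    # x correlated with the effect
y = 2.0 * x + alpha + rng.normal(size=n_id * n_t)

df = pd.DataFrame({"id": ids, "x": x, "y": y})

# Within transformation: subtract each individual's mean from y and x
demeaned = df[["x", "y"]] - df.groupby("id")[["x", "y"]].transform("mean")

fe = sm.OLS(demeaned["y"], demeaned["x"]).fit()  # within estimator (no constant needed)
print("within estimate of beta:", round(fe.params.iloc[0], 3))   # close to 2
```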