15-Econometrics-Linear Regression

The document discusses heteroskedasticity in econometrics, specifically focusing on a dataset from California's SMSAs in 1972. It covers various tests for heteroskedasticity, including the Breusch-Pagan and White tests, and discusses the implications of heteroskedasticity on OLS estimators. Additionally, it presents methods for addressing heteroskedasticity, such as using loglinear models and robust standard errors.


Econometrics

University of Milan-Bicocca

Course lecturer:
Maryam Ahmadi
[email protected]

Heteroskedasticity

Problem 14 & Answer.

The data set AIRQ contains observations for 30 standard metropolitan statistical areas (SMSAs) in California for 1972 on the following variables:

airq: indicator for air quality (the lower the better);
vala: value added of companies (in 1000 US$);
rain: amount of rain (in inches);
coas: dummy variable, 1 for SMSAs at the coast, 0 for others;
dens: population density (per square mile);
medi: average income per head (in US$).

a. Estimate a linear regression model that explains airq from the other variables using ordinary least
squares. Interpret the coefficient estimates.

Coastal regions, ceteris paribus, have a better air quality.

Keeping other factors fixed, population density does not significantly affect air quality (the effect is negative but
insignificant).
A higher value added for a region, more rain, or a higher household income do not significantly affect
air quality (the effect of each one, ceteris paribus, is positive but insignificant).
The F-statistic, 2.98, with a p-value of 0.031, indicates a marginal rejection (of the joint effect being zero) at the
5% significance level.
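A minimal Stata sketch of this estimation (assuming the AIRQ data are in memory with the variable names listed above):

reg airq vala rain coas dens medi

The coefficient on coas then gives the ceteris paribus difference in air quality between coastal and other SMSAs.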
b. Perform a Breusch–Pagan test for heteroskedasticity related to all five
explanatory variables.

The test statistic of the Breusch–Pagan test is N*R2 = 3.141. Given that the 10% critical value of the
Chi-squared distribution with 5 degrees of freedom is 9.24, we cannot reject the null of homoskedasticity.
Note, however, that the non-rejection may be due to a lack of power, caused by the small
number of observations and the general nature of the alternative hypothesis.
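A sketch of how this test can be computed manually in Stata, following the same auxiliary-regression approach used later in these slides (the names e and e2 are chosen here for illustration):

reg airq vala rain coas dens medi
predict e, resid
gen e2 = e^2
reg e2 vala rain coas dens medi
display e(N)*e(r2)

The last line displays the N*R2 test statistic, to be compared with the Chi-squared critical value with 5 degrees of freedom.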
c. Perform a White test for heteroskedasticity. Comment upon the appropriateness of the White test
in light of the number of observations and the degrees of freedom of the test.

The White test is based on including the 5 explanatory variables, their squares (4, because the square of
the dummy coas is identical to coas itself) and their cross-products (10), which leads to a large number of
regressors in the test regression: 19 regressors in addition to the intercept.

• The test statistic is N*R2 (with a small N),
• and it follows a Chi-squared distribution with 19 degrees of freedom (large relative to N).
• Critical values of a Chi-squared distribution with such a high number of degrees of freedom are likely to
be larger than N*R2.

As a result, it is very unlikely to find a rejection, indicating that the use of the White test is inappropriate
in this case, as the sample size is too small (given the number of regressors).
We cannot reject the null of homoskedasticity.
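A minimal sketch of the White test in Stata, assuming a recent Stata version where it is available as a postestimation command:

reg airq vala rain coas dens medi
estat imtest, white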
d. Assuming that we have multiplicative heteroskedasticity related to coas and medi, estimate the
coefficients by running a regression of log e2 upon these two variables. Test the null hypothesis of
homoskedasticity on the basis of this auxiliary regression.

The F-statistic of this regression is 5.07, with a p-value of 0.013. So we reject the null hypothesis of
homoskedasticity at the 5% significance level. As a consequence, we can say that we have multiplicative
heteroskedasticity related to coas and medi.
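A sketch of this auxiliary regression in Stata, reusing the OLS residuals e generated above (loge2 is a name chosen here):

gen loge2 = log(e^2)
reg loge2 coas medi

The F-statistic reported in the header of this regression is the test of joint significance of coas and medi.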

If A3, the assumption of homoskedasticity, is violated, heteroskedasticity arises.

Consequences of heteroskedasticity:
• OLS is still unbiased under heteroskedasticity.

• Also, the interpretation of R-squared is not changed.

• Heteroskedasticity invalidates the variance formulas for the OLS estimators; therefore, routinely computed
standard errors are incorrect.

• The usual F and t tests are no longer valid under heteroskedasticity.

• Under heteroskedasticity, OLS is no longer the best linear unbiased estimator (BLUE); there may be more
efficient linear estimators.
Detection of Heteroskedasticity

The Breusch-Pagan test
It is based on the auxiliary regression $\hat{u}^2 = \delta_0 + \delta_1 x_1 + \cdots + \delta_k x_k + \nu$.
The null hypothesis of homoskedasticity is $H_0: \delta_1 = \delta_2 = \cdots = \delta_k = 0$,
against the alternative $H_1: H_0$ is not true.
Test statistic: N*R2, which has a Chi-squared distribution (df = number of variables in the auxiliary regression).

The White test
It is based on regressing the squared OLS residuals upon all regressors, their squares and their (unique) cross-products.
Test statistic: N*R2, which has a Chi-squared distribution (df = number of variables in the auxiliary regression).

Multiplicative heteroskedasticity test
It is based on the auxiliary regression $\log e_i^2 = \log \sigma^2 + z_i' \alpha + u_i$.
The simplest test is the standard F-test in this auxiliary regression for the hypothesis that all slope coefficients
are equal to zero.
Solutions of Heteroskedasticity
✓ consider a loglinear model

It is quite common to find heteroskedasticity in situations in which the size of the observational units
differs substantially, for example in a sample containing firms with one employee and firms with over
1000 employees. We can expect that large firms have larger absolute values of all variables in the model,
including the unobservables collected in the error term.

A common approach to alleviate this problem is to use logarithms of all variables rather than their levels.
Consequently, our first step in handling the heteroskedasticity problem is to consider a loglinear model.
✓ robust standard errors
It is possible to estimate valid standard errors for OLS without specifying $\sigma_i^2$:

$\widehat{\mathrm{var}}(\hat{\beta}) = \left(\sum_{i=1}^{n} x_i x_i'\right)^{-1} \left(\sum_{i=1}^{n} \hat{u}_i^2 x_i x_i'\right) \left(\sum_{i=1}^{n} x_i x_i'\right)^{-1}$

If we use this formula to compute standard errors rather than the routine one, we can continue as before with our t- and F-tests.
These are standard errors that are robust to heteroskedasticity; that is, they are correct even if the errors are
heteroskedastic. The square root of this variance is called the "heteroskedasticity-robust standard error".

Heteroskedasticity-robust standard errors can be used for any inferences (t and F tests).

Note: parameter estimates and goodness-of-fit measures do not change; standard errors, t-statistics and F-tests are
adjusted.
✓ robust standard errors

If heteroskedasticity is detected and the sample size is large, we can estimate the model by OLS and obtain
valid inferences (t and F tests) using heteroskedasticity-robust standard errors.

In Stata, you can get heteroskedasticity-robust standard errors by adding the option robust, for example:
reg y x1 x2, robust

However, OLS is no longer the best estimator, and if we want a more efficient estimator, we need to use an
alternative estimator.
✓ Deriving an alternative estimator

Trick: we know that OLS is BLUE under the Gauss-Markov conditions.

1. Transform the model such that it satisfies the Gauss-Markov assumptions again.

If $V\{\varepsilon_i\} = \sigma_i^2 = \sigma^2 h_i^2$, the transformed model $\frac{y_i}{h_i} = \frac{x_i'}{h_i}\beta + \frac{u_i}{h_i}$ has a homoskedastic error term.

2. Apply OLS to the transformed model.

This leads to the generalized least squares (GLS) estimator, $\hat{\beta}_{GLS} = \left(\sum_{i=1}^{n} h_i^{-2} x_i x_i'\right)^{-1} \sum_{i=1}^{n} h_i^{-2} x_i y_i$, which is BLUE.

• However, it can only be applied if we know $h_i$, or if we can estimate it by making additional restrictive
assumptions on the form of $h_i$.

• This leads to a feasible GLS (FGLS, EGLS) estimator for heteroskedasticity, which is also called weighted
least squares (WLS).
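As a sketch of how such a weighted regression can be run in Stata when an estimate of $h_i^2$ is available (here stored in a hypothetical variable h2), analytic weights proportional to $1/h_i^2$ reproduce the weighted least squares computation used later in these slides:

reg y x1 x2 [aweight = 1/h2]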
Illustration: explaining labor demand
We estimate a simple labor demand function for a sample of 569 Belgian
firms (from 1996).
We explain labor from output, wage costs and capital stock.

Note that the variables are scaled (to obtain coefficients in the same
order of magnitude).

A Linear Model

Breusch-Pagan test

We see (very) high t-ratios and a high R2 in the auxiliary regression.
This indicates that the squared errors are strongly related to the regressors $x_i$.

Test statistic: N*R2 gives 331.0, which provides a very strong rejection!

In Stata:
reg labor wage output capital
predict e, resid
gen e2 = e^2
reg e2 wage output capital
display 569*0.5818
1st solution: Loglinear Model

Recall that in the loglinear model the coefficients have the interpretation of elasticities.

We can perform the Breusch–Pagan test in a similar way as before: the auxiliary regression of squared OLS
residuals on the three explanatory variables (in logs) leads to an R2 of 0.0136. The resulting test statistic is
569 x 0.0136 = 7.74, which is on the margin of being significant at the 5% level.

A more general test is the White test (next slide).

In Stata:
gen llabor = log(labor) …
reg llabor lwage loutput lcapital
predict e, resid
gen e2 = e^2
reg e2 lwage loutput lcapital
display 569*0.0136
The White Test

With an R2 of 0.1029, this leads to a value of the White test statistic of 569 x 0.1029 = 58.5, which is highly
significant for a Chi-squared distribution with 9 degrees of freedom.

Given the strong rejection (of homoskedasticity), we next estimate the loglinear model using White standard
errors (next slide).

In Stata:
reg llabor lwage loutput lcapital
estat imtest, white
2nd solution: Robust (White) s.e.’s

• In many cases, using White (heteroskedasticity-consistent) standard errors is appropriate and a good
solution to the problem of heteroskedasticity.
• Standard errors, t-statistics and the F-statistic are adjusted.
• Qualitatively, the conclusions are not changed: wages and output are significant in explaining labour
demand, capital is not.

Sometimes we would like to have a more efficient estimator, by making some assumption about the form of
heteroskedasticity (next slide).
Multiplicative heteroskedasticity

• If we are willing to make assumptions about the form of heteroskedasticity, the use of the more efficient
EGLS estimator is an option.
• We consider the multiplicative form, and choose $z_i = x_i$.
• The variables log(capital) and log(output) appear to be important in explaining the variance of the error
term. Also note that the F-value of this auxiliary regression leads to rejection of the null hypothesis of
homoskedasticity.

The exponential of the predicted values of this regression can be used to transform the original data.
Transforming all variables and applying OLS to the transformed equation yields the EGLS estimates
presented in Table 4.7.
3rd solution: EGLS loglinear model

To obtain the EGLS estimator, compute $\hat{h}_i^2 = \exp(z_i'\hat{\alpha})$ from the estimated auxiliary
regression, and transform all observations to obtain

$\frac{y_i}{\hat{h}_i} = \frac{x_i'}{\hat{h}_i}\beta + \frac{u_i}{\hat{h}_i}$

The error term in this model is (approximately) homoskedastic. Applying OLS to the transformed model gives
the EGLS estimator for $\beta$.

Note: the transformed regression is for computational purposes only. All economic interpretations refer to
the original model!
In Stata:

. reg llabor lwage loutput lcapital
. predict u, resid
. gen u2 = u^2
. gen lu2 = log(u2)
. reg lu2 lwage loutput lcapital
. predict yhat
. gen weight = 1/exp(yhat)
. reg llabor lwage loutput lcapital [aweight=weight]
• Comparing Tables 4.7 and 4.5, we see that the efficiency gain is substantial.

• The standard errors for the EGLS approach are smaller.

• Comparison with Table 4.3 is not appropriate: the routine OLS standard errors reported there are invalid
under heteroskedasticity, so that table is misleading.

• The coefficient estimates are fairly close to the OLS ones. Note that the effect of capital is now statistically
significant.

• The fact that the R2 in Table 4.7 is larger than in the OLS case is misleading, because this R2 is computed
for the transformed model with a transformed endogenous variable.
The R2 in Table 4.7 expresses the amount of variation in llabor/h that is explained by the model, not the
variation in llabor itself. Because the observations with large values of hi, which are the ones least accurately
described by the model, receive a small weight in the transformed regression, the reported R2 tends to be higher.
Problem 15 (Problem 14, continued)

Consider the same data as in Problem 14.

e. Using the results from d, compute an EGLS estimator for the linear model. Compare
your results with those obtained under a. Redo the tests from b.

f. Comment upon the appropriateness of the R2 in the regression of part e.
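A hedged Stata sketch of the EGLS computation in part e, mirroring the weighting approach used in the labor-demand illustration above (the names e, loge2, lh2hat and w are chosen here for illustration):

reg airq vala rain coas dens medi
predict e, resid
gen loge2 = log(e^2)
reg loge2 coas medi
predict lh2hat
gen w = 1/exp(lh2hat)
reg airq vala rain coas dens medi [aweight=w]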
