Session 3: Lecture Outline
• Problem of Estimation:
  – Ordinary Least Squares Method
  – Method of Moments Estimation Procedure
  – Maximum Likelihood Estimation Procedure
• Classical Linear Regression Model: Assumptions
• Precision of the Estimators: The Standard Errors of the Least Squares Estimators
• Gauss-Markov Theorem
• Coefficient of Determination
Ref.: Gujarati, Basic Econometrics, Ch. 3
Simple Linear Regression Model
Finds a linear relationship between:
- one independent variable X and
- one dependent variable Y
First prepare a scatter plot to verify that the data show a linear trend.
Use alternative approaches if the data are not linear.
Simple Linear Regression Model: Estimation
Model: $Y_i = \beta_1 + \beta_2 X_i + u_i$
where
Y = dependent variable
X = independent variable
$\beta_1$ = intercept/constant term
$\beta_2$ = slope coefficient
$u_i$ = error term (random disturbance)
Methods to Estimate the SRF (Estimators)
1. Least Squares Method (Ordinary Least Squares, OLS)
2. Method of Moments Estimation
3. Maximum Likelihood Estimation
Simple Linear Regression Model: Estimation
Y_i  X_i
70 80
65 100
90 120
95 140
110 160
115 180
120 200
140 220
155 240
150 260
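To make the later calculations reproducible, here is a minimal Python sketch (numpy and matplotlib are my assumed tools; the lecture does not prescribe any software) that loads this table and draws the scatter plot recommended above to check for a linear trend.

```python
import numpy as np
import matplotlib.pyplot as plt

# Sample data from the table above
Y = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)
X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)

# Scatter plot: inspect visually for a roughly linear trend before fitting
plt.scatter(X, Y)
plt.xlabel("X")
plt.ylabel("Y")
plt.title("Scatter plot of Y against X")
plt.show()
```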
Simple Linear Regression Model: Estimation
Model:
$Y_i = \hat{\beta}_1 + \hat{\beta}_2 X_i + \hat{u}_i$ --- SRF
$\hat{Y}_i = \hat{\beta}_1 + \hat{\beta}_2 X_i$ --- fitted line
$\hat{u}_i = Y_i - \hat{\beta}_1 - \hat{\beta}_2 X_i$, or $\hat{u}_i = Y_i - \hat{Y}_i$

Y_i    X_i     û_i      Ŷ_i
 70     80     4.82     65.18
 65    100   -10.36     75.36
 90    120     4.45     85.55
 95    140    -0.73     95.73
110    160     4.09    105.91
115    180    -1.09    116.09
120    200    -6.27    126.27
140    220     3.55    136.45
155    240     8.36    146.64
150    260    -6.82    156.82
Simple Linear Regression Model: Estimation
I. Using the Method of Ordinary Least Squares (OLS)
We estimate the intercept and slope by minimizing the vertical distance between the
data points and the estimated sample regression function, i.e., by minimizing
the sum of squared residuals:
$\hat{u}_i = Y_i - \hat{Y}_i = Y_i - (\hat{\beta}_1 + \hat{\beta}_2 X_i)$
$\min_{\hat{\beta}_1, \hat{\beta}_2} \sum_{i=1}^{n} \hat{u}_i^2 = \min_{\hat{\beta}_1, \hat{\beta}_2} \sum_{i=1}^{n} (Y_i - \hat{\beta}_1 - \hat{\beta}_2 X_i)^2 \equiv \min_{\hat{\beta}_1, \hat{\beta}_2} S(\hat{\beta}_1, \hat{\beta}_2)$
We can obtain $\hat{\beta}_1, \hat{\beta}_2$ by taking the derivatives of $S(\hat{\beta}_1, \hat{\beta}_2)$ with respect to
$\hat{\beta}_1$ and $\hat{\beta}_2$ (first-order conditions) and setting them equal to zero.
First- and Second-Order Conditions
First-order conditions:
1) $\frac{\partial S(\hat{\beta}_1, \hat{\beta}_2)}{\partial \hat{\beta}_1} = -2\sum_{i=1}^{n} (Y_i - \hat{\beta}_1 - \hat{\beta}_2 X_i) = 0$
2) $\frac{\partial S(\hat{\beta}_1, \hat{\beta}_2)}{\partial \hat{\beta}_2} = -2\sum_{i=1}^{n} (Y_i - \hat{\beta}_1 - \hat{\beta}_2 X_i) X_i = 0$
Second-order condition: mostly satisfied, since $S$ is a convex function of $\hat{\beta}_1, \hat{\beta}_2$, so the stationary point is a minimum.
Two Normal Equations of OLS
$\sum Y_i = n\hat{\beta}_1 + \hat{\beta}_2 \sum X_i$
$\sum Y_i X_i = \hat{\beta}_1 \sum X_i + \hat{\beta}_2 \sum X_i^2$
Solving these two equations simultaneously gives the OLS estimates.
Estimation of Slope and Intercept
Further simplifying these two normal equations together:
1) Slope: $\hat{\beta}_2 = \frac{\sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (X_i - \bar{X})^2} = \frac{\sum x_i y_i}{\sum x_i^2}$,
   where $x_i = X_i - \bar{X}$ and $y_i = Y_i - \bar{Y}$ are deviations from the sample means $\bar{X}$ and $\bar{Y}$.
2) Intercept: $\hat{\beta}_1 = \bar{Y} - \hat{\beta}_2 \bar{X}$
Estimated Regression Model: $\hat{Y}_i = \hat{\beta}_1 + \hat{\beta}_2 X_i$
For the sample data above, this gives $\hat{Y}_i = 24.455 + 0.509 X_i$ (see the sketch below).
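A minimal sketch of these two formulas in Python (numpy assumed; data from the table above). It reproduces the fitted line quoted in this session, $\hat{Y}_i = 24.455 + 0.509 X_i$, and the residual table shown earlier.

```python
import numpy as np

Y = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)
X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)

# Deviations from the sample means
x = X - X.mean()
y = Y - Y.mean()

# OLS slope and intercept from the normal equations
beta2_hat = (x * y).sum() / (x ** 2).sum()    # slope
beta1_hat = Y.mean() - beta2_hat * X.mean()   # intercept

Y_fit = beta1_hat + beta2_hat * X             # fitted values
u_hat = Y - Y_fit                             # residuals (match the table above)

print(f"beta1_hat = {beta1_hat:.3f}, beta2_hat = {beta2_hat:.3f}")
# -> beta1_hat = 24.455, beta2_hat = 0.509
```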
Simple Linear Regression Model: Estimation
II. Deriving OLS Using the Method of Moments (MoM)
• Another way of establishing the OLS formulas is the Method of Moments
approach, developed by Pearson (1894).
• The basic idea of this method is to equate certain sample characteristics, such as
the mean, to the corresponding population expected values.
• Method of moments estimation is justified by the law of large numbers.
The Method of Moments (MM) and GMM
1. Unconditional moment condition, e.g. $E(X - \mu) = 0$, which defines the mean $\mu$.
2. Conditional moment condition, e.g. $E(u \mid X) = 0$, used in the regression context below.
Simple Linear Regression Model: Estimation (MoM)
• To derive the OLS estimates, note that our main
assumption $E(u \mid x) = E(u) = 0$ also implies $Cov(x, u) = E(xu) = 0$.
• We can write these two restrictions just in terms of x, y, $\beta_1$ and $\beta_2$, since
$u = y - \beta_1 - \beta_2 x$:
• $E(y - \beta_1 - \beta_2 x) = 0$
• $E[x(y - \beta_1 - \beta_2 x)] = 0$
• These are called moment restrictions.
Simple Linear Regression Model: Estimation (MoM)
• We want to choose values of the parameters that make the sample versions
of our moment restrictions hold exactly.
• The sample versions are:
$\frac{1}{n}\sum_{i=1}^{n} (Y_i - \hat{\beta}_1 - \hat{\beta}_2 X_i) = 0$
$\frac{1}{n}\sum_{i=1}^{n} X_i (Y_i - \hat{\beta}_1 - \hat{\beta}_2 X_i) = 0$
Given the definition of a sample mean, and properties of summation, we
can rewrite the first condition as
$\hat{\beta}_1 = \bar{Y} - \hat{\beta}_2 \bar{X}$   => OLS estimated intercept
Simple Linear Regression Model: Estimation (MoM)
Substituting this intercept into the second sample condition and solving gives
$\hat{\beta}_2 = \frac{\sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (X_i - \bar{X})^2}$   => OLS estimated slope
(see the sketch below)
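The two sample moment conditions are a linear system in $\hat{\beta}_1$ and $\hat{\beta}_2$; a minimal sketch (Python/numpy, same data as before) solves them directly and recovers exactly the OLS estimates, illustrating that MoM and OLS coincide here.

```python
import numpy as np

Y = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)
X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)
n = len(Y)

# Sample moment conditions:
#   (1/n) * sum(Y - b1 - b2*X)     = 0
#   (1/n) * sum(X*(Y - b1 - b2*X)) = 0
# Rearranged, these are exactly the two OLS normal equations:
#   n*b1      + sum(X)*b2    = sum(Y)
#   sum(X)*b1 + sum(X**2)*b2 = sum(X*Y)
A = np.array([[n,       X.sum()],
              [X.sum(), (X ** 2).sum()]])
b = np.array([Y.sum(), (X * Y).sum()])

beta1_hat, beta2_hat = np.linalg.solve(A, b)
print(beta1_hat, beta2_hat)   # same values as the OLS formulas: ~24.455, ~0.509
```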
Statistical Properties of OLS Estimators
Estimated regression model: $\hat{Y}_i = \hat{\beta}_1 + \hat{\beta}_2 X_i$
1. The OLS estimators are expressed solely in terms of the observable quantities.
2. They are point estimators.
3. Once the OLS estimates are obtained, the sample regression line can be
easily drawn. This regression line has the following properties:
   i) It passes through the sample means of Y and X, the point $(\bar{X}, \bar{Y})$.
   ii) The mean of the fitted values $\hat{Y}_i = \hat{\beta}_1 + \hat{\beta}_2 X_i$ equals the mean of the actual $Y_i$.
   iii) The mean value of the residuals is zero: $E(\hat{u}_i) = 0$.
   iv) $Cov(\hat{\beta}_1, \hat{\beta}_2) = -\bar{X} \, var(\hat{\beta}_2)$.
   v) The residuals are uncorrelated with $X_i$: $\sum \hat{u}_i X_i = 0$.
[Figure: the sample regression line plotted through the point $(\bar{X}, \bar{Y})$.]
Some Theorems
In deviation form, the SRF can be written as $\hat{y}_i = \hat{\beta}_2 x_i$.
In the two-variable model, $R^2 = r^2$: the coefficient of determination equals the
squared sample correlation coefficient between X and Y.
Assumptions of CLRM
1: The model is linear in the parameters.
$Y_i = \beta_1 + \beta_2 X_i + u_i$
2: The X values are fixed in repeated sampling; X is nonstochastic.
3: Zero mean of the disturbance u.
• Given the value of X, the mean, or expected value, of the
disturbance term $u_i$ is zero:
$E(u_i \mid X_i) = 0$
Assumptions of CLRM
4: Homoscedasticity, or equal variance of $u_i$
• Given the value of X, the variance of the disturbance term $u_i$ is the
same for all observations:
$var(u_i \mid X_i) = E[u_i - E(u_i \mid X_i)]^2$
$= E(u_i^2 \mid X_i)$   (uses Assumption 3)
$= \sigma^2$
Assumptions of CLRM
5: No autocorrelation between the disturbances
• Given any two X values, $X_i$ and $X_j$ ($i \neq j$), the
correlation between the corresponding $u_i$ and $u_j$ is zero:
$cov(u_i, u_j) = E\{[u_i - E(u_i \mid X_i)][u_j - E(u_j \mid X_j)]\}$
$= E(u_i \mid X_i)\,E(u_j \mid X_j)$   (uses Assumption 3)
$= 0$
Autocorrelation: Residual Plot
[Figure: two scatter plots of $\hat{u}_t$ against $\hat{u}_{t-1}$. Left panel: positive
autocorrelation (points cluster in the first and third quadrants). Right panel: negative
autocorrelation (points cluster in the second and fourth quadrants). A simulation
sketch follows below.]
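A small simulation sketch of this picture (Python with numpy/matplotlib; the AR(1) process and its coefficient are illustrative assumptions of mine, not part of the lecture): positively autocorrelated disturbances plotted against their own lag cluster in the first and third quadrants.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
T, rho = 200, 0.8            # rho > 0 -> positive autocorrelation

# Simulate AR(1) disturbances: u_t = rho * u_{t-1} + e_t
u = np.zeros(T)
for t in range(1, T):
    u[t] = rho * u[t - 1] + rng.normal()

# Plot u_t against u_{t-1}: positive rho puts most points in quadrants I and III
plt.scatter(u[:-1], u[1:])
plt.xlabel(r"$\hat{u}_{t-1}$")
plt.ylabel(r"$\hat{u}_t$")
plt.title("Positively autocorrelated disturbances")
plt.show()
```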
Assumptions of CLRM
6: Zero covariance between $X_i$ and $u_i$, i.e., $E(X_i u_i) = 0$
$cov(u_i, X_i) = E\{[u_i - E(u_i \mid X_i)][X_i - E(X_i)]\}$
$= E[u_i (X_i - E(X_i))]$   (uses Assumption 3)
$= E(u_i X_i) - E(u_i) E(X_i)$   (since $E(X_i)$ is nonstochastic)
$= E(u_i X_i)$   (since $E(u_i) = 0$)
$= 0$   (by assumption)
Assumptions of CLRM
7: The number of observations (n) must be greater
than the number of parameters to be estimated (k).
(Failures here are related to "micronumerosity," i.e., too small a sample.)
8: Variability in X values
• Technically, var(X) must be a finite positive number.
9: The regression model is correctly specified.
There is no specification error or bias in the model
used for empirical analysis.
10: There is no perfect multicollinearity: there are
no perfect linear relationships among the
explanatory variables.
Simple Linear Regression Model: Classical Assumptions
● Assumptions for the Classical Linear Regression Model:
1. The regression model is linear in the parameters
2. X values are fixed in repeated sampling
3. Zero mean value of disturbance ui
4. Homoscedasticity or equal variance of ui
5. No autocorrelation between the disturbances
6. Zero covariance between u and X
7. The number of observations n must be greater than the number of parameters to be
estimated
8. Variability in X values
9. The regression model is correctly specified
10. There is no perfect multicollinearity.
Precision or S.E. of Least Squares Estimators
Population regression results (for this illustration):
β1 = 17.00
β2 = 0.60
σ = 11.32
σ² = 128.42
Sample regression function (SRF) estimated from the data above:
Estimated Model: Ŷ = 24.455 + 0.509X
Precision or S.E. of Least Squares Estimators
● The standard errors for the OLS estimates can be obtained as follows:
$var(\hat{\beta}_1) = 41.0881$,  $se(\hat{\beta}_1) = 6.41$
$var(\hat{\beta}_2) = 0.0013$,  $se(\hat{\beta}_2) = 0.04$
$cov(\hat{\beta}_1, \hat{\beta}_2) = -\bar{X}\,var(\hat{\beta}_2) = -170 \times 0.0013 \approx -0.22$
The more variation in X, the smaller the variance of $\hat{\beta}_2$, and the more precise the estimate of the slope.
Variance of $\hat{\beta}_2$
$var(\hat{\beta}_2) = \frac{\sigma^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}$
[Figure: scatter of Y against X over a narrow range of X. The variation of X is relatively
small, so the slope estimate is very imprecise.]
Variance of $\hat{\beta}_2$
$var(\hat{\beta}_2) = \frac{\sigma^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}$
[Figure: scatter of Y against X over a wide range of X. The variation of X is much
bigger, so the slope estimate is much more precise.]
Precision or S.E. of Least Squares Estimators
However, since we do not know the variance of the error term (the population
variance $\sigma^2$), we can estimate it using the sample variance of the residuals:
$\hat{\sigma}^2 = \frac{\sum \hat{u}_i^2}{n - k}$
where $n - k$ is the number of degrees of freedom (here $k = 2$ parameters).
N.B. The sample variance of the residuals is an unbiased estimator of the
population variance of the error term. A numerical sketch follows below.
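A minimal numerical sketch (Python/numpy, same data as before) of $\hat{\sigma}^2$ and the resulting standard errors; the printed values agree with the ones quoted earlier up to rounding.

```python
import numpy as np

Y = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)
X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)
n, k = len(Y), 2

x = X - X.mean()
beta2_hat = (x * (Y - Y.mean())).sum() / (x ** 2).sum()
beta1_hat = Y.mean() - beta2_hat * X.mean()
u_hat = Y - (beta1_hat + beta2_hat * X)          # residuals

# Estimated error variance with n - k degrees of freedom
sigma2_hat = (u_hat ** 2).sum() / (n - k)

# Variances, covariance, and standard errors of the OLS estimators
var_beta2 = sigma2_hat / (x ** 2).sum()
var_beta1 = sigma2_hat * (X ** 2).sum() / (n * (x ** 2).sum())
cov_b1_b2 = -X.mean() * var_beta2                # cov(b1_hat, b2_hat) = -Xbar * var(b2_hat)

print(np.sqrt(var_beta1), np.sqrt(var_beta2), cov_b1_b2)   # ~6.41, ~0.036, ~-0.22
```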
Precision or S.E. of Least Squares Estimators
• Three important elements determine the precision of the estimates
(see the simulation sketch below):
1. The magnitude of the "noise" (the error variance)
2. The variance of X
3. The number of observations
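A small Monte Carlo sketch of these three elements (Python/numpy; the true parameters, noise levels, and sample sizes are illustrative choices of mine). Each change moves the empirical standard deviation of $\hat{\beta}_2$ in the stated direction.

```python
import numpy as np

rng = np.random.default_rng(1)
beta1, beta2 = 17.0, 0.6                     # true parameters (illustrative)

def sd_of_slope(n, x_spread, sigma, reps=2000):
    """Empirical std. dev. of the OLS slope across simulated samples."""
    X = np.linspace(0, x_spread, n)
    slopes = np.empty(reps)
    for r in range(reps):
        Y = beta1 + beta2 * X + rng.normal(0, sigma, n)
        x = X - X.mean()
        slopes[r] = (x * (Y - Y.mean())).sum() / (x ** 2).sum()
    return slopes.std()

print("baseline           :", sd_of_slope(n=20, x_spread=100, sigma=10))
print("more noise (sigma) :", sd_of_slope(n=20, x_spread=100, sigma=20))  # 1. larger noise -> less precise
print("more X variation   :", sd_of_slope(n=20, x_spread=200, sigma=10))  # 2. more variation -> more precise
print("more observations  :", sd_of_slope(n=80, x_spread=100, sigma=10))  # 3. larger n -> more precise
```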
Precision of the Estimates: Illustrations
[Figure: the fitted line $\hat{Y}_i = \hat{\beta}_1 + \hat{\beta}_2 X_i$ with the intercept, slope,
and a residual $Y_i - \hat{Y}_i$ labelled.]
1. Variance of $u_i$: [Figure: the noise around the true relationship can be large, or not.]
2. Variation in X: [Figures: the true relationship sampled over a narrow versus a wide
range of X; what matters is the variance of X in relation to the variance of u, and
the narrow-range sample gives a far less precise slope.]
3. Number of observations: [Figure: the true relationship sampled with few versus many
observations; more observations give a more precise estimate.]
Precision or S.E. of Least Squares Estimators
Covariance between the slope and intercept terms:
$cov(\hat{\beta}_1, \hat{\beta}_2) = -\bar{X} \, var(\hat{\beta}_2)$
Since $var(\hat{\beta}_2)$ is always positive, the sign of the covariance depends on the
sign of $\bar{X}$: if $\bar{X}$ is positive, the covariance will be negative.
Thus, if the slope coefficient is overestimated (too steep), the intercept
will be underestimated (too small).
Properties of OLS Estimators
(Gauss-Markov Theorem)
Under the assumptions of the CLRM, the least squares
estimators are the Best Linear Unbiased Estimators (BLUE):
– Best: the OLS estimator has the smallest variance
(smallest margin of error, most precise estimate) among linear unbiased estimators.
– Linear: the OLS estimator is a linear function of $Y_i$.
– Unbiased: the expected value of the OLS estimator is the
same as the true population value:
$E(\hat{\beta}_i) = \beta_i$
Gauss-Markov Theorem
(1) Linear means linear in the dependent variable.
$\hat{\beta}_2 = \frac{\sum_{i=1}^{N} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{N} (X_i - \bar{X})^2}$
may be rewritten as
$\hat{\beta}_2 = \frac{\sum_{i=1}^{N} (X_i - \bar{X}) Y_i}{\sum_{i=1}^{N} (X_i - \bar{X})^2} = \sum_{i=1}^{N} w_i Y_i$, where $w_i = \frac{X_i - \bar{X}}{\sum_{i=1}^{N} (X_i - \bar{X})^2}$
– $\hat{\beta}_2$ is now a linear function of $Y_i$ (see the check below).
– $\hat{\beta}_1$ can similarly be written as a linear function of $Y_i$.
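A quick numerical check of the linearity claim (Python/numpy, same data as before): the weights $w_i$ reproduce $\hat{\beta}_2$ exactly and satisfy $\sum w_i = 0$ and $\sum w_i X_i = 1$.

```python
import numpy as np

Y = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)
X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)

x = X - X.mean()
w = x / (x ** 2).sum()                 # w_i = (X_i - Xbar) / sum((X_i - Xbar)^2)

beta2_as_weighted_sum = (w * Y).sum()  # b2_hat = sum(w_i * Y_i): linear in Y
beta2_formula = (x * (Y - Y.mean())).sum() / (x ** 2).sum()

print(beta2_as_weighted_sum, beta2_formula)   # both ~0.509
print(w.sum(), (w * X).sum())                 # weights satisfy sum(w)=0, sum(w*X)=1
```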
Gauss-Markov Theorem
(2) Unbiased
– The expected value of the estimator is the true
underlying parameter:
$E(\hat{\beta}_2) = \beta_2$
(3) Efficiency (or minimum variance)
$Var(\hat{\beta}_2^{OLS}) \le Var(\tilde{\beta}_2)$
– Of all the linear, unbiased estimators $\tilde{\beta}_2$, OLS
has the smallest variance.
Gauss-Markov Theorem
(4) Consistency: an estimator is called consistent if it converges
stochastically to the true parameter value, with probability
approaching one, as the sample size increases indefinitely. This implies
$\operatorname{plim}_{n \to \infty} \hat{\beta}_2 = \beta_2$, where n is the sample size, i.e.,
$\Pr\{|\hat{\beta}_2 - \beta_2| < \varepsilon\} \to 1$ as n goes to infinity, for any small $\varepsilon > 0$.
A sufficient condition is
(1) $E(\hat{\beta}_2) \to \beta_2$ and
(2) $Var(\hat{\beta}_2) \to 0$
as n goes to infinity.
Similarly, this generalises to $\hat{\beta}_1$.
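A minimal simulation sketch of consistency (Python/numpy; the data-generating values are illustrative assumptions): as n grows, the OLS slope estimate concentrates around the true $\beta_2$.

```python
import numpy as np

rng = np.random.default_rng(2)
beta1, beta2 = 17.0, 0.6                        # true parameters (illustrative)

for n in (10, 100, 1000, 10000):
    X = rng.uniform(0, 100, n)                  # regressor with fixed, positive variance
    Y = beta1 + beta2 * X + rng.normal(0, 10, n)
    x = X - X.mean()
    b2 = (x * (Y - Y.mean())).sum() / (x ** 2).sum()
    print(f"n = {n:5d}: beta2_hat = {b2:.4f}")  # approaches 0.6 as n grows
```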
The Overall Goodness of Fit: R²
This measure helps determine the goodness of fit, or how well the sample
regression line fits the data.
TSS = $\sum (Y_i - \bar{Y})^2$  (total variability of the dependent variable Y about its mean)
ESS = $\sum (\hat{Y}_i - \bar{Y})^2$  (variability in Y explained by the sample regression)
RSS = $\sum (Y_i - \hat{Y}_i)^2$  (variability in Y left unexplained by the regression)
[Figure: scatter of Y against X with the fitted regression line; this line gives the
minimum RSS among all possible straight lines.]
The Overall Goodness of Fit: r² or R²
Decomposition of the variance of $Y_i$:
$Y_i = \hat{Y}_i + \hat{u}_i$, or, in deviation form, $y_i = \hat{y}_i + \hat{u}_i$
Squaring this equation and summing over the sample, we obtain
$\sum y_i^2 = \sum \hat{y}_i^2 + \sum \hat{u}_i^2 + 2\sum \hat{y}_i \hat{u}_i = \sum \hat{y}_i^2 + \sum \hat{u}_i^2$
(the cross-product term vanishes because the residuals are uncorrelated with the fitted values), i.e.,
TSS = ESS + RSS
The Overall Goodness of Fit: r² or R²
Then, dividing TSS = ESS + RSS through by TSS:
$1 = \frac{ESS}{TSS} + \frac{RSS}{TSS} = \frac{\sum (\hat{Y}_i - \bar{Y})^2}{\sum (Y_i - \bar{Y})^2} + \frac{\sum \hat{u}_i^2}{\sum (Y_i - \bar{Y})^2}$
We define r² as
$r^2 = \frac{\sum (\hat{Y}_i - \bar{Y})^2}{\sum (Y_i - \bar{Y})^2} = \frac{ESS}{TSS}$
or
$r^2 = 1 - \frac{\sum \hat{u}_i^2}{\sum (Y_i - \bar{Y})^2} = 1 - \frac{RSS}{TSS}$
The coefficient of determination measures the proportion or percentage
of the total variation in Y explained by the regression model:
TSS = ESS + RSS
$\sum y_i^2 = r^2 \sum y_i^2 + (1 - r^2) \sum y_i^2$
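A minimal sketch (Python/numpy, same data as before) that verifies the decomposition TSS = ESS + RSS, computes r² both ways, and confirms the earlier claim that in the two-variable model r² equals the squared sample correlation between X and Y.

```python
import numpy as np

Y = np.array([70, 65, 90, 95, 110, 115, 120, 140, 155, 150], dtype=float)
X = np.array([80, 100, 120, 140, 160, 180, 200, 220, 240, 260], dtype=float)

x = X - X.mean()
beta2_hat = (x * (Y - Y.mean())).sum() / (x ** 2).sum()
beta1_hat = Y.mean() - beta2_hat * X.mean()
Y_fit = beta1_hat + beta2_hat * X

TSS = ((Y - Y.mean()) ** 2).sum()      # total sum of squares
ESS = ((Y_fit - Y.mean()) ** 2).sum()  # explained sum of squares
RSS = ((Y - Y_fit) ** 2).sum()         # residual sum of squares

print(TSS, ESS + RSS)                  # decomposition: TSS = ESS + RSS
print(ESS / TSS, 1 - RSS / TSS)        # two equivalent r^2 formulas (~0.96)
print(np.corrcoef(X, Y)[0, 1] ** 2)    # equals the squared correlation of X and Y
```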
Problems with r² or R²
1. Spurious regression.
2. High correlation of $X_t$ with another variable $Z_t$.
3. Correlation does not necessarily imply causality.
4. Time-series equations usually generate higher R² values than cross-section equations.
5. A low R² does not mean a wrong choice of $X_t$.
6. R² values from equations with different functional forms of $Y_t$ are not comparable.
7. R², computed as 1 − RSS/TSS, can be negative if the model is a bad fit, i.e., if
RSS > TSS (for example, in a regression forced through the origin; see the sketch below).
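To make point 7 concrete, a small sketch (Python/numpy; the simulated data are an illustrative assumption of mine): forcing the regression through the origin when the data need a large intercept makes RSS exceed TSS, so 1 − RSS/TSS turns negative.

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.uniform(0, 10, 50)
Y = 100 - 2 * X + rng.normal(0, 1, 50)   # large intercept, negative slope

# Regression through the origin: Y = b*X + u (no intercept term)
b = (X * Y).sum() / (X ** 2).sum()
RSS = ((Y - b * X) ** 2).sum()
TSS = ((Y - Y.mean()) ** 2).sum()

print(1 - RSS / TSS)                     # negative: the no-intercept fit is worse
                                         # than simply predicting the mean of Y
```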
Thanks