LECTURE 4: THE K-VARIABLE LINEAR MODEL I
Consider the system
y1 = α + βx1 + ε1
y2 = α + βx2 + ε2
...
yN = α + βxN + εN
or in matrix form
y = Xβ* + ε
where y is Nx1, X is Nx2, β* is 2x1, and ε is Nx1.
N.M. Kiefer, Cornell University, Economics 620, Lecture 4 1
K-Variable Linear Model
X = [1 x1; 1 x2; … ; 1 xN]  (one row per observation),  β* = (α, β)′.

Good practice requires inclusion of the column of ones.
Consider the general model
y = Xβ + ε
Convention: y is Nx1, X is NxK, β is Kx1, and ε is Nx1.

X = [1 x21 … xK1; 1 x22 … xK2; … ; 1 x2N … xKN]  (row i is (1, x2i, …, xKi)),
β = (β1, β2, …, βK)′.
More on the Linear Model
A typical row looks like:
yi = β1 + β2 x 2i + β3 x 3i +...+ βK x Ki + εi
THE LEAST SQUARES METHOD:
First assumption: Ey = Xβ
S(b) = (y - Xb)’(y - Xb)
= y’y - 2b’X’y + b’X’Xb
NORMAL EQUATIONS
X′Xβ̂ - X′y = 0
These equations always have a solution.
If X′X is invertible,
β̂ = (X′X)⁻¹X′y.
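As a numerical sanity check, the normal equations can be solved directly with NumPy. The design matrix, coefficients, and sample size below are made-up values for illustration only:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: N = 100 observations, K = 3 columns of X
# (a column of ones plus two covariates); all values are invented.
N, K = 100, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, 2))])
beta_true = np.array([1.0, 2.0, -0.5])
y = X @ beta_true + rng.normal(size=N)

# Solve the normal equations X'X b = X'y for beta-hat.
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# lstsq minimizes S(b) = (y - Xb)'(y - Xb) directly, without
# forming X'X; the two routes agree.
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)
assert np.allclose(beta_hat, beta_lstsq)
```

In practice `lstsq` (QR/SVD-based) is preferred to explicitly inverting X′X, since it is better conditioned.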
Proposition: β̂ is a minimizer of S(b).
Proof: Let b be any other K-vector.
(y - Xb)′(y - Xb)
= (y - Xβ̂ + X(β̂ - b))′(y - Xβ̂ + X(β̂ - b))
= (y - Xβ̂)′(y - Xβ̂) + (β̂ - b)′X′X(β̂ - b)
(the cross terms vanish since X′(y - Xβ̂) = 0 by the normal equations)
≥ (y - Xβ̂)′(y - Xβ̂). (why?)
Definition: e = y - Xβ̂ is the vector of residuals.
Note: Ee = 0 and X′e = 0.
Proposition: The LS estimator is unbiased.
Proof: Eβ̂ = E[(X′X)⁻¹X′y]
= E[(X′X)⁻¹X′(Xβ + ε)]
= β + (X′X)⁻¹X′Eε = β.
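A small Monte Carlo sketch of this unbiasedness claim (the fixed design and coefficient values are invented for illustration): averaging β̂ over repeated draws of ε with Eε = 0 should recover β.

```python
import numpy as np

rng = np.random.default_rng(1)

# Fixed design; illustrative values only.
N, K = 50, 2
X = np.column_stack([np.ones(N), rng.normal(size=N)])
beta = np.array([1.0, 2.0])

# Draw epsilon repeatedly and recompute beta-hat each time.
reps = 5000
solve_mat = np.linalg.solve(X.T @ X, X.T)  # (X'X)^{-1} X'
draws = np.empty((reps, K))
for r in range(reps):
    eps = rng.normal(size=N)
    draws[r] = solve_mat @ (X @ beta + eps)

# The Monte Carlo mean is close to beta (sampling noise remains).
assert np.allclose(draws.mean(axis=0), beta, atol=0.05)
```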
GEOMETRY OF LEAST SQUARES:
Consider y = Xβ + ε with
y = (y1, y2)′,  X = (x1, x2)′.
Definition: The space spanned by matrix
X is the vector space which consists of all
linear combinations of the column
vectors of X.
Definition: X(X′X)⁻¹X′y is the orthogonal
projection of y onto the space spanned by X.
Proposition: e is perpendicular to X,
i.e. X’e = 0.
Proof:
e = y - Xβ̂ = y - X(X′X)⁻¹X′y
= (I - X(X′X)⁻¹X′)y
⇒ X′e = (X′ - X′X(X′X)⁻¹X′)y = (X′ - X′)y = 0.
Thus the equation y = Xβ̂ + e gives y as the
sum of a vector in R[X] (the range, or column space, of X)
and a vector in N[X′] (the null space of X′).
Common (friendly) projection matrices:
1. The matrix which projects onto the space
orthogonal to the space spanned by X (i.e. onto
N[X′]) is
M = I - X(X′X)⁻¹X′.
Note: e = My. If X has full column rank, M has
rank (N - K).
2. The matrix which projects onto the space
spanned by X is
I - M = X(X′X)⁻¹X′.
Note: ŷ = y - e = y - My = (I - M)y. If X has
full column rank, (I - M) has rank K.
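A minimal numerical check of M and I - M, built from an arbitrary simulated design matrix (all values illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)

# Illustrative design matrix with full column rank.
N, K = 20, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, 2))])

P = X @ np.linalg.solve(X.T @ X, X.T)  # I - M: projects onto span of X
M = np.eye(N) - P                      # projects onto N[X']

# Both are idempotent.
assert np.allclose(M @ M, M)
assert np.allclose(P @ P, P)

# Residuals e = My are orthogonal to the columns of X: X'e = 0.
y = rng.normal(size=N)
e = M @ y
assert np.allclose(X.T @ e, 0)

# For idempotent matrices trace = rank: tr M = N - K, tr (I - M) = K.
assert np.isclose(np.trace(M), N - K)
assert np.isclose(np.trace(P), K)
```

The trace checks preview property 3 below: since the eigenvalues of an idempotent matrix are 0 or 1, its trace counts the nonzero eigenvalues, i.e. its rank.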
Example in R²
yi = xiβ + εi,  i = 1, 2
[Figure: the vector y in R², its projection xβ̂ onto the line spanned by x, and the perpendicular residual e.]
What is the case of singular X′X?
Properties of projection matrices
1. Projection matrices are idempotent.
E.g. (I - M)(I - M) = (I - M).
Proof: (I - M)(I - M)
= (X(X′X)⁻¹X′)(X(X′X)⁻¹X′)
= X(X′X)⁻¹X′ = (I - M).
2. Idempotent matrices have eigenvalues equal
to zero or one.
Proof: Consider the characteristic equation
Mz = λz ⇒ M²z = Mλz = λ²z.
Since M is idempotent, M²z = Mz.
Thus, λ²z = λz, which implies that λ is either
0 or 1.
3. The number of nonzero eigenvalues of a
matrix is equal to its rank.
⇒ For idempotent matrices, trace = rank.
More assumptions for the K-variable linear
model:
Second assumption: V(y) = V(ε) = σ²I_N,
where y and ε are N-vectors.
With this assumption, we can obtain the
sampling variance of β̂.
Proposition: V(β̂) = σ²(X′X)⁻¹
Proof:
β̂ = (X′X)⁻¹X′y = (X′X)⁻¹X′Xβ + (X′X)⁻¹X′ε,
hence
β̂ = β + (X′X)⁻¹X′ε.
V(β̂) = E(β̂ - Eβ̂)(β̂ - Eβ̂)′
= E[(X′X)⁻¹X′εε′X(X′X)⁻¹]
= (X′X)⁻¹X′(Eεε′)X(X′X)⁻¹
= σ²(X′X)⁻¹.
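The proposition can be checked by Monte Carlo: the sample covariance of β̂ across repeated ε draws should approach σ²(X′X)⁻¹. All numbers below are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(3)

# Illustrative fixed design; sigma chosen arbitrarily.
N = 40
sigma = 1.5
X = np.column_stack([np.ones(N), rng.normal(size=N)])
beta = np.array([0.5, -1.0])
V_theory = sigma**2 * np.linalg.inv(X.T @ X)

# Monte Carlo covariance of beta-hat across repeated epsilon draws.
reps = 20000
solve_mat = np.linalg.solve(X.T @ X, X.T)
draws = np.array([solve_mat @ (X @ beta + sigma * rng.normal(size=N))
                  for _ in range(reps)])
V_mc = np.cov(draws, rowvar=False)

# Simulated and theoretical covariance matrices agree closely.
assert np.allclose(V_mc, V_theory, atol=0.01)
```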
Gauss-Markov Theorem: The LS estimator is
BLUE.
Proof: Consider estimating c′β for some c.
A possible estimator is c′β̂,
with variance σ²c′(X′X)⁻¹c.
An alternative linear unbiased estimator: b = a′y.
Eb = a′Ey = a′Xβ.
Since both c′β̂ and b are unbiased, a′X = c′.
Thus, b = a′y = a′(Xβ + ε)
= a′Xβ + a′ε = c′β + a′ε.
Hence, V(b) = σ²a′a.
Now, V(c′β̂) = σ²a′X(X′X)⁻¹X′a since c′ = a′X.
So V(b) - V(c′β̂) = σ²a′Ma ≥ 0, since M is p.s.d.
Hence V(b) ≥ V(c′β̂).
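A numeric illustration of the key step (design and c are arbitrary invented values): any unbiased weight vector a can be written as X(X′X)⁻¹c plus a component Mv in N[X′], and that extra component only adds variance.

```python
import numpy as np

rng = np.random.default_rng(4)

# Illustrative design and target linear combination c'beta.
N, K = 30, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, 2))])
c = np.array([1.0, 0.5, -2.0])

M = np.eye(N) - X @ np.linalg.solve(X.T @ X, X.T)

# Unbiasedness requires X'a = c; parameterize
# a = X(X'X)^{-1} c + M v for an arbitrary v.
v = rng.normal(size=N)
a = X @ np.linalg.solve(X.T @ X, c) + M @ v
assert np.allclose(X.T @ a, c)  # the unbiasedness constraint holds

# With sigma^2 = 1: V(b) - V(c'beta-hat) = a'Ma >= 0.
var_ols = c @ np.linalg.solve(X.T @ X, c)   # c'(X'X)^{-1} c
var_b = a @ a                               # a'a
assert var_b >= var_ols - 1e-10
assert np.isclose(var_b - var_ols, a @ M @ a)
```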
Estimation of σ²
Proposition: s² = e′e/(N - K) is an unbiased
estimator for σ².
Proof: e = y - Xβ̂ = My = Mε ⇒ e′e = ε′Mε
Ee′e = Eε′Mε = E tr ε′Mε (why?)
= tr Eε′Mε = tr EMεε′ (important trick)
= tr M Eεε′ = σ² tr M = σ²(N - K)
⇒ s² = e′e/(N - K) is unbiased for σ².
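A quick simulation consistent with this proposition (all numeric values are illustrative): averaging s² over many ε draws should recover σ².

```python
import numpy as np

rng = np.random.default_rng(5)

# Illustrative setup; sigma^2 = 4 is arbitrary.
N, K = 25, 3
sigma2 = 4.0
X = np.column_stack([np.ones(N), rng.normal(size=(N, 2))])
beta = np.array([1.0, 0.0, 3.0])
M = np.eye(N) - X @ np.linalg.solve(X.T @ X, X.T)

# Average s^2 = e'e/(N - K) over many epsilon draws.
reps = 20000
s2 = np.empty(reps)
for r in range(reps):
    y = X @ beta + np.sqrt(sigma2) * rng.normal(size=N)
    e = M @ y
    s2[r] = e @ e / (N - K)

# The Monte Carlo mean of s^2 is close to sigma^2.
assert np.isclose(s2.mean(), sigma2, atol=0.1)
```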
FIT: DOES THE REGRESSION MODEL
EXPLAIN THE DATA?
We will need the useful idempotent matrix
A = I - 1(1′1)⁻¹1′ = I - 11′/N,
which sweeps out means.
Here 1 is an N-vector of ones.
Note that AM = M when X contains a constant
term.
Definition: The squared correlation coefficient
in the K-variable case is
R² = (sum of squares due to X)/(total sum of
squares)
= 1 - (e′e/y′Ay).
Using A, y′Ay = Σ (yᵢ - ȳ)², the sum running over i = 1, …, N.
y′Ay = (Ay)′(Ay) = (Aŷ + Ae)′(Aŷ + Ae)
= ŷ′Aŷ + e′Ae since ŷ′e = 0
Thus, y′Ay = ŷ′Aŷ + e′e since Ae = e.
Scaling yields:
1 = ŷ′Aŷ/y′Ay + e′e/y′Ay.
What are the two terms of this split-up?
R² gives the fraction of variation explained by
X:
R² = 1 - (e′e/y′Ay).
Note: The adjusted squared correlation
coefficient is given by
R̄² = 1 - [e′e/(N - K)] / [y′Ay/(N - 1)].
(Why might this be preferable?)
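Computing R² and the adjusted R̄² from the formulas above on simulated data (the design and coefficients are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(6)

# Illustrative data; X includes a constant term.
N, K = 60, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, 2))])
y = X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=N)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta_hat
tss = np.sum((y - y.mean())**2)  # y'Ay: total (centered) sum of squares

r2 = 1 - (e @ e) / tss
r2_adj = 1 - (e @ e / (N - K)) / (tss / (N - 1))

# With a constant in X, 0 <= R^2 <= 1, and the adjusted version
# is never larger, since it penalizes extra regressors.
assert 0 <= r2 <= 1
assert r2_adj <= r2
```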
REPORTING
Always report characteristics of the sample, e.g.
means, standard deviations, anything unusual or
surprising, how the data set was collected, and
how the sample was selected.
Report β̂ and standard errors (not t-statistics).
The usual format is
β̂
(s.e. of β̂)
Specify s² or σ̂²ML.
Report N and R².
Plots are important. For example, predicted vs.
actual values or predicted and actual values over
time in time series studies should be presented.
COMMENTS ON LINEARITY:
Consider the following argument: Economic functions don't
change suddenly. Therefore they are continuous. Thus they
are differentiable and hence nearly linear by Taylor's
Theorem.
This argument is false (but irrelevant).
[Two sketches of f(x) against x. Left: a function that is
continuous but not differentiable, yet well-approximated
by a line. Right: a function that is continuous and
differentiable, but not well-approximated by a line.]