Econometrics Lecture Note Chapter 4 and 5
Econometrics Lecture Note Chapter 4 and 5
Chapter Four
Econometrics
Yadeta Ahmed
Model (5.01) may enable us to find out whether sex makes any difference in a
college professor’s salary, assuming, of course, that all other variables such as age,
degree attained, and years of experience are held constant. Assuming that the
disturbance satisfy the usually assumptions of the classical linear regression
model, we obtain from (5.01).
Mean salary of female college professor: E (Yi / Di 0) -------(5.02)
Mean salary of male college professor: E (Yi / Di 1)
that is, the intercept term gives the mean salary of female college professors
and the slope coefficient tells by how much the mean salary of a male college
professor differs from the mean salary of his female counterpart, reflecting
the mean salary of the male college professor. A test of the null hypothesis that
there is no sex discrimination ( H 0 : 0) can be easily made by running
regression (5.01) in the usual manner and finding out whether on the basis of the t
test the estimated is statistically significant.
4.2. Regression on one quantitative variable and one qualitative variable with
two classes, or categories
Consider the model: Yi i 2 Di X i u i ----------------------------(5.03)
Where: Yi annual salary of a college professor
Econometrics
Yadeta Ahmed
Di 1 if male
=0 otherwise
Model (5.03) contains one quantitative variable (years of teaching experience) and
one qualitative variable (sex) that has two classes (or levels, classifications, or
categories), namely, male and female. What is the meaning of this equation?
Assuming, as usual, that E (u i ) 0, we see that
Mean salary of female college professor: E (Yi / X i , Di 0) 1 X i ---------(5.04)
Mean salary of male college professor: E (Yi / X i , Di 1) ( 2 ) X i ------(5.05)
Geometrically, we have the situation shown in fig. 5.1 (for illustration, it is
assumed that 1 0 ). In words, model 5.01 postulates that the male and female
college professors’ salary functions in relation to the years of teaching experience
have the same slope but different intercepts. In other words, it is assumed that
the level of the male professor’s mean salary is different from that of the female
professor’s mean salary (by 2 ) but the rate of change in the mean annual salary
by years of experience is the same for both sexes.
Econometrics
Yadeta Ahmed
If the assumption of common slopes is valid, a test of the hypothesis that the two
regressions (5.04) and (5.05) have the same intercept (i.e., there is no sex
discrimination) can be made easily by running the regression (5.03) and noting the
statistical significance of the estimated 2 on the basis of the traditional t test. If
the t test shows that ̂ 2 is statistically significant, we reject the null hypothesis that
the male and female college professors’ levels of mean annual salary are the same.
Before proceeding further, note the following features of the dummy variable
regression model considered previously.
Econometrics
Yadeta Ahmed
= 0 otherwise
Note that in the preceding assignment of the dummy variables we are arbitrarily
treating the “less than high school education” category as the base category.
Econometrics
Yadeta Ahmed
Therefore, the intercept 1 will reflect the intercept for this category. The
differential intercepts 2 and 3 tell by how much the intercepts of the other two
categories differ from the intercept of the base category, which can be readily
checked as follows: Assuming E (u i ) 0 , we obtain from (5.06)
E (Yi | D2 0, D3 0, X i ) 1 X i
E (Yi | D2 1, D3 0, X i ) ( 1 2 ) X i
E (Yi | D2 0, D3 1, X i ) ( 1 3 ) X i
which are, respectively the mean health care expenditure functions for the three
levels of education, namely, less than high school, high school, and college.
Geometrically, the situation is shown in fig 5.2 (for illustrative purposes it is
assumed that 3 2 ).
Econometrics
Yadeta Ahmed
assume that color has two categories: black and white. We can now write (5.03)
as :
Yi 1 2 D2i 3 D3i X i u i -------------------------------------------(5.07)
D2 1 if female
=0 otherwise
D3 1 if white
=0 otherwise
Notice that each of the two qualitative variables, sex and color, has two categories
and hence needs one dummy variable for each. Note also that the omitted, or base,
category now is “black female professor.”
Assuming E (u i ) 0 , we can obtain the following regression from (5.07)
Mean salary for black female professor:
E (Yi | D2 0, D3 0, X i ) 1 X i
Once again, it is assumed that the preceding regressions differ only in the intercept
coefficient but not in the slope coefficient .
An OLS estimation of (5.06) will enable us to test a variety of hypotheses. Thus,
if 3 is statistically significant, it will mean that color does affect a professor’s
salary. Similarly, if 2 is statistically significant, it will mean that sex also affects
a professor’s salary. If both these differential intercepts are statistically
Econometrics
Yadeta Ahmed
From the preceding discussion it follows that we can extend our model to include
more than one quantitative variable and more than two qualitative variables. The
only precaution to be taken is that the number of dummies for each qualitative
variable should be one less than the number of categories of that variable.
D2 1 if female
= 0 if male
D3 1 if college graduate
= 0 otherwise
Implicit in this model is the assumption that the differential effect of the sex
dummy D2 is constant across the two levels of education and the differential
Econometrics
Yadeta Ahmed
effect of the education dummy D3 is also constant across the two sexes. That is, if,
say, the mean expenditure on clothing is higher for females than males this is so
whether they are college graduates or not. Likewise, if, say, college graduates on
the average spend more on clothing than non college graduates, this is so whether
they are female or males.
higher (than the base category, which here is male nongraduate), but it is much
more so if the females also happen to be graduates. Similarly, the average
expenditure on clothing by a college graduate tends to be higher than the base
category but much more so if the graduate happens to be a female. This shows
how the interaction dummy modifies the effect of the two attributes considered
individually. Whether the coefficient of the interaction dummy is statistically
Econometrics
Yadeta Ahmed
significant can be tested by the usual t test. If it turns out to be significant, the
simultaneous presence of the two attributes will attenuate or reinforce the
individual effects of these attributes. Needless to say, omitting a significant
interaction term incorrectly will lead to a specification bias.
Econometrics
Yadeta Ahmed
It pays commissions based on sales in such manner that up to a certain level, the
target, or threshold, level X*, there is one (stochastic) commission structure and
beyond that level another. (Note: Besides sales, other factors affect sales
commission. Assume that these other factors are represented by the stochastic
disturbance term.) More specifically, it is assumed that sales commission increases
linearly with sales until the threshold level X*, after which also it increases
linearly with sales but at a much steeper rate. Thus, we have a piece-wise linear
regression consisting of two linear pieces or segments, which are labeled I and II
in fig. 5.3, and the commission function changes its slope at the threshold value.
Given the data on commission, sales, and the value of the threshold level X*, the
technique of dummy variables can be used to estimate the (differing) slopes of the
two segments of the piecewise linear regression shown in fig. 5.3. We proceed as
follows:
Yi 1 X 2 ( X i X *)Di u i ------------------------------------(5.11)
Econometrics
Yadeta Ahmed
= 0 if X i X *
Assuming E (u i ) 0, we see at once that
E (Yi | Di 0, X i , X *) 1 1 X i ---------------------------------------(5.12)
which gives the mean sales commission up to the target level X* and
E (Yi | Di 1, X i , X *) 1 2 X * ( 1 2 ) X i ----------------------(5.13)
which gives the mean sales commission beyond the target level X*.
Thus, 1 gives the slope of the regression lien in segment I, and 1 2 gives the
slope of the regression line in segment II of the piecewise linear regression shown
in fig 5.3. A test of the hypothesis that there is no break in the regression at the
threshold value X* can be conducted easily by noting the statistical significance of
the estimated differential slope coefficient ̂ 2 .
Summary:
1. Dummy variables taking values of 1 and 0 (r their linear transforms) are a
means of introducing qualitative regressors in regression analysis.
2. Dummy variables are a data-classifying device in that they divide a sample
into various subgroups based on qualities or attributes (sex, marital status,
race, religion, etc.) and implicitly allow one to run individual regressions
for each subgroup. If there are differences in the response of the regress
and to the variation in the quantitative variables in the various subgroups,
they will be reflected in the differences in the intercepts or slope
coefficients, or both, of the various subgroup regressions.
3. Although a versatile took, the dummy variable technique needs to be
handled carefully. First, if the regression contains a constant term, the
number of dummy variables must be less than the number of classifications
of each qualitative variable. Second, the coefficient attached to the dummy
Econometrics
Yadeta Ahmed
Econometrics
Yadeta Ahmed
Econometrics
Yadeta Ahmed
Chapter Five
Econometrics
Yadeta Ahmed
The bias arising from application of such procedure of estimation which treats each
equation of the simultaneous equations model as though it were a single model is known
as simultaneity bias or simultaneous equation bias. To avoid this bias we will use other
methods of estimation, such as, Indirect Least Square (ILS), Two Stage Least Square
(2SLS), three Stage Least Square(3SLS), Maximum Likelihood Methods and the Method
of Instrumental Variable (IV).
Econometrics
Yadeta Ahmed
(U ) 0 , (V ) 0
(U ) ,
2 2
u (V 2 ) u2
(U iU j ) 0 , (ViV j ) 0, also (UiVi ) 0;
0 0 1 2 U V
X Z 1 (11)
1 1 1 1 1 1 1 1 1
Applying OLS to the first equation of the above structural model will result in
biased estimator because cov( X iU i ) ( X iU j ) 0 . Now, let’s proof whether this
expression.
cov( XU ) X ( X )U (U )
0 1 2 U V 0 0 1 2
0 Z 1 Z U
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
U
( 1U V )
1 1 1
1
( 1U 2 UV )
1
1 1
1 2
(U 2 ) 1 u 0 , since E(UV) = 0
1 1 1 1 1 1
Econometrics
Yadeta Ahmed
Econometrics
Yadeta Ahmed
lagged endogenous variable. This is on the assumption that X’s symbolize the
exogenous variables and Y’s symbolize the endogenous variables. Thus, X t , X t 1
and Yt 1 are regarded as predetermined (exogenous) variables.
Q s 0 1 P 2 R U 2 (15)
Here P and Q are endogenous variables and Y and R are exogenous variables.
Structural models
A structural model describes the complete structure of the relationships among the
economic variables. Structural equations of the model may be expressed in terms
of endogenous variables, exogenous variables and disturbances (random
variables). The parameters of structural model express the direct effect of each
explanatory variable on the dependent variable. Variables not appearing in any
function explicitly may have an indirect effect and is taken into account by the
simultaneous solution of the system. For instance, a change in consumption affects
the investment indirectly and is not considered in the consumption function. The
effect of consumption on investment cannot be measured directly by any structural
parameter, but is measured indirectly by considering the system as a whole.
Econometrics
Yadeta Ahmed
Y C Z ----------------------------------------------------(17)
for >0 and 0<<1
where: C=consumption expenditure
Z=non-consumption expenditure
Y=national income
C and Y are endogenous variables while Z is exogenous variable.
C C Z U
C (1 ) Z U
U
C Z ----------------------------------(18)
1 1 1
Econometrics
Yadeta Ahmed
Equation (18) and (19) are called the reduced form of the structural model of the
above. We can write this more formally as:
Structural form equations Reduced form equations
C Y U U
C Z
1 1 1
Y CZ 1 U
Y Z
1 1 1
Parameters of the reduced form measure the total effect (direct and indirect) of a
change in exogenous variables on the endogenous variable. For instance, in the
above reduced form equation(18), measures the total effect of a unit
1
change in the non-consumption expenditure on consumption. This total effect is
1
, the direct effect, times ,the indirect effect.
1
The reduced form equations can be obtained in two ways:
1) To express the endogenous variables directly as a function of the
predetermined variables.
2) To solve the structural system of endogenous variables in terms of the
predetermined variables, the structural parameters, and the disturbance
terms.
Consider the following simple model for a closed economy.
Ct = a1Yt + U1 ---------------------------------------------------------(i)
It = b1Yt + b2Yt-1 + U2-----------------------------------------------(ii)
Yt = Ct +It + Gt-------------------------------------------------------(iii)
This model has three equations in three endogenous variables (Ct , It , and Yt ) and
two predetermined variables (Gt, andYt-1).
Econometrics
Yadeta Ahmed
To obtain the reduced form of this model, we may use two methods (direct method
and solving the structural model method).
Direct Method: Express the three endogenous variables(Ct , It , and Yt ) as
functions of the two predetermined variables (Gt, andYt-1) directly using ’s as the
parameters of the reduced form model as follows.
Ct = 11Yt-1 + 12Gt + V1 ------------------------------------(iv)
It , =21Yt-1 + 22Gt +V2 -------------------------------------(v)
Yt =31Yt-1 + 32Gt + V3 ------------------------------------(vi)
Note: 11 , 12 , 21 , 22 , 31 , and 32 are reduced from parameters. By solving the
structural system of endogenous variables in terms of predetermined variables,
structural parameters and disturbances, the expressions for the reduced parameters
can be obtained easily. For instance, the third structural equation (iii) can be
expressed in reduced form as follows:
Yt = b2/ (1-a1-b1)Yt-1 + 1/(1-a1-b1) Gt + (U1 +U2)/ (1-a1-b1). This equation is
obtained by simply substituting structural equations (i) and (ii) in (iii). Form this
expression: 31 = b2/ (1-a1-b1)
32 = b2/ (1-a1-b1)
Test yourself Questions:
a) Determine the reduced form equations for the structural equations (ii) and
(iii).
b) Indicate the expressions for 11 , 12, 21 , and 22 form (a) above.
How to estimate the reduced form parameters?
The estimates of the reduced from coefficients (’s ) may be obtained in two ways.
1) Direct estimation of the reduced coefficients by applying OLS.
2) Indirect estimation of the reduced form coefficients:
Steps:
Econometrics
Yadeta Ahmed
Econometrics
Yadeta Ahmed
In the above illustration, as usual, the X’s and Y’s are exogenous and endogenous
variables respectively. The disturbance terms follow the following assumptions.
(U 1U 2 ) (U 1U 3 ) (U 2U 3 ) 0
The above assumption is the most crucial assumption that defines the recursive
model. If this does not hold, the above system is no longer recursive and OLS is
also no longer valid. The first equation of the above system contains only the
exogenous variables on the right hand side. Since by assumption, the exogenous
variable is independent of U 1 , the first equation satisfies the critical assumption of
the OLS procedure. Hence OLS can be applied straight forwardly to this equation.
Consider the second equation. It contains the endogenous variable Y1 as one of the
explanatory variables along with non-stochastic X’s. OLS can be applied to this
equation only if it can be shown that Y1 and U 2 are independent of each other. This
is true because U1, which affects Y1 is by assumption uncorrelated with U 2 , i.e.
(U 1U 2 ) 0 . Y1 acts as a predetermined variable in so far as Y2 is concerned.
Hence OLS can be applied to this equation. Similar argument can be stretched to
the 3rd equation because Y1 and Y2 are independent of U 3 . In this way, in the
recursive system OLS can be applied to each equation separately.
factor X 4 = disposable income. Finally the price obtained by the producer = Y3 can
be expressed in terms of the retail price Y2 and exogenous factor X j = the cost of
Econometrics
Yadeta Ahmed
In the first equation, there are only exogenous variables and are assumed to be
independent of U 1 . In the second equation, the causal relation between Y1 and
Y2 is in one direction. Also Y1 is independent of U 2 and can be treated just like
Econometrics
Yadeta Ahmed
*
These methods of estimation are not discussed in this module as they are beyond the scope of this
introductory course.
Econometrics
Yadeta Ahmed
B A B B
A AU V
W V A
P -------------------(iv)
E
A B A B A B A B
Equation (iv) is what is known as a linear combination of (i) and (ii). The point
about equation (iv) is that it is of the same statistical form as the wage equation (i).
That is, it has the form:
W = constant + (constant)P + (constant)E + disturbance
Moreover, since A and B can take any values we like, this implies that our wage
price model generates an infinite number of equations such as (iv), which are all
Econometrics
Yadeta Ahmed
statistically indistinguishable from the wage equation (i). Hence, if we apply OLS
or any other technique to data on W, P and E in an attempt to estimate the wage
equation, we can’t know whether we are actually estimating (i) rather than one of
the infinite number of possibilities given by (iv). Equation (i) is said to be
unidentified, and consequently there is now no way in which unbiased or even
consistent estimators of its parameters may be obtained.
Notice that, in contrast, price equation (ii) cannot be confused with the linear
combination (iv), because it is a relationship involving W and P only and does not,
like (iv), contain the variable E. The price equation (ii) is therefore said to be
identified, and in principle it is possible to obtain consistent estimates of its
parameters. A function (an equation) belonging to a system of simultaneous
equations is identified if it has a unique statistical form, i.e. if there is no other
equation in the system, or formed by algebraic manipulations of the other
equations of the system, contains the same variables as the function(equation) in
question.
Identification problems do not just arise only on two equation-models. Using the
above procedure, we can check identification problems easily if we have two or
three equations in a given simultaneous equation model. However, for ‘n’
equations simultaneous equation model, such a procedure is very cumbersome. In
general for any number of equations in a given simultaneous equation, we have
two conditions that need to be satisfied to say that the model is in general
identified or not. In the following section we will see the formal conditions for
identification.
5.5. Formal Rules (Conditions) for Identification
Identification may be established either by the examination of the specification of
the structural model, or by the examination of the reduced form of the model.
Econometrics
Yadeta Ahmed
Traditionally identification has been approached via the reduced form. Actually the term
‘identification’ was originally used to denote the possibility (or impossibility) of
deducing the values of the parameters of the structural relations from a knowledge of the
reduced form parameters. In this section we will examine both approaches. However, we
think that the reduced form approach is conceptually confusing and computationally
more difficult than the structural model approach, because it requires the derivation of the
reduced from first and then examination of the values of the determinant formed form
some of the reduced form coefficients. The structural form approach is simpler and more
useful.
In applying the identification rules we should either ignore the constant term, or, if we
want to retain it, we must include in the set of variables a dummy variable (say X0) which
would always take on the value 1. Either convention leads to the same results as far as
identification is concerned. In this chapter we will ignore the constant intercept.
5.5.1. Establishing identification from the structural form of the model
There are two conditions which must be fulfilled for an equation to be identified.
1. The order condition for identification
This condition is based on a counting rule of the variables included and excluded
from the particular equation. It is a necessary but not sufficient condition for the
identification of an equation. The order condition may be stated as follows.
For an equation to be identified the total number of variables (endogenous and
exogenous) excluded from it must be equal to or greater than the number of
endogenous variables in the model less one. Given that in a complete model the
number of endogenous variables is equal to the number of equations of the model,
the order condition for identification is sometimes stated in the following
equivalent form. For an equation to be identified the total number of variables
excluded from it but included in other equations must be at least as great as the
number of equations of the system less one.
Let: G = total number of equations (= total number of endogenous variables)
Econometrics
Yadeta Ahmed
Order condition:
( K M ) (G 1)
; that is, the order condition is not satisfied.
(15 11) (10 1)
b. For the second equation we have
G 10 K 15 M 5
order condition:
( K M ) (G 1)
; that is, the order condition is satisfied.
(15 5) (10 1)
The order condition for identification is necessary for a relation to be identified,
but it is not sufficient, that is, it may be fulfilled in any particular equation and yet
the relation may not be identified.
2. The rank condition for identification
The rank condition states that: in a system of G equations any particular equation
is identified if and only if it is possible to construct at least one non-zero
determinant of order (G-1) from the coefficients of the variables excluded from
that particular equation but contained in the other equations of the model. The
Econometrics
Yadeta Ahmed
y 2 y 3 x3 u 2
y3 y1 y 2 2 x3 u 3
where the y’s are the endogenous variables and the x’s are the predetermined
variables. This model may be rewritten in the form
y1 3 y 2 0 y3 2 x1 x 2 0 x3 u1 0
0 y1 y 2 y3 0 x1 0 x2 x3 u 2 0
y1 y 2 y3 0 x1 0 x2 2 x3 u 3 0
Ignoring the random disturbance the table of the parameters of the model is as
follows:
Variables
Equations Y1 Y2 Y3 X1 X2 X3
1st equation -1 3 0 -2 1 0
2nd equation 0 -1 1 0 0 1
3rd equation 1 -1 -1 0 0 -2
Secondly. Strike out the row of coefficients of the equation which is being
examined for identification. For example, if we want to examine the identifiability
of the second equation of the model we strike out the second row of the table of
coefficients.
Thirdly. Strike out the columns in which a non-zero coefficient of the equation
being examined appears. By deleting the relevant row and columns we are left
with the coefficients of variables not included in the particular equation, but
contained in the other equations of the model. For example, if we are examining
Econometrics
Yadeta Ahmed
for identification the second equation of the system, we will strike out the second,
third and the sixth columns of the above table, thus obtaining the following tables.
Table of structural parameters Table of parameters of excluded variables
Y1 Y2 Y3 X1 X2 X3 Y3 X1 X2
st
1 -1 3 0 -2 1 0 -1 -2 1
2nd 0 -1 1 0 0 1
3rd 1 -1 -1 0 0 -2 1 0 0
Fourthly. Form the determinant(s) of order (G-1) and examine their value. If at
least one of these determinants is non-zero, the equation is identified. If all the
determinants of order (G-1) are zero, the equation is underidentified.
In the above example of exploration of the identifiability of the second structural
equation we have three determinants of order (G-1)=3-1=2. They are:
1 2 2 1 1 1
1 0 2 0 3 0
1 0 0 0 1 0
(the symbol stands for ‘determinant’) We see that we can form two non-zero
determinants of order G-1=3-1=2; hence the second equation of our system is
identified.
Fifthly. To see whether the equation is exactly identified or overidentified we use
the order condition ( K M ) (G 1). With this criterion, if the equality sign is
satisfied, that is if ( K M ) (G 1) , the equation is exactly identified. If the
inequality sign holds, that is, if ( K M ) (G 1) , the equation is overidentified.
In the case of the second equation we have:
G=3 K=6 M=3
And the counting rule ( K M ) (G 1) gives
(6-3)>(3-1)
Therefore the second equation of the model is overidentified.
The identification of a function is achieved by assuming that some variables of the
model have zero coefficient in this equation, that is, we assume that some
Econometrics
Yadeta Ahmed
variables do not directly affect the dependent variable in this equation. This,
however, is an assumption which can be tested with the sample data. We will
examine some tests of identifying restrictions in a subsequent section. Some
examples will illustrate the application of the two formal conditions for
identification.
Example 1. Assume that we have a model describing the market of an agricultural
product. From the theory of partial equilibrium we know that the price in a market
is determined by the forces of demand and supply. The main determinants of the
demand are the price of the commodity, the prices of other commodities, incomes
and tastes of consumers. Similarly, the most important determinants of he supply
are the price of the commodity, other prices, technology, the prices of factors of
production, and weather conditions. The equilibrium condition is that demand be
equal to supply. The above theoretical information may be expressed in the form
of the following mathematical model.
D a0 a1 P1 a 2 P2 a3Y a 4 t u
D b0 b1 P1 b2 P2 b3C b4 t w
DS
Where: D= quantity demanded
S= quantity supplied
The above model is mathematically complete in the sense that it contains three
equations in three endogenous variables, D,S and P1. The remaining variables, Y,
P2, C, t are exogenous. Suppose we want to identify the supply function. We
apply the two criteria for identification:
Econometrics
Yadeta Ahmed
1. Order condition: ( K M ) (G 1)
In our example we have: K=7 M=5 G=3
Therefore, (K-M)=(G-1) or (7-5)=(3-1)=2
Consequently the second equation satisfies the first condition for identification.
2. Rank condition
The table of the coefficients of the structural model is as follows.
Variables
Equations P1 P2 t S C
D Y
st
1 equation -1 a1 a2 a3 a4 0 0
2nd equation 0 b1 b2 0 b4 -1 b3
3rd equation 1 0 0 0 0 1 0
Following the procedure explained earlier we strike out the second row an the
second, third, fifth, sixth and seventh columns. Thus we are left with the table of
the coefficients of excluded variables:
Complete table of Table of parameters of
Structural parameters variables excluded from
the second equation
-1 a1 a2 a3 a4 0 0 -1 a3
0 b1 b2 0 b4 1 b3
1 0 0 00 1 1 -1 0
From this table we can form only one non-zero determinant of order
(G-1) = (3-1) =2
1 a3
(0)(1) (1)(a3 ) a3
1 0
Econometrics
Yadeta Ahmed
We strike out the first row and the three first columns of the table and thus obtain
the table of coefficients of excluded variables.
Econometrics
Yadeta Ahmed
-1 a1 a 2 0 0 0
0 0 0 -1 b1 0 -1 b1 0
0 c1 -1 0 0 0 0 0 0
1 -1 0 1 0 1 -1 0 0
We evaluate the determinant of this table. Clearly the value of this determinant is
zero, since the second row contains only zeros. Consequently we cannot form any
nonzero determinant of order 3(=G-1). The rank condition is violated. Hence we
conclude that the consumption function is not identified, despite the satisfaction of
the order criterion.
B. The investment function is overidentified
1. Order condition
The investment function includes two variables. Hence
K-M = 6-2
Clearly (K-M) > (G-1), given that G-1=3. The order condition is fulfilled.
2. Rank condition
Deleting the second row and the fourth and fifth columns of the structural
coefficients table we obtain .
Complete table of structural Table of coefficients of
Parameters excluded variables
-1 a1 a2 0 0 0
0 0 0 -1 b1 0 -1 a1 a2 0
0 c1 -1 0 0 0 0 c1 -1 0
1 -1 0 1 0 1 -1 -1 0 1
The value of the first 3x3 determinant of the parameters of excluded variables is
c1 1 0 1 0 c1
1 1 a1 a2 1 a1 a 2 c1 0
1 0 1 0 1 1
(provided a1 a2 c1 1)
The rank condition is satisfied since we can construct at least one non-zero
determinant of order 3=(G-1).
Econometrics
Yadeta Ahmed
Applying the counting rule ( K M ) (G 1) we see that the inequality sign holds:
4>3; hence the investment function is overidentified.
Econometrics
Yadeta Ahmed
y 2 b23 y3 y 23 x3 u 2
This model is complete in the sense that it contains three equations in three
endogenous variables. The model contains altogether six variables, three
endogenous ( y1 , y 2 , y3 ) and three exogenous ( x1 , x2 , x3 ).
The reduced form of the model is obtained by solving the original equations for
the exogenous variables. The reduced form in the above example is:
y1 11 x1 12 x 2 13 x3 v1
y 2 21 x1 22 x2 23 x3 v2
y3 31 x1 32 x 2 33 x3 v3
Econometrics
Yadeta Ahmed
Equations x1 x2 x3
1st equation: y1 11 12 13
2nd equation: y 2
3rd equation: y 3 21 22 23
31 32 33
Strike out the rows corresponding to endogenous variables excluded from the
particular equation being examined for identifiability. Also strike out all the
columns referring to exogenous variables included in the structural form of the
particular equation.
After these deletions we are left with the reduced form coefficients of exogenous
variables excluded (absent) from the structural equation. For example, assume that
we are investigating the identification procedure are found by striking out the first
row (since y1 , does not appear in the second equation) and the third column
(since x3 , is included in this equation).
Complete table of reduced Table of reduced form
form coefficients coefficients of excluded
exogenous variables
11 12 13 21 22
21 22 23 31
31 32 33 32
Thirdly. Examine the order of the determinants of the π’s of excluded exogenous
variables and evaluate them. If the order of the larges non-zero determinant is
G*-1, the equation is identified. Otherwise the equation is not identified.
Econometrics