Regression Analysis
Regression Analysis
6
REGRESSION ANALYSIS
OBJECTIVES
From our discussion in the previous chapter, we were able to find a way of determining whether
expressed by a
or nota relationship existed between two variables. If such a relationship can be
mathematical formula, we will then be able to use it for the purpose
of making predictions. The
strength of the relationship between the
Telability of any prediction wil, of course, depend on the
in the formula. Regression analysis attempts to establish the "nature ofrelationship"
Yatables included and thereby
variables-that is. to study the functional relationship between the variables
Vetween
or forecasting, A mathematical equation that allows us to
prediction,
Povde a mechanism for one or more other variables is called a regression
known values of
value of one variable from is called the derendent variable or
explained
equation. The variable whose value is to be predicted values of a dependent variable are
called
variable. The variables which are used to predict the analysis confined to
The regression
the study of
regression
variables. called simple
independent variables or explanatory
independent variable, is
independent variable
is
only two variables, a dependent variable and an
dependent variable
and the
analysis. When the relationship between thesimple linear regression. The
linear, the technique for prediction is called regression analysis.
linear
study of simple
to the two variables.
between
In this chapter ourselves relationship
we shallconfine determine the
linear
objective of simple linear regression is to
Quantital1ve
3emen Regress
() Tall
fathers have
short fathers
is more than to
the sons of average heights of their fathes. used criterion is the least
e s t a b l i s h a
reference to biometry.
Regression analysis, in the
values of one or more other
prediction Y = a t bX,
Fig. 6.1
of the unknown value of
one variable from the known
variables.
variable
The hen the method of least squares requires that
dependent variable or explained
variable whose value is to be predicted is called the a dependent variable are called e must determnine constants a and bso as to
variables which are used to predict the values of independen
by Galton, mentioned above. the heisl
minimize
variables or explanaton variables. In the study done vark S = (Y, -a - bX, +(Y, -a- bX, +.... (Y,- a- bX,}
the height of children was the dependent
the parents was the independent variable and DË+D} +..... . D?
definitions of the term regression.
In the following we shall give some important
ofthe average relationship between two or more variables in termt where D,= Y,- a-bX, represents the vertical deviation ofthe ith observed point from the line of
1. Regression is the measure
data. - M.M. Blair best fit, as indicated in Fig.6.1.The determination of aand bso as to minimize Scan be accomplished
of the original units of the by means of differential calculus. We omit the proof and state the final two equations which are
techniques in economics and business research, to find a
2. One of the most frequently used Used to determine the values of a and b. These equations, known as the normal equations for
relation beween two or more variables that are related causally, is regression analysis. estimating a and b, are given by
Taro Yamane .... ()
Y = na +b SX
3. The tem 'regression analysis 'refers to the methods by which estimates are made of the values XY = a )X + b Ex .. (2)
of a variable from a knowledge of the values of one or more other variables and to ie Solving Eqs. (1) and (2) simultaneously for a and b, we obtain
measurement of the errors involved in this estimation process. -Morris Hamburg
corresponding the
Fig 61) However, if we want to estimate or predict the value of observed
Xfrom known
points values
to the oflineY, (see
we
Y
m i n i m i z e s
the change by
regression of Yon
X and is given
EXY-2X)(E) willuse regression line of Xon Y which is the line of "best fit" in the sense that it minimizes the
n2 XY-(Ex)(EY) sumofthe squares of the horizontal distances from the observed points to the line (see Fig. 6.2).
nx?-(x)? (6) equations are not reversible
T h e t w or e g r e s s i o n
because of the simple reason that
for deriving these equations are quite different. However,
the basis and
shows that the line of regression of Y it
value of "a" clearly case of perfect correlation (positive or negative), the two regression lines maywould
be remarked
coincide. that in
assumptions
the on X
The formula (5)
estimating equation of the line of regression of Yon X
point (X, Y) and hence the can
passes through the
X=c+
dY
also be written as
.. ()
X On the other hand, if
to estimate a value of
Yfor a given value of we
This equation is then used
Xfor a given value of Y, we have to obtain the regression line of Xon X-c-dY
wish to estimate a value of
Y:
&Operations Research in
6.6
Quantitative
Techniques
Y the
Managemen 28
equation oftwo
variablesXand
following results =
7
= 4 and
estimation of regressjon 35
Example 2. In the n 7
were obtained : XY = 3900, N= 10
=6360, Y =2860, n XY -(E X)(EY) 7x151- 28 ×35
Y= 700, 5X2 [IP Uni%. BBA 2011
2X= 900,
Obtain two regression
equations. (Modifed) byx nx?-(x)? 7x140 -(28) 151-140
140 -112
11
039
Solution. We have (10x 3900) - (900 x 700) n XY (X)(EY)
- 7x151-28 x35 151-140 l1
NEXY-(EX)(E)
byy ny-(EY)?
byy NE X²-(x)
10 x6360-(900) 7x191-(35) 191-175 16 0.69
39000 -630000
-591000
= 0.792 Regression Equation of Yon Xis : Regression Equation of Xon Yis:
63600 -810000 -746400 Y-ø =byy (X- X) X-X - by (Y -Y)
(10 x 3900)-(900 x 700) Y-5 = 0.39 (X - 4)
NE XY - (X)(2) 10x2860-(700)
or,
Y= 3.44 + 0.39X
X-4 = 0.69 (Y- S)
or,