2018, Study Session # 3, Reading # 10
“MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS”
Notation:
MSR = Mean Regression Sum of Squares
MSE = Mean Squared Error
RSS = Regression Sum of Squares
SSE = Sum of Squared Errors/Residuals
α = Level of Significance
Fc = Critical F (taken from F-Distribution Table)
H0 = Null Hypothesis
Ha = Alternative Hypothesis
X = Independent Variable
Y = Dependent Variable
F = F-Statistic (calculated)

1. INTRODUCTION

Multiple linear regression models are more sophisticated: they incorporate more than one independent variable.
2. MULTIPLE LINEAR REGRESSION

Allows determining the effects of more than one independent variable on a particular dependent variable.

Y_i = b_0 + b_1 X_{1i} + b_2 X_{2i} + \dots + b_k X_{ki} + \varepsilon_i

b_1 tells the impact on Y of changing X1 by 1 unit, keeping the other independent variables the same.
Individual slope coefficients (e.g. b1) in multiple regression are known as partial regression/slope coefficients.
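As a sketch (not part of these notes), such a model can be estimated with any OLS routine; the snippet below uses Python's statsmodels on simulated data, with all variable names and coefficient values illustrative:

```python
# Minimal sketch: estimating Y = b0 + b1*X1 + b2*X2 + e on simulated data.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 100
X = rng.normal(size=(n, 2))            # two independent variables X1, X2
y = 1.0 + 0.5 * X[:, 0] - 0.3 * X[:, 1] + rng.normal(scale=0.5, size=n)

X_const = sm.add_constant(X)           # adds the intercept column for b0
results = sm.OLS(y, X_const).fit()
print(results.params)                  # [b0_hat, b1_hat, b2_hat] -- the partial slope coefficients
```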
2.1 Assumption of the Multiple Linear Regression Model
Relationship b/w Y and X_1, X_2, ..., X_k is linear.
Independent variables are not random and no exact linear relationship exists
b/w 2 or more independent variables.
Expected value of error terms is 0.
Variance of error term is same for all observations.
Error term is uncorrelated across observations.
Error term is normally distributed.
2.2 Predicting the Dependent Variable in a Multiple Regression Model
Obtain estimates \hat{b}_0, \hat{b}_1, \hat{b}_2, ..., \hat{b}_k of the regression parameters b_0, b_1, b_2, ..., b_k.
Determine the assumed values of the independent variables \hat{X}_{1i}, \hat{X}_{2i}, ..., \hat{X}_{ki}.
Compute the predicted value of \hat{Y}_i using:

\hat{Y}_i = \hat{b}_0 + \hat{b}_1 \hat{X}_{1i} + \hat{b}_2 \hat{X}_{2i} + \dots + \hat{b}_k \hat{X}_{ki}
To predict dependent variable:
Be confident that assumptions of the regression are met.
Predictions regarding X must be within reliable range of data used to estimate the model.
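A minimal sketch of these prediction steps on simulated data (the assumed X values are illustrative and chosen inside the sample range):

```python
# Sketch: fit a multiple regression, then predict Y for assumed X values.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
X = sm.add_constant(rng.normal(size=(100, 2)))     # [const, X1, X2]
y = X @ np.array([1.0, 0.5, -0.3]) + rng.normal(scale=0.5, size=100)
results = sm.OLS(y, X).fit()

# Assumed values for X1 and X2, kept within the range of the estimation data
x_new = np.array([[1.0, 0.2, -0.1]])               # [const, X1, X2]
print(results.predict(x_new))                      # Y_hat = b0 + b1*0.2 + b2*(-0.1)
```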
2.3 Testing Whether All Population Regression Coefficients Equals Zero
H0 ⇒ all slope coefficients are simultaneously = 0, i.e. none of the X variables helps explain Y.
To test this, the F-test is used.
t-tests on individual coefficients cannot be used.
F = \frac{RSS/k}{SSE/(n-(k+1))} = \frac{MSR}{MSE}
where:
MSR = RSS/k
MSE = SSE/(n-(k+1))
n = no. of observations
k = no. of slope coefficients
Decision rule ⇒ reject H0 if F > Fc (for given α).
It is a one-tailed test.
df numerator = k
df denominator = n-(k+1)
For given k and n, the test statistic for H0 (all slope coefficients equal to 0) is F_{k, n-(k+1)}.
In the F-distribution table for F_{k, n-(k+1)}, k represents the column and n-(k+1) represents the row.
“Significance of F” in the ANOVA table represents the p-value.
↑ F-statistic ⇒ ↓ chances of Type I error.
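A sketch of the joint F-test on simulated data; note that statsmodels names the regression sum of squares `ess` and the sum of squared residuals `ssr`, the reverse of the RSS/SSE labels used in these notes:

```python
# Sketch: joint F-test of H0: b1 = ... = bk = 0.
import numpy as np
import statsmodels.api as sm
from scipy.stats import f

rng = np.random.default_rng(2)
n, k = 100, 2
X = sm.add_constant(rng.normal(size=(n, k)))
y = X @ np.array([1.0, 0.5, -0.3]) + rng.normal(scale=0.5, size=n)
results = sm.OLS(y, X).fit()

# F = (RSS/k) / (SSE/(n-(k+1))) = MSR/MSE; statsmodels also reports it directly
F_manual = (results.ess / k) / (results.ssr / (n - (k + 1)))
F_crit = f.ppf(0.95, dfn=k, dfd=n - (k + 1))     # one-tailed critical F at alpha = 5%
print(F_manual, results.fvalue, results.f_pvalue)
print("reject H0" if results.fvalue > F_crit else "do not reject H0")
```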
2.4 Adjusted R²

R² ↑ with the addition of independent variables (X) in the regression.

\bar{R}^2 = 1 - \left(\frac{n-1}{n-k-1}\right)(1 - R^2)

When k ≥ 1 ⇒ R² > \bar{R}^2.
\bar{R}^2 can be –ve but R² is always +ve.
If \bar{R}^2 is used for comparing regression models:
Sample size must be the same.
Dependent variable must be defined in the same way.
A high \bar{R}^2 does not necessarily indicate the regression is well specified.
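A quick check of the adjusted R² formula against the value statsmodels reports, on simulated data:

```python
# Sketch: adjusted R^2 by hand vs. statsmodels' rsquared_adj.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
n, k = 60, 3
X = sm.add_constant(rng.normal(size=(n, k)))
y = X @ np.array([1.0, 0.4, 0.0, 0.0]) + rng.normal(size=n)
results = sm.OLS(y, X).fit()

r2 = results.rsquared
adj_r2 = 1 - ((n - 1) / (n - k - 1)) * (1 - r2)   # formula above
print(r2, adj_r2, results.rsquared_adj)           # adj_r2 matches rsquared_adj
```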
3. USING DUMMY VARIABLES IN REGRESSION
Dummy variable ⇒ takes 1 if particular condition is
true & 0 when it is false.
Diligence is required in choosing no. of dummy
variables.
Usually n-1 dummy variables are used
where n= no. of categories.
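A minimal sketch using pandas, assuming an illustrative 4-category variable (calendar quarters), so n-1 = 3 dummies are created:

```python
# Sketch: building n-1 dummy variables for n categories with pandas.
import pandas as pd

quarters = pd.Series(["Q1", "Q2", "Q3", "Q4", "Q1", "Q2"], name="quarter")
dummies = pd.get_dummies(quarters, drop_first=True)
print(dummies)   # columns Q2, Q3, Q4 only; Q1 is the omitted base category
```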
4. VIOLATIONS OF REGRESSION ASSUMPTIONS
4.1 Heteroskedasticity
Variance of errors differs across observations ⇒ heteroskedastic
Variance of errors is similar across observations ⇒ homoskedastic
Usually no systematic relationship exists b/w X & regression residuals.
If systematic relationship is present ⇒ heteroskedasticity can exist.
4.1.1 The Consequence of Heteroskedasticity
It can lead to mistakes in inference.
Does not affect consistency.
F-test becomes unreliable.
Due to biased estimators of standard errors, the t-test also becomes unreliable.
The most likely result of heteroskedasticity is that the:
estimated standard errors will be underestimated.
t-statistics will be inflated.
Ignoring heteroskedasticity leads to finding significant relationships that do not actually exist.
The problem becomes more serious when developing an investment strategy using regression analysis.
Unconditional heteroskedasticity ⇒ when the heteroskedasticity of the error variance is not correlated with the independent variables in the multiple regression.
Creates no major problems for statistical inference.
Conditional heteroskedasticity ⇒ when the heteroskedasticity of the error variance is correlated with the independent variables.
It causes the most problems.
Can be tested & corrected easily through many statistical software packages.
4.1.2 Testing for Heteroskedasticity
Breusch-Pagan test is widely used.
Regress the squared residuals of the regression on the independent variables.
If the independent variables explain much of the variation of the squared residuals ⇒ conditional heteroskedasticity exists.
H0 = no conditional heteroskedasticity exists.
Ha = conditional heteroskedasticity exists.
Breusch-Pagan test statistic = nR²
R²: from the regression of squared residuals on X
Critical value ⇒ from the χ² distribution.
df = no. of independent variables
Reject H0 if test statistic > critical value.
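A sketch of the Breusch-Pagan test via statsmodels, on data deliberately simulated so the error variance depends on the independent variable:

```python
# Sketch: Breusch-Pagan test on conditionally heteroskedastic data.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan

rng = np.random.default_rng(4)
n = 200
X = sm.add_constant(rng.uniform(1, 5, size=(n, 1)))
y = X @ np.array([1.0, 0.5]) + rng.normal(size=n) * X[:, 1]  # error variance grows with X1
results = sm.OLS(y, X).fit()

lm_stat, lm_pvalue, _, _ = het_breuschpagan(results.resid, X)  # LM statistic ~ n * R^2
print(lm_stat, lm_pvalue)   # small p-value -> reject H0 of no conditional heteroskedasticity
```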
4.1.3 Correcting for Heteroskedasticity
Robust Standard Errors:
Corrects the standard errors of estimated coefficients.
Also known as heteroskedasticity-consistent standard errors or White-corrected standard errors.
Generalized Least Squares:
Modifies the original equation.
Requires economic expertise to implement correctly on financial data.
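A sketch of White-corrected standard errors; statsmodels exposes them through the `cov_type` argument ('HC0' is White's original estimator, and the simulated data are illustrative):

```python
# Sketch: same coefficients, heteroskedasticity-consistent standard errors.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
X = sm.add_constant(rng.uniform(1, 5, size=(200, 1)))
y = X @ np.array([1.0, 0.5]) + rng.normal(size=200) * X[:, 1]

plain = sm.OLS(y, X).fit()
robust = sm.OLS(y, X).fit(cov_type="HC0")   # White-corrected standard errors
print(plain.bse, robust.bse)                # coefficients identical, std. errors differ
```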
4.2 Serial Correlation
Regression errors correlated across observations.
Usually arises in time-series regression.
4.2.1 The Consequences of Serial Correlation
Incorrect estimates of regression coefficient standard errors.
Parameter estimates become inconsistent & invalid when a lagged value of Y is included among the independent variables under serial correlation.
Positive serial correlation ⇒ positive (negative) errors ↑ the chance of subsequent positive (negative) errors.
Negative serial correlation ⇒ positive (negative) errors ↑ the chance of subsequent negative (positive) errors.
It leads to wrong inferences.
If positive serial correlation:
Standard errors underestimated.
t-statistics & F-statistics inflated.
↑ Type-I error.
If negative serial correlation:
Standard errors overestimated.
t-statistics & F-statistics understated.
↑ Type-II error.
4.2.2 Testing for Serial Correlation
Variety of tests, most common → Durbin-Watson test
DW = \frac{\sum_{t=2}^{T}(\hat{\varepsilon}_t - \hat{\varepsilon}_{t-1})^2}{\sum_{t=1}^{T}\hat{\varepsilon}_t^2}

where \hat{\varepsilon}_t = regression residual for period t.
For large sample sizes, the Durbin-Watson statistic is approximately DW ≈ 2(1-r),
where r = sample correlation b/w the regression residuals of periods t and t-1.
Values of DW can range from 0 to 4.
DW = 2 ⇒ r=0 ⇒ no serial correlation.
DW = 0 ⇒ r=1 ⇒ perfectly positively serially correlated.
DW = 4 ⇒ r = -1 ⇒ perfectly negatively serially correlated.
For positive serial correlation:
H0 ⇒ no positive serial correlation.
Ha ⇒ positive serial correlation.
DW < d_l ⇒ reject H0.
DW > d_u ⇒ do not reject H0.
d_l ≤ DW ≤ d_u ⇒ inconclusive.
For negative serial correlation:
H0 ⇒ no negative serial correlation.
Ha ⇒ negative serial correlation.
DW > 4 - d_l ⇒ reject H0.
DW < 4 - d_u ⇒ do not reject H0.
4 - d_u ≤ DW ≤ 4 - d_l ⇒ inconclusive.
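A sketch of the Durbin-Watson statistic on residuals from a regression simulated with AR(1) errors, so positive serial correlation is built in:

```python
# Sketch: Durbin-Watson statistic on serially correlated residuals.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

rng = np.random.default_rng(6)
n = 200
x = rng.normal(size=n)
e = np.zeros(n)
for t in range(1, n):                       # e_t = 0.7 * e_(t-1) + noise
    e[t] = 0.7 * e[t - 1] + rng.normal()
y = 1.0 + 0.5 * x + e

results = sm.OLS(y, sm.add_constant(x)).fit()
print(durbin_watson(results.resid))         # well below 2, consistent with DW ~ 2(1-r), r > 0
```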
4.2.3 Correcting for Serial Correlation
Adjust the coefficient standard errors:
Recommended method.
Hansen’s method ⇒ the most prevalent one.
Modify the regression equation:
Extreme care is required.
May lead to inconsistent parameter estimates.
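statsmodels does not expose an estimator under the name “Hansen’s method”; the sketch below uses the closely related Newey-West HAC (heteroskedasticity-and-autocorrelation-consistent) standard errors instead, with an illustrative lag choice:

```python
# Sketch: serial-correlation-consistent (HAC / Newey-West) standard errors.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(7)
n = 200
x = rng.normal(size=n)
e = np.zeros(n)
for t in range(1, n):                       # AR(1) errors as before
    e[t] = 0.7 * e[t - 1] + rng.normal()
y = 1.0 + 0.5 * x + e

hac = sm.OLS(y, sm.add_constant(x)).fit(cov_type="HAC", cov_kwds={"maxlags": 5})
print(hac.bse)   # coefficients unchanged; standard errors adjusted for serial correlation
```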
4.3 Multicollinearity
Occurs when two or more independent variables (X) are highly
correlated with each other.
Regression can be estimated but the results become problematic.
Serious practical concern due to commonly found approximate linear
relation among financial variables.
4.3.1 The Consequences of Multicollinearity
Difficulty in detecting significant relationships.
Estimates become extremely imprecise & unreliable though consistency is unaffected.
F-statistic is unaffected.
Standard errors of the regression coefficients can ↑, causing:
insignificant t-tests.
wide confidence intervals.
↑ Type II error.
4.3.2 Detecting Multicollinearity
Multicollinearity is a matter of degree rather than presence/absence.
Pairwise correlation does not necessarily indicate the presence of multicollinearity.
Pairwise correlation does not necessarily indicate the absence of multicollinearity.
With only 2 independent variables ⇒ pairwise correlation is a useful indicator.
Significant R², significant F-statistic, but insignificant t-statistics on the slope coefficients ⇒ the classic symptom of multicollinearity (see the sketch below).
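A sketch reproducing the classic symptom on simulated data in which X2 is nearly a copy of X1:

```python
# Sketch: high R^2 / significant F, yet insignificant individual t-tests.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(8)
n = 100
x1 = rng.normal(size=n)
x2 = x1 + rng.normal(scale=0.01, size=n)    # almost perfectly correlated with x1
y = 1.0 + 0.5 * x1 + 0.5 * x2 + rng.normal(size=n)

results = sm.OLS(y, sm.add_constant(np.column_stack([x1, x2]))).fit()
print(np.corrcoef(x1, x2)[0, 1])            # pairwise correlation near 1
print(results.rsquared, results.f_pvalue)   # high R^2, highly significant F
print(results.pvalues[1:])                  # individual slope t-tests insignificant
```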
4.3.3 Correcting Multicollinearity
Exclude one or more regression variables.
In many cases, experimentation is needed to determine the variable(s) causing multicollinearity.
5. MODEL SPECIFICATION AND ERRORS IN SPECIFICATION
Model specification ⇒ the set of variables included in the regression.
Incorrect specification leads to biased & inconsistent parameter estimates.
5.1 Principles of Model Specification
Model grounded on economic reasoning.
Functional form of variables compatible with nature of variables
Parsimonious ⇒ each included variable should play an essential role
Model is examined for the violation of regression assumptions.
Model is tested for validity & usefulness on out-of-sample data.
5.2 Misspecified Functional Form
One or more variables are omitted. If an omitted variable is correlated with the remaining variables, the error term will also be correlated with them, and the:
regression coefficient estimates can be biased & inconsistent.
estimated standard errors of the coefficients will be inconsistent.
One or more variables may require transformation.
Pooling of data from different samples that should not be pooled.
Can lead to spurious results.
5.3 Time-Series Misspecification (Independent Variables Correlated with Errors)
Including lagged dependent variables as independent variables when the errors are serially correlated.
Including a function of the dependent variable as an
independent variable.
Independent variables measured with error
5.4 Other Types of Time-Series Misspecification
Nonstationarity: variable properties, e.g. mean, are
not constant through time.
In practice nonstationarity is a serious problem.
6. MODELS WITH QUALITATIVE DEPENDENT VARIABLES
Qualitative dependent variables ⇒ dummy variables used as dependent instead of
independent.
Probit model ⇒ based on the normal distribution; estimates the probability:
of a discrete outcome, given the values of the independent variables used to explain that outcome.
that Y = 1, implying a condition is met.
Logit model:
Identical to the probit model except that it is based on the logistic distribution.
Both Logit and Probit models must be estimated using maximum likelihood methods.
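A sketch of both models in statsmodels on a simulated binary outcome; both `Probit` and `Logit` are fit by maximum likelihood:

```python
# Sketch: probit and logit on a simulated 0/1 outcome (Y = 1 when a condition is met).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(9)
n = 500
X = sm.add_constant(rng.normal(size=(n, 2)))
latent = X @ np.array([0.2, 1.0, -0.8]) + rng.normal(size=n)
y = (latent > 0).astype(float)              # discrete 0/1 dependent variable

probit = sm.Probit(y, X).fit(disp=0)        # normal-distribution link
logit = sm.Logit(y, X).fit(disp=0)          # logistic-distribution link
print(probit.predict(X[:3]))                # estimated P(Y = 1 | X)
print(logit.params)
```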
Discriminant analysis ⇒ can be used to create an overall score that is used for classification.
Qualitative dependent variable models can be used for portfolio management and business
management.
Copyright © FinQuiz.com. All rights reserved.