Selecting the best regression equation
Criteria for model selection:
There are several criteria for model selection such as
i. R², adjusted R² Criterion
ii. Akaike Information Criterion (AIC)
iii. Corrected Akaike Information Criterion (AICc)
iv. Schwarz Information Criterion or Bayes Information Criterion (BIC)
v. Mallows Cp Criterion
Criteria used for testing the validity/accuracy of the model:
i. Absolute Mean Error (AME)
ii. Root Mean Square Error (RMSE)
iii. Mean Absolute Percent Error (MAPE)
iv. Theil U Statistic
R², adjusted R² Criterion: We know that

R² = Explained SS / Total SS,   0 ≤ R² ≤ 1.

The closer R² is to 1, the better the regression fit: it measures how close the fitted y is to the observed y. Models with different dependent variables cannot be compared by R², and since R² never decreases when regressors are added, we compute the adjusted R², defined by

adjusted R² = 1 − (1 − R²)(n − 1)/(n − k)

where n is the sample size and k is the number of estimated parameters. It is clear that adjusted R² ≤ R². Adding a regressor does not necessarily increase the adjusted R²; it increases only when the absolute t-statistic of the added regressor exceeds unity. For comparisons of adjusted R², too, the regressand must be the same.
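A minimal NumPy sketch of these two quantities (function and variable names are illustrative; k counts all estimated parameters, including the intercept):

```python
import numpy as np

def r_squared(y, y_hat):
    """Coefficient of determination: 1 - residual SS / total SS."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def adjusted_r_squared(y, y_hat, k):
    """Adjusted R² for n observations and k estimated parameters."""
    n = len(y)
    return 1.0 - (1.0 - r_squared(y, y_hat)) * (n - 1) / (n - k)
```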
Akaike Information Criterion (AIC): AIC is an important and widely used statistic, by which, for example, we can determine the order of an autoregressive (AR) model. Akaike (1973) developed this statistic, and it is therefore known as the Akaike Information Criterion (AIC). It is defined as

AIC = n ln(RSS/n) + 2p

where RSS = residual sum of squares, p = number of parameters in the model, and n = sample size. The model with the smaller AIC is preferred.
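A small NumPy sketch of this definition (names are illustrative):

```python
import numpy as np

def aic(y, y_hat, p):
    """AIC = n * ln(RSS / n) + 2p for a model with p parameters."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    n = len(y)
    rss = np.sum((y - y_hat) ** 2)
    return n * np.log(rss / n) + 2 * p
```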
Corrected Akaike Information Criterion (AICc): Sometimes the AIC does not select the most efficient model order; Shibata (1976) showed that the AIC criterion is also not consistent. Hurvich and Tsai (1989) therefore proposed a bias-corrected version of AIC, defined as

AICc = AIC + 2(p + 2)(p + 3)/(n − p − 3)

Thus AICc is the sum of AIC and an additional non-stochastic penalty term. The model that adequately describes the series has the minimum AICc.
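A corresponding sketch, computing AIC inline so the function stands on its own (names are illustrative):

```python
import numpy as np

def aic_c(y, y_hat, p):
    """AICc = AIC + 2(p + 2)(p + 3) / (n - p - 3)."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    n = len(y)
    rss = np.sum((y - y_hat) ** 2)
    aic = n * np.log(rss / n) + 2 * p
    return aic + 2 * (p + 2) * (p + 3) / (n - p - 3)
```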
Bayes Information Criterion (BIC_sch): Several modifications of AIC have been suggested. One popular variant, the Bayes Information Criterion (BIC_sch), originally proposed by Schwarz (1978), is defined as

BIC_sch = n ln(RSS/n) + p ln(n)

A lower BIC value indicates a better model.
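The same pattern applies to BIC; a minimal sketch:

```python
import numpy as np

def bic(y, y_hat, p):
    """BIC = n * ln(RSS / n) + p * ln(n)."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    n = len(y)
    rss = np.sum((y - y_hat) ** 2)
    return n * np.log(rss / n) + p * np.log(n)
```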
Mallows Cp Criterion: To judge the performance of an equation we should consider the mean square error of the predicted values rather than their variance. The standardized total mean square error of prediction for the observed data is measured by

J_p = (1/σ²) Σ_{i=1}^{n} MSE(ŷ_i)

To estimate J_p, Mallows (1973) uses the statistic

C_p = RSS/σ̂² + (2p − n)

where σ̂² is an estimate of σ². In choosing a model we look for a low C_p.
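A minimal sketch of the C_p statistic, assuming σ̂² is supplied separately (it is commonly taken as the residual mean square of the full model):

```python
def mallows_cp(rss_p, sigma2_hat, n, p):
    """C_p = RSS_p / sigma2_hat + (2p - n).

    rss_p      -- residual sum of squares of the p-parameter submodel
    sigma2_hat -- estimate of sigma^2 (e.g. MSE of the full model)
    n, p       -- sample size and number of parameters in the submodel
    """
    return rss_p / sigma2_hat + (2 * p - n)
```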
Absolute Mean Error (AME):
The mean of the absolute deviations between the predicted and observed values is called the absolute mean error and is defined as

AME = (1/n_0) Σ |Y_i − Ŷ_i|

where n_0 = number of periods being forecast, Y_i = observed value and Ŷ_i = predicted value.
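A one-function NumPy sketch (names illustrative):

```python
import numpy as np

def ame(y_obs, y_pred):
    """Absolute mean error over the n_0 forecast periods."""
    y_obs, y_pred = np.asarray(y_obs, dtype=float), np.asarray(y_pred, dtype=float)
    return np.mean(np.abs(y_obs - y_pred))
```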
Root Mean Square Error (RMSE):
The square root of the average of the squared deviations of the predicted values from the observed values is known as the root mean square error. It is defined as

RMSE = √[ (1/n_0) Σ (Y_i − Ŷ_i)² ]
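And a corresponding sketch:

```python
import numpy as np

def rmse(y_obs, y_pred):
    """Root mean square error over the n_0 forecast periods."""
    y_obs, y_pred = np.asarray(y_obs, dtype=float), np.asarray(y_pred, dtype=float)
    return np.sqrt(np.mean((y_obs - y_pred) ** 2))
```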
Mean Absolute Percent Error (MAPE):
The mean of the absolute deviations of the predicted values from the observed values, each divided by the corresponding observed value, gives the mean absolute relative error; multiplied by 100 for ease of comparison, it is called the mean absolute percent error and is defined as

MAPE = (1/n_0) Σ (|Y_i − Ŷ_i| / Y_i) × 100
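A sketch of this measure (observed values are assumed to be non-zero):

```python
import numpy as np

def mape(y_obs, y_pred):
    """Mean absolute percent error; y_obs must contain no zeros."""
    y_obs, y_pred = np.asarray(y_obs, dtype=float), np.asarray(y_pred, dtype=float)
    return np.mean(np.abs(y_obs - y_pred) / np.abs(y_obs)) * 100.0
```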
Theil’s U Statistic: Theil’s U statistic is a relative accuracy measure that compares the
forecasted results with the results of forecasting with minimal historical data. The
formula for calculating Theil’s U statistic:
U = √[ Σ_{t=1}^{n−1} ((Ŷ_{t+1} − Y_{t+1}) / Y_t)²  /  Σ_{t=1}^{n−1} ((Y_{t+1} − Y_t) / Y_t)² ]

where Y_t is the actual value for time period t, n is the number of data points, and Ŷ_t is the forecasted value.
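A NumPy sketch of this ratio, assuming y_obs and y_pred are aligned so that y_pred[t] is the forecast of y_obs[t] (names illustrative):

```python
import numpy as np

def theil_u(y_obs, y_pred):
    """Theil's U: forecast errors relative to the naive no-change forecast."""
    y = np.asarray(y_obs, dtype=float)
    f = np.asarray(y_pred, dtype=float)
    num = np.sum(((f[1:] - y[1:]) / y[:-1]) ** 2)   # model forecast errors
    den = np.sum(((y[1:] - y[:-1]) / y[:-1]) ** 2)  # naive forecast errors
    return np.sqrt(num / den)
```

A value of U below 1 indicates that the forecasting method outperforms the naive no-change forecast.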
Stepwise regression:
Step-wise regression is one of several computer-based iterative variable-selection
procedures. In statistics, stepwise regression is a method of fitting regression models in
which the choice of predictor/explanatory variables is carried out by an automatic
procedure. In each step, a variable is considered for addition to or subtraction from the set
of explanatory variables based on some prespecified criterion. Usually, this takes the
form of a forward, backward, or combined sequence of F-tests or t-tests.
The main approaches for stepwise regression are:
Forward selection:
The forward selection procedure starts with an equation containing no
predictor/explanatory variables, only a constant term in the model. The first variable
included in the model is the one which has the highest R-Squared. At each step, select
the predictor variable that increases R-Squared the most. Stop adding variables when
none of the remaining variables are significant. Note that once a variable enters the
model, it cannot be deleted.
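A compact sketch of this procedure, assuming X is a pandas DataFrame of candidate predictors, y is the response, and statsmodels is available; the function name and the stopping rule at level alpha are illustrative:

```python
import statsmodels.api as sm

def forward_selection(X, y, alpha=0.05):
    """Add, at each step, the significant predictor that raises R² the most;
    stop when no remaining predictor is significant at level alpha."""
    selected, remaining = [], list(X.columns)
    while remaining:
        best = None
        for col in remaining:
            fit = sm.OLS(y, sm.add_constant(X[selected + [col]])).fit()
            if fit.pvalues[col] < alpha and (best is None or fit.rsquared > best[1]):
                best = (col, fit.rsquared)
        if best is None:          # nothing left that is significant
            break
        selected.append(best[0])  # once entered, a variable is never removed
        remaining.remove(best[0])
    return selected
```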
Backward elimination:
The backward elimination procedure starts with all predictor variables in the model. At each
step, the variable that is the least significant is removed. This process continues until
no non-significant variables remain. The user sets the significance level at which
variables can be removed from the model.
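A matching sketch under the same assumptions (pandas DataFrame X, statsmodels available, illustrative names):

```python
import statsmodels.api as sm

def backward_elimination(X, y, alpha=0.05):
    """Drop, at each step, the least significant predictor until every
    remaining predictor has a p-value below alpha."""
    selected = list(X.columns)
    while selected:
        fit = sm.OLS(y, sm.add_constant(X[selected])).fit()
        pvals = fit.pvalues.drop("const")
        worst = pvals.idxmax()      # least significant variable
        if pvals[worst] < alpha:    # everything left is significant
            break
        selected.remove(worst)
    return selected
```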
Stagewise regression:
Forward stagewise regression follows a very simple strategy for constructing a sequence
of sparse regression estimates: it starts with all coefficients equal to zero and iteratively
updates the coefficient of the variable that achieves the maximal absolute inner product
with the current residual.
Note: When the number of samples n is less than the number of parameters (the signal
dimension) p, we say the model is a sparse regression model.
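A minimal NumPy sketch of the incremental forward stagewise update, assuming the columns of X are standardized and y is centred; the step size and iteration count are illustrative:

```python
import numpy as np

def forward_stagewise(X, y, step=0.01, n_iter=1000):
    """Repeatedly nudge the coefficient of the predictor with the largest
    absolute inner product with the current residual."""
    X = np.asarray(X, dtype=float)
    beta = np.zeros(X.shape[1])
    residual = np.asarray(y, dtype=float).copy()
    for _ in range(n_iter):
        corr = X.T @ residual             # inner products with the residual
        j = int(np.argmax(np.abs(corr)))  # most correlated predictor
        delta = step * np.sign(corr[j])
        beta[j] += delta                  # small move in that coordinate
        residual -= delta * X[:, j]       # update the residual
    return beta
```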