Regression
Analyses
SUN Chenshuo
Department of Information Systems & Analytics
Application
Advertising
Sales of a particular product in 200 different
markets
Advertising budgets in each of the markets, for
3 different media
TV, radio, and newspaper
Client: can control advertising budget, not sales
directly
A marketing plan to improve sales
Advertising
Sales (thousands of units)
Budget for TV, radio, newspaper in thousands of dollars
Questions
Is there a relationship between advertising budget and sales?
If so, how strong?
Given a budget, can we predict sales with high accuracy? Is
the relationship linear?
Which media contribute to sales?
All three? Just one or two?
How accurately can we estimate the effect of each
medium on sales?
Simple Linear Regression
Simple Linear Regression
Quantitative response
Y
Predictor variable X
Linear relationship: Y ≈ β0 + β1X
“regressing Y onto X”
sales ≈ β0 + β1TV
Simple Linear Regression
Y ≈ β0 + β1X
Model parameters or coefficients β0, β1
Training data used to estimate β̂0, β̂1
Prediction: ŷ = β̂0 + β̂1x
Simple Linear Regression:
Coefficient Estimates
& their Accuracy
Finding the Estimates
Training data: (x1, y1), (x2, y2), …, (xn, yn)
Find β̂0, β̂1 that fit the training data
well
Linear model: Line as close as
possible to the 200 points
Training: through Least Squares
Finding the Estimates
ŷ = β̂0 + β̂1x
i-th residual: ei = yi − ŷi
Residual Sum of Squares (RSS)
RSS = e1² + e2² + … + en²
Least Squares: Find β̂0, β̂1 that minimise RSS
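A minimal sketch of the least squares computation in NumPy, using synthetic data in place of the advertising dataset (all names and values here are illustrative, not the course data):

import numpy as np

rng = np.random.default_rng(0)
tv = rng.uniform(0, 300, size=200)               # synthetic TV budgets
sales = 7.0 + 0.05 * tv + rng.normal(0, 3, 200)  # synthetic sales with noise

# Closed-form least squares estimates for simple linear regression
x_bar, y_bar = tv.mean(), sales.mean()
beta1_hat = np.sum((tv - x_bar) * (sales - y_bar)) / np.sum((tv - x_bar) ** 2)
beta0_hat = y_bar - beta1_hat * x_bar

residuals = sales - (beta0_hat + beta1_hat * tv)
rss = np.sum(residuals ** 2)                     # the quantity being minimised
print(beta0_hat, beta1_hat, rss)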
[Figure: least squares fit for the regression of 'sales' onto 'TV'; β̂0 = 7.03, β̂1 = 0.0475]
Linear Regression
sales = 7.03 + 0.0475 TV
An additional $1,000 spent on TV
advertising is associated with approximately
47.5 additional units of the product sold
Unbiasedness
Using one set of observations, an estimate
may over/under estimate a parameter
Average of a large number of estimates
from a large number of datasets:
accurate estimate of parameter
No systematic over/under-estimation
Least squares estimates are unbiased
Accuracy of Coefficient Estimates
Least Square Estimates are unbiased
Average over many datasets will be
accurate
A single estimate may over/under-
estimate
By how much?
Accuracy of Coefficient Estimates
Standard Error
Confidence Interval
P-Value
Standard Error
Confidence Intervals
95% confidence interval: range of values
With 95% probability the range will
contain the true unknown parameter
value
[β̂0 − 2·SE(β̂0), β̂0 + 2·SE(β̂0)]
[β̂1 − 2·SE(β̂1), β̂1 + 2·SE(β̂1)]
Confidence Intervals
sales = 7.03 + 0.0475 TV
β0 : [6.130, 7.935]
β1 : [0.042, 0.053]
In the absence of any advertising, sales will,
on average, be between 6,130 and 7,935 units.
For each $1,000 increase in TV advertising, there will be
an average increase in sales of between 42 and 53 units.
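These intervals can be read off a fitted model; a sketch with statsmodels on the synthetic data from the earlier snippet (variable names are assumptions, not the course's code):

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
tv = rng.uniform(0, 300, size=200)
sales = 7.0 + 0.05 * tv + rng.normal(0, 3, size=200)

X = sm.add_constant(tv)            # adds the intercept column for beta0
model = sm.OLS(sales, X).fit()     # ordinary least squares fit

print(model.params)                # beta0_hat, beta1_hat
print(model.bse)                   # standard errors of the estimates
print(model.conf_int(alpha=0.05))  # 95% confidence intervals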
Statistical Hypothesis Testing
State Null and Alternative Hypotheses mathematically
Select relevant test statistic
Obtain distribution of test statistic under the null hypothesis
Select significance level: probability threshold below which null
hypothesis is rejected (usually 0.01 or 0.05)
Compute observed value of test statistic from data
P-value: probability, under the null hypothesis, of the observed value
(and more extreme values)
Reject null hypothesis if p-value is less than significance level
Hypothesis Test
Null Hypothesis
There is no relationship between X & Y
H0 : β1 = 0
Alternative Hypothesis
There is some relationship between X & Y
H1 : β1 ≠ 0
Hypothesis Test
t-statistic: t = β̂1 / SE(β̂1)
Is our estimate sufficiently far from zero, so we can be confident of
rejecting the null hypothesis?
If SE is low, then smaller non-zero estimates may be sufficient
If SE is high, then larger estimates are required
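A sketch of the computation, with a made-up slope estimate and standard error; the two-sided p-value uses the t distribution with n − 2 degrees of freedom:

import numpy as np
from scipy import stats

beta1_hat = 0.0475   # hypothetical slope estimate
se_beta1 = 0.0027    # hypothetical standard error
n = 200              # number of observations

t = beta1_hat / se_beta1                    # t-statistic under H0: beta1 = 0
p_value = 2 * stats.t.sf(abs(t), df=n - 2)  # two-sided p-value
print(t, p_value)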
Hypothesis Test
Small p-value (< 0.05)
Assuming the null hypothesis (i.e. in the absence
of any real association between X and
Y)
Unlikely to observe such a substantial
association (between X and Y) due to chance
Reject the null hypothesis
Accuracy of Coefficient Estimates
Standard Error
Confidence Interval
P-Value
Simple Linear Regression:
Model Fit
Accuracy of the model
Y ≈ β0 + β1X
Y = g(X) + ϵ
Y = β0 + β1X + ϵ
Even if we knew the true coefficients, we may not
have accurate predictions due to errors (that
have not been modelled)
Model Fit
Residual Standard Error (RSE)
Estimate of the standard deviation of the error
RSE = √( RSS / (n − 2) )
RSS = e1² + e2² + … + en² = ∑i (yi − ŷi)²
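A quick check on synthetic data (same assumed setup as the earlier snippets): the RSE should recover the noise standard deviation used to generate the data.

import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 300, 200)
y = 7.0 + 0.05 * x + rng.normal(0, 3, 200)   # true error sd is 3

# Closed-form simple linear regression fit
b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

rss = np.sum((y - y_hat) ** 2)
rse = np.sqrt(rss / (len(y) - 2))   # estimate of the error standard deviation
print(rse)                          # should be close to 3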
Model Fit
R² = 1 − RSS/TSS = 1 − ∑i (yi − ŷi)² / ∑i (yi − ȳ)²
TSS: total variance in Y (before regression)
RSS: variance left unexplained (after regression)
R-squared: proportion of variability in Y that can be
explained using X
ȳ: mean of the yi
Model Fit
R-squared: proportion of variability in Y that can be
explained using X
~1: Good fit, large proportion explained
~0: Regression did not explain much variability
Linear model wrong and/or inherent error high
Model Fit
sales = 7.03 + 0.0475 TV
R-squared: 0.61
61% of variability in sales is
explained by regression on
TV
Model Fit
For simple linear regression
R-squared = squared linear correlation
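A sketch verifying this identity on synthetic data (same assumed setup as the earlier snippets):

import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 300, 200)
y = 7.0 + 0.05 * x + rng.normal(0, 3, 200)

b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

r2 = 1 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)
corr = np.corrcoef(x, y)[0, 1]   # sample linear correlation
print(r2, corr ** 2)             # equal for simple linear regression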
Simple Linear Regression
sales ≈ β0 + β1TV
[Figure: least squares fit of 'sales' onto 'TV', annotated with β̂0 and β̂1]
Least Squares Estimates
Accuracy of Estimates: SE, Confidence Intervals, P-value
Model Fit: RSE, R-squared
Multiple Linear Regression
Multiple Predictors
[Figure: three scatterplots of Sales against TV, Radio, and Newspaper budgets]
Multiple Linear Regression
Y = β0 + β1X1 + β2X2 + … + βpXp + ϵ
βj : average effect on Y of one unit increase
in Xj holding all other predictors fixed
sales = β0 + β1TV + β2radio + β3newspaper + ϵ
Regression Coefficients
ŷ = β̂0 + β̂1x1 + β̂2x2 + … + β̂pxp
Least Squares Estimate
Find coefficients β̂0, …, β̂p to minimise RSS:
RSS = ∑i (yi − ŷi)²
Multiple regression: average effect of Xj keeping other predictors fixed
Simple regression: average effect of Xj ignoring other predictors
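A sketch of the multiple least squares fit via NumPy's lstsq, on synthetic stand-ins for the three media budgets (all names and true coefficients are made up):

import numpy as np

rng = np.random.default_rng(0)
n = 200
tv = rng.uniform(0, 300, n)
radio = rng.uniform(0, 50, n)
newspaper = rng.uniform(0, 100, n)
sales = 3.0 + 0.045 * tv + 0.19 * radio + rng.normal(0, 1.7, n)  # newspaper has no true effect

# Design matrix with an intercept column; lstsq minimises the RSS
X = np.column_stack([np.ones(n), tv, radio, newspaper])
coefs, *_ = np.linalg.lstsq(X, sales, rcond=None)
print(coefs)   # beta0_hat ... beta3_hat; newspaper's coefficient should be near 0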
Predictors -> Response
Is at least one of the predictors useful in
predicting the response?
Hypothesis Test
Null hypothesis
β1 = β2 = … = βp = 0
Alternative hypothesis
At least one βj is non-zero
Hypothesis Test
F = [ (TSS − RSS)/p ] / [ RSS/(n − p − 1) ]
TSS = ∑i (yi − ȳ)², RSS = ∑i (yi − ŷi)²
F-statistic
Under the null, F is expected to be close to 1; for large n, an F-statistic even modestly above 1 is evidence against the null
F-statistic: 570.3
P-value: significant
At least one of the 3 media budgets has an effect on sales
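Computing F directly from TSS and RSS, continuing the synthetic multiple-regression sketch above; the p-value comes from the F distribution with (p, n − p − 1) degrees of freedom:

import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n, p = 200, 3
tv = rng.uniform(0, 300, n)
radio = rng.uniform(0, 50, n)
newspaper = rng.uniform(0, 100, n)
sales = 3.0 + 0.045 * tv + 0.19 * radio + rng.normal(0, 1.7, n)

X = np.column_stack([np.ones(n), tv, radio, newspaper])
coefs, *_ = np.linalg.lstsq(X, sales, rcond=None)
y_hat = X @ coefs

rss = np.sum((sales - y_hat) ** 2)
tss = np.sum((sales - sales.mean()) ** 2)
F = ((tss - rss) / p) / (rss / (n - p - 1))
p_value = stats.f.sf(F, p, n - p - 1)   # upper-tail probability
print(F, p_value)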
Model Fit
RSE
R-squared
Predictors              Response   R-squared   RSE
TV                      Sales      0.61        3.26
TV, Radio               Sales      0.89719     1.681
TV, Radio, Newspaper    Sales      0.8972      1.686
Linear Models:
Assumptions & Limitations
Linear Model: Assumptions
Y = β0 + β1X1 + β2X2 + … + βpXp + ϵ
Additive
Linear
Linear Model: Assumptions
Y = β0 + β1X1 + β2X2 + … + βpXp + ϵ
Additive
Effect of a change in a predictor on the
response is independent of other predictor
values
Linear
Change in response due to a unit change in a
predictor is constant, irrespective of the
value of the predictor
Non-linear Relations
Y = β0 + β1X1 + β2X2 + β3X1² + β4X2³ + …
Polynomial Regression
Model is linear in terms of coefficients
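A sketch of why this still fits with least squares: the polynomial terms are just extra columns in the design matrix, so the model stays linear in the coefficients (synthetic single-predictor data, made-up names):

import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-3, 3, 200)
y = 1.0 + 2.0 * x - 0.5 * x**2 + rng.normal(0, 1, 200)   # quadratic ground truth

# The squared term is just another column; least squares applies unchanged
X = np.column_stack([np.ones_like(x), x, x**2])
coefs, *_ = np.linalg.lstsq(X, y, rcond=None)
print(coefs)   # should be near [1.0, 2.0, -0.5]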
Potential Problems
Non-linearity of the data
Correlation of error terms
Outliers
Collinearity
Non-linearity
Conclusions from Linear Model unreliable
Predictions may be inaccurate
Binary
Classification
Classification
Will a credit card customer
default on his/her credit card
payment?
Data: annual income, monthly credit card
balance, previous defaulters
Classification
Outcome: Default, binary (Yes/No)
Predictor: Balance
Pr( default=Yes | balance )
Regression?
[Figure: probability of default vs. balance, two fits compared]
Left: linear regression -> negative probabilities!
Right: all probabilities lie between 0 and 1
Logistic Regression
odds: p(Y) / (1 − p(Y)) = e^(β0 + β1X) = e^(β0) · e^(β1X)
logit / log-odds: log( p(Y) / (1 − p(Y)) ) = β0 + β1X
Logistic function (sigmoid) lies between 0 and 1
A unit increase in X changes the log-odds by β1, or multiplies the odds by e^(β1)
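A numerical illustration of the odds interpretation, with made-up coefficients:

import numpy as np

def sigmoid(z):
    # logistic function: maps any real z into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

beta0, beta1 = -10.0, 0.005   # illustrative coefficients, not fitted values
x = 1500.0

p1 = sigmoid(beta0 + beta1 * x)
p2 = sigmoid(beta0 + beta1 * (x + 1))   # one-unit increase in x
odds1 = p1 / (1 - p1)
odds2 = p2 / (1 - p2)
print(odds2 / odds1, np.exp(beta1))     # the odds ratio equals e^beta1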
Regression Coefficients
Likelihood: ℓ(β0, β1) = ∏ i:yi=1 p(xi) · ∏ j:yj=0 (1 − p(xj))
Find coefficients that maximise likelihood
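In practice the likelihood is maximised numerically. A sketch with scikit-learn on synthetic default-style data (names, sizes, and coefficients are all assumptions):

import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
balance = rng.uniform(0, 2500, size=1000)
p_true = 1 / (1 + np.exp(-(-10.0 + 0.005 * balance)))   # assumed true model
default = rng.random(1000) < p_true                     # simulated Yes/No outcomes

# Very large C => negligible regularisation, i.e. close to plain maximum likelihood
clf = LogisticRegression(C=1e6, max_iter=1000).fit(balance.reshape(-1, 1), default)
print(clf.intercept_, clf.coef_)   # estimates of beta0, beta1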
Multiple Logistic Regression
log( p(Y) / (1 − p(Y)) ) = β0 + β1X1 + … + βpXp
Prediction:
p̂(y) = e^(β̂0 + β̂1x1 + … + β̂pxp) / (1 + e^(β̂0 + β̂1x1 + … + β̂pxp))
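The prediction is just the sigmoid of the fitted linear combination; a sketch checking this against scikit-learn's predict_proba (same synthetic setup as above):

import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
balance = rng.uniform(0, 2500, size=1000).reshape(-1, 1)
default = rng.random(1000) < 1 / (1 + np.exp(-(-10.0 + 0.005 * balance[:, 0])))
clf = LogisticRegression(C=1e6, max_iter=1000).fit(balance, default)

x_new = np.array([[1800.0]])                 # a new balance value
z = clf.intercept_ + x_new @ clf.coef_.T     # beta0_hat + beta1_hat * x
p_manual = 1 / (1 + np.exp(-z))              # the formula above
print(p_manual.ravel(), clf.predict_proba(x_new)[:, 1])   # these agree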
Decision Boundary
Logistic Regression
Target Variable: probabilities
Statistical Models
Linear Models
Linear Regression
Logistic Regression - Classification
Key Idea: Approximate Data with a Hyperplane
Evaluate fit of the model
Accuracy of model parameter estimates
Classification with 3-NN
Important