0% found this document useful (0 votes)

20 views29 pages

Simple Linear Regression

This document discusses simple linear regression and correlation analysis. It defines regression, independent and dependent variables, and linear regression equations. Examples are provided to demonstrate calculating regression equations and interpreting correlation coefficients and coefficients of determination.

Uploaded by

shawnray1222

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views29 pages

Simple Linear Regression

Uploaded by

shawnray1222

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 29

Simple Linear Regression and

Correlation Analysis

1
Simple Regression
Definition
A regression model is a mathematical equation that
describes the relationship between two or more variables.
A simple regression model includes only two variables:
one independent (X) and one dependent (Y).

X: The independent variable is the one that

explains
the dependent variable.
Y: The dependent variable is the one being explained.
A simple regression model that gives a straight-line relationship
between two variables is called a linear regression model.

Relationship between Food Expenditure and Income:

(a) Linear Relationship; (b) Nonlinear Relationship
Example of plotting a Linear Equation

ŷ  b0  b1 x

• b0 is the y intercept of the line.

• b1 is the slope coefficient of the line.
• ŷ is the estimated simple linear regression equation.
 The estimated simple linear regression equation is as
follows:

ŷ  b0  b1 x

• b0 is the y intercept of the line.

• b1 is the slope coefficient of the line.
• ŷis the estimated value of y for a given x value.
Example 1:

Table 1 shows the Incomes (in hundreds of dollars) and

Food Expenditures of seven households:

Table 1
(a) Find the regression equation for the data given in
Table 1. Use income as an independent
variable (X) and food expenditure (Y) as a
dependent variable.

The regression equation^

𝒚is=𝟏 .𝟓𝟎𝟕𝟑+𝟎 . 𝟐𝟓𝟐𝟓 𝒙
Coefficient of Determination, r 2

Correlation Coefficient, r
Coefficient of Determination, r2
The coefficient of determination or r2 is the percentage of
the total variation in the dependent variable (y) that is
explained by the independent variable (x).

Correlation Coefficient, r
Correlation measures the direction and the strength
of the linear association between two variables.
Features of Correlation
Coefficient, r
• Range between -1 and 1
• The closer to -1, the stronger the negative linear
relationship
• The closer to 1, the stronger the positive linear
relationship
• The closer to 0, the weaker the linear relationship
Example 1:

Table 1 shows the Incomes (in hundreds of dollars) and

Food Expenditures of seven households:

Table 1
(b) Determine and interpret the coefficient of
correlation (r) and coefficient of determination (r2).

r
r2
r = 0.9481, so r2 = 0.8988

Interpretation of r and r² :

The value of r = 0.95 indicates that the Income (x) and the Food
Expenditure (y) are positively correlated. The relationship is
strong. Those with higher income tends to spend more on food.

The value of r² = 0.8988 states that 89.88% of the total variation in

Food Expenditure (y) is explained by Income (x), while 10.12% is
explained by other factors.
(c) At the 5% significance level, does the data
provide sufficient evidence that there is a

correlation between the Income and the

Food Expenditure.

Step 1:
H0 : = 0 (no correlation between x and y)
HA: ≠ 0 (correlation exist between x and y)

Step 2:
We will use the t-distribution to perform this test.
Step 3:
Area in two tails = 0.05
df = n – 2 = 7 – 2 = 5
In Statistical Table (Table B.2), The critical values are
-2.5706 and 2.5706.

Reject H0 Reject H0

-2.5706 2.5706
17

Step 4:
From the summary output, the test statistic is 6.6641.
Since the test statistic > critical value, i.e. 6.6641 > 2.5706,
H0 is rejected. We conclude that there is a correlation between
Income (x) and Food Expenditure (y).
(d) At the 5% significance level, does the
data provide sufficient evidence that
there is a positive correlation
between the Income and the Food
Expenditure.

Step 1:
H0 : 0 (no positive correlation between x and y)
HA: > 0 (positive correlation exist between x and y)

Step 2:
We will use the t-distribution to perform this test.
Step 3:
Area in one tail = 0.05
df = n – 2 = 7 – 2 = 5
In Statistical Table (Table B.2), The critical value is
2.0150.

Reject H0

2.0150
21

Step 4:
From the summary output, the test statistic is 6.6641.
Since the test statistic > critical value, i.e. 6.6641 > 2.0150,
H0 is rejected. We conclude that there is a positive correlation
between Income (x) and Food Expenditure (y).
(e) Predict the food expenditure for a household
with income 90 (in hundreds of dollars).

From part (a), the regression equation is

^
𝒚 =𝟏 .𝟓𝟎𝟕𝟑+𝟎 . 𝟐𝟓𝟐𝟓 𝒙
^
𝒚 =𝟏 .𝟓𝟎𝟕𝟑 +𝟎 . 𝟐𝟓𝟐𝟓 (𝟗𝟎)
^
𝒚 =𝟐𝟒 . 𝟐𝟑𝟐𝟑
Table 2 lists t he driving experiences ( in

Example 2:
years) of eight drivers and t heir mont hly paid aut o insurance premiums (in dollars).

Let the driving experience be an independent variable (X), and the

insurance premium be a dependent variable (Y).
r Note: r is − 0.77
r2

^
𝒚 =𝟕𝟔 . 𝟔𝟔𝟎𝟒 −𝟏 . 𝟓𝟒𝟕𝟔24𝒙
The regression equation is
r = − 0.77, so r2 = 0.5929

Interpretation of r and r² :

The value of r = -0.77 indicates that the driving experience (x)

and the monthly auto insurance premium (y) are negatively
correlated. The relationship is strong but not very strong.

The value of r² = 0.59 states that 59% of the total variation in

insurance premiums (y) is explained by years of driving
experience (x), while 41% is explained by other factors.
Test whether there is a correlation between the driving
experiences (x) and monthly auto insurance premiums
(y) at 5% of level of significance.

Step 1:
H0 := 0 (no correlation between x and y)
HA: ≠ 0 (correlation exist between x and y)

Step 2:
We will use the t-distribution to perform this test.
Step 3:
Area in two tails = 0.05
df = n – 2 = 8 – 2 = 6
In Statistical Table (Table B.2), The critical values are
-2.447 and 2.447

Reject H0 Reject H0

-2.447 2.447
29

Step 4:
From the summary output, the test statistic is -2.9367.
Since the test statistic < critical value, i.e. -2.9367 < -2.447,
H0 is rejected. We conclude that the correlation exist between
driving experience (x) and auto insurance premium (y).

Linear Regression
No ratings yet
Linear Regression
24 pages
Correlation and Regression
No ratings yet
Correlation and Regression
10 pages
QBM 101 Lecture 10
No ratings yet
QBM 101 Lecture 10
45 pages
Topic 6 Simple Linear Regression
No ratings yet
Topic 6 Simple Linear Regression
57 pages
Correlation & Regression Analysis
100% (1)
Correlation & Regression Analysis
39 pages
Stat Chapter 6
No ratings yet
Stat Chapter 6
23 pages
Data Analysis Training Workshop - Day 3 Presentation
No ratings yet
Data Analysis Training Workshop - Day 3 Presentation
24 pages
Regreesion and Correlation Presentation Revised
No ratings yet
Regreesion and Correlation Presentation Revised
17 pages
Dr. Sufian M. Salih / Regression and Correlation
No ratings yet
Dr. Sufian M. Salih / Regression and Correlation
14 pages
Case2 1015 1018 1060 1116 1124
No ratings yet
Case2 1015 1018 1060 1116 1124
8 pages
Handout 5 Correlation and Regression (Recovered)
No ratings yet
Handout 5 Correlation and Regression (Recovered)
6 pages
Statistics: Correlation & Regression
100% (1)
Statistics: Correlation & Regression
9 pages
Stats Chapter 6 Lesson 3
No ratings yet
Stats Chapter 6 Lesson 3
30 pages
Week 8 - 10
No ratings yet
Week 8 - 10
72 pages
Correlation and Regression
No ratings yet
Correlation and Regression
4 pages
Correlation and Regression
No ratings yet
Correlation and Regression
62 pages
QBM101 Chapter10
No ratings yet
QBM101 Chapter10
40 pages
L5 Correlation & Regression - 082913
No ratings yet
L5 Correlation & Regression - 082913
14 pages
Lecture 6 Correlation and Regression
No ratings yet
Lecture 6 Correlation and Regression
10 pages
MBA LSCM: Correlation & Regression
No ratings yet
MBA LSCM: Correlation & Regression
50 pages
Regression Analysis Guide
No ratings yet
Regression Analysis Guide
25 pages
Simple Linear Regression Guide
No ratings yet
Simple Linear Regression Guide
8 pages
06 Correlation and Regression
No ratings yet
06 Correlation and Regression
63 pages
Topic 5: Correlation and Regression
No ratings yet
Topic 5: Correlation and Regression
30 pages
Correlation and Regression Analysis Guide
100% (2)
Correlation and Regression Analysis Guide
54 pages
Regression
No ratings yet
Regression
3 pages
Correlation Regression
100% (1)
Correlation Regression
55 pages
Pearson R Correlation: Test
No ratings yet
Pearson R Correlation: Test
5 pages
Correlation
No ratings yet
Correlation
38 pages
Business Stat CHAPTER 6
No ratings yet
Business Stat CHAPTER 6
5 pages
Chapter - 8 Linear Regression With The Matlab Code
No ratings yet
Chapter - 8 Linear Regression With The Matlab Code
39 pages
Raghunath Chatterjee Correlation Lecture
No ratings yet
Raghunath Chatterjee Correlation Lecture
40 pages
Intro to Correlation & Regression
No ratings yet
Intro to Correlation & Regression
71 pages
Lekcija 10 - Korelacija I Regresija
No ratings yet
Lekcija 10 - Korelacija I Regresija
76 pages
+part 02 - AMEFA - 2024 - Introduction and Repetition
No ratings yet
+part 02 - AMEFA - 2024 - Introduction and Repetition
78 pages
Econometrics Lectures
No ratings yet
Econometrics Lectures
22 pages
Corelation With Example
No ratings yet
Corelation With Example
112 pages
CH VII - Regression & Correlation
No ratings yet
CH VII - Regression & Correlation
7 pages
Chapter 3
No ratings yet
Chapter 3
15 pages
CHAP5.0 STA404 Bivariate Analysis
No ratings yet
CHAP5.0 STA404 Bivariate Analysis
7 pages
Correlation and Regression
No ratings yet
Correlation and Regression
54 pages
Research-Methodology-Litrature-Review of Fii N Fdi 2003
No ratings yet
Research-Methodology-Litrature-Review of Fii N Fdi 2003
12 pages
Correlation and Regression Analysis Guide
No ratings yet
Correlation and Regression Analysis Guide
53 pages
MKT3600 - L09 - Correlation and Regression
No ratings yet
MKT3600 - L09 - Correlation and Regression
51 pages
Applied Statistics 102 July 2016 With Business Case
No ratings yet
Applied Statistics 102 July 2016 With Business Case
91 pages
Regression Models - Follow
No ratings yet
Regression Models - Follow
7 pages
Lecture 16
No ratings yet
Lecture 16
35 pages
Assignment Individual MBA
No ratings yet
Assignment Individual MBA
5 pages
12.1correlation and Simple Linear
No ratings yet
12.1correlation and Simple Linear
45 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
35 pages
Regression Student
No ratings yet
Regression Student
20 pages
Chapter 8 Multiple Regression
No ratings yet
Chapter 8 Multiple Regression
24 pages
Correlation and Regression: Associate Professor Georgi Iskrov, PHD Department of Social Medicine and Public Health
No ratings yet
Correlation and Regression: Associate Professor Georgi Iskrov, PHD Department of Social Medicine and Public Health
28 pages
BSADM Question Bank - MBA Sem 1
No ratings yet
BSADM Question Bank - MBA Sem 1
48 pages
Lesson 9
No ratings yet
Lesson 9
4 pages
Correlation and Regression
No ratings yet
Correlation and Regression
7 pages
Chap 014
No ratings yet
Chap 014
25 pages
Parallel DFS
No ratings yet
Parallel DFS
10 pages
Statistics - Honours: Paper: CC-4 (Probability and Probability Distributions - II) Full Marks: 50
No ratings yet
Statistics - Honours: Paper: CC-4 (Probability and Probability Distributions - II) Full Marks: 50
2 pages
Application of Hilbert Huang Transform in The Field of Power Quality Events Analysis
No ratings yet
Application of Hilbert Huang Transform in The Field of Power Quality Events Analysis
7 pages
Factoring: Math 8 Teacher Jervy Josiah D. Bayang
No ratings yet
Factoring: Math 8 Teacher Jervy Josiah D. Bayang
23 pages
STMOL Lecture 1
No ratings yet
STMOL Lecture 1
54 pages
Inequalities and Wavy Curve Method
No ratings yet
Inequalities and Wavy Curve Method
51 pages
Prolog Programming: Techniques of
No ratings yet
Prolog Programming: Techniques of
7 pages
The Theoretical Framework of The Optimization of Public Transport Travel
No ratings yet
The Theoretical Framework of The Optimization of Public Transport Travel
7 pages
AP Statistics Free-Response Practice Test 8 Probability and Random Variables
No ratings yet
AP Statistics Free-Response Practice Test 8 Probability and Random Variables
2 pages
M.Tech Power Systems QBank
No ratings yet
M.Tech Power Systems QBank
6 pages
Quant Studies Chapter 7
No ratings yet
Quant Studies Chapter 7
14 pages
Tfy4280 T6a
No ratings yet
Tfy4280 T6a
9 pages
Foundations of Deep Reinforcement Learning Theory and Practice in Python (Laura Graesser, Wah Loon Keng) (Z-Library)
100% (3)
Foundations of Deep Reinforcement Learning Theory and Practice in Python (Laura Graesser, Wah Loon Keng) (Z-Library)
413 pages
Pushdown Automata Pdas: Fall 2006 Costas Busch - RPI 1
No ratings yet
Pushdown Automata Pdas: Fall 2006 Costas Busch - RPI 1
79 pages
Poisson Data MLE Guide for R Users
No ratings yet
Poisson Data MLE Guide for R Users
9 pages
Tutorial 6
No ratings yet
Tutorial 6
12 pages
Ba 4201 - QTDM - 20250514 - 0001
No ratings yet
Ba 4201 - QTDM - 20250514 - 0001
4 pages
Association Rules 1. Data Yang Digunakan Adalah Sebagai Berikut
No ratings yet
Association Rules 1. Data Yang Digunakan Adalah Sebagai Berikut
7 pages
Marketing Experts: Segmentation Insights
No ratings yet
Marketing Experts: Segmentation Insights
4 pages
Linear Algebra Exam Prep
No ratings yet
Linear Algebra Exam Prep
5 pages
Lec01 introductionToToC
No ratings yet
Lec01 introductionToToC
34 pages
Design Problem 1
No ratings yet
Design Problem 1
5 pages
S-19 - Random Variables and Bivariate Continuous Distributions
No ratings yet
S-19 - Random Variables and Bivariate Continuous Distributions
21 pages
Unit2 Maths Newly Edited
No ratings yet
Unit2 Maths Newly Edited
29 pages
Report - Numerical Analysis
No ratings yet
Report - Numerical Analysis
7 pages
DataDriven ReservoirModeling NAGAO THESIS 2021
No ratings yet
DataDriven ReservoirModeling NAGAO THESIS 2021
119 pages
Analisis Signal-To-Noise Ratio Pada Sinyal Audio Dengan Teknik Konvolusi
No ratings yet
Analisis Signal-To-Noise Ratio Pada Sinyal Audio Dengan Teknik Konvolusi
9 pages
Unit I Notes Machine Learning Techniques
No ratings yet
Unit I Notes Machine Learning Techniques
21 pages
Binomial Heaps: Manoj Kumar DTU, Delhi
No ratings yet
Binomial Heaps: Manoj Kumar DTU, Delhi
36 pages
WEKA for Movie Review Analysis
No ratings yet
WEKA for Movie Review Analysis
27 pages

Simple Linear Regression

Uploaded by

Simple Linear Regression

Uploaded by

Simple Linear Regression and

X: The independent variable is the one that

Relationship between Food Expenditure and Income:

• b0 is the y intercept of the line.

• b0 is the y intercept of the line.

Table 1 shows the Incomes (in hundreds of dollars) and

The regression equation^

Table 1 shows the Incomes (in hundreds of dollars) and

The value of r² = 0.8988 states that 89.88% of the total variation in

correlation between the Income and the

From part (a), the regression equation is

Let the driving experience be an independent variable (X), and the

The value of r = -0.77 indicates that the driving experience (x)

The value of r² = 0.59 states that 59% of the total variation in

You might also like