0% found this document useful (0 votes)

81 views8 pages

Ch. 8 Measures of Association

Uploaded by

Etsub Samuel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views8 pages

Ch. 8 Measures of Association

Uploaded by

Etsub Samuel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

CHAPTER 8: MEASURES OF ASSOCIATION

Session Learning Objectives:

By the end of this session students are expected to:

 Explain the purpose of correlation analysis

 Understand the use of scatter plots to study association
 Compute covariance and Karl Pearson’s coefficient of correlation
 Apply linear regression analysis to estimate the linear relationship between two
variables using the least square method
 Identify the coefficient of regression
 Describe and compute the coefficient of determination

Coefficient of Correlation: A measure of strength of linear relationship between 2 variables. It

ranges between -1 & 1. -1 denotes perfect negative relationship, 1 denotes perfect positive linear
relationship.

As r gets closer to (approximates) -1 or 1 strong correlation is implied; 0 indicates no linear

relationship; and r near 0 shows weak relationship. Correlation can be simple, partial and
multiple; positive and negative; linear and non linear. ~ does not necessarily imply causation.
Correlation could be clearly depicted on a scatter diagram.

E.g. high school GPA Vs. college GPA; Price vs Demand/quantity sold; Number of children vs.
annual demand. Height and weight; hot weather and ice cream consumption

Covariance: indicated direction of relationship; + = positive linear relationship (I,III)

-=negative relationship(II,IV) ; 0= no linear relationship (Even in all quadrants) It indicates the

location/quadrant of the points /values with greatest influence thus shows the direction.

Methods: i) scatter diagram ii) Karl Pearson method iii) Spearman’s rank correlation method

Simple Linear Regression: The simplest type of regression analysis involving one independent
variable and one dependent variable, in which the relationship between the variables is
approximated by a straight line.

1
It is a statistical method used to estimate unknown value of one from known value of another
given they are correlated.

Dependent variable: variable being predicted; Independent variable: Predictor variable.

Simple Linear Regression Model: explains how y is related to x and E.

Y=B0+B1X+E B0 ; B1 are parameters of the model and E accounts for the variability that
cannot be attributed to the relationship between x and y.

a+bx+E=Y E=Error assuming E=0 Regression equation of Y on X.

Estimated simple linear regression equation: y=bx+a is developed from sampe data using the
least square method

As it can be recalled, the coefficient of correlation, except for -1, 0, and +1, we cannot precisely
interpret its meaning. We can judge the coefficient of correlation in relation to its proximity to
only -1, 0, and +1. Fortunately, another measure that can be precisely interpreted is the
coefficient of determination, which is calculated by squaring the coefficient of correlation. For this
reason, it is denote as R2. The coefficient of determination measures the amount
(proportion) of variation in the dependent variable that is explained (accounted for) by the
variation in the independent variable. For instance, if the coefficient of correlation r =0.8711,
thus, the coefficient of determination is r2 (0.8711)2 = 0.7588; this indicates that 75.88% of the
variation in the dependent variable is explained by the independent variable. The remaining
24.12% is unexplained. The value of r2 ranges between 0 and 1 (inclusive). It cannot be
negative, hence it does not show the direction of relationship. When r 2 = 1, all the points on the
scatter diagram fall on the regression line and the entire variations are explained by the straight
line. On the other hand, when r2 = 0, none of the points on the scatter diagram falls on the
regression line, meaning thereby that there is no relationship between the two variables.

Example
ABU Furniture is a family business that has been selling to retail customers in the Merkato area
for many years. The company advertises extensively on radio, TV, and the Internet, emphasizing
low prices and easy credit terms. The owner would like to review the relationship between sales
and the amount spent on advertising. Below is information on sample of sales and advertising
expense for the last four months.
2
Month July August September October
Advertising Expense
2 1 3 4
(x) ($ million)
Sales Revenue(y)
7 3 8 10
($ million)

The owner wants to forecast sales on the basis of advertising expense.

a) Which variable is the dependent variable? Which variable is the independent variable?

b) Draw a scatter diagram.

c) Determine the sample covariance and interpret

d) Determine the correlation coefficient and interpret the result
e) Determine the estimated regression equation.

f) Interpret the values of a and b.

g) Draw the estimated regression line on the scatter diagram

h) Estimate sales when $3 million is spent on advertising.

i) Identify the coefficient of regression (byx)

j) Compute the coefficient of determination (r2) and interpret.

Answer

a) Independent variable: Advertisement expense; Dependent variable: Sales

b) Refer to g.

c) Covariance: Cov(x,y)= ∑(x- *(y- )/n-1 =11/4-1 = 11/3=3.67

Interpretation:
Cov(x,y)=3.667; there is a positive covariance between advertising expense and sales
d) Correlation coefficient: r = Cov (x,y)/Sx.Sy = 3.667/ (1.2910*2.9439)=0.9648
3
or

r= = =0.9648

Note:

Sx= = =1.2910 Sy= = =2.9439

Interpretation:
r=0.9648; there is a positive strong correlation between the advertising expense and sales

e) Estimated regression equation y=a+bx: =1.5+2.2X

Note:

b= =11/5=2.2 a=7-2.2(2.5)=1.5

f) The slope is 2.2. This indicates that an increase of $1 million in advertising will result in an
increase of $2.2 million in sales.
The intercept is 1.5. If there was no expenditure for advertising, sales would be $1.5 million.
g) The straight line graph of the estimated regression equation is drawn on the scatter diagram
(refer to a)
X 1 2 3 4
3.7 5.9 8.1 10.3

h) =1.5+2.2(3)=8.1million
i)Coefficient of regression (byx) is the slope (b)=2.2

b= Sx and Sy are respective standard deviations; r is the correlation coefficient.

This is another formula that can be used to find b; find data from d; b= =2.2
j) Coefficient of Determination: r was 0.9648, r2=0.9308
Interpretation: Ninety-three percent of the variation in sales is accounted for by advertising
expense.
4
Exercise
1. The following data refer to two variables promotional expenses and sales (1000 dollars)
collected in the context of a promotional study.
Promotional Expenses 10 12 15 23 20
Sales 14 17 23 25 21

a. Which variable is the dependent variable? Which variable is the independent variable?
b. Draw the scatter diagram
c. Calculate the covariance and interpret.
d. Calculate the correlation coefficient and interpret.
e. Determine the estimated regression equation.
f. Interpret the values of a and b.
g. Draw the estimated regression line on the scatter diagram
h. Estimate sales when $30 thousand is spent on advertising.
i. Identify the coefficient of regression (byx)
j. Compute the coefficient of determination (r2) and interpret.
Synopsis:

 The scatter diagram depicts relationships graphically; the covariance and the
coefficient of correlation describe the linear relationship numerically for interval or
ration scale data.
 Simple correlation analysis, which is concerned with measuring the relationship
between only one independent variable and the dependent variable.
 There are assumptions concerning (population) simple correlation analysis
 Correlation coefficient does not necessarily mean causation
 Correlation coefficient lies between -1 and 1 inclusive
 r=1 indicates positive perfect correlation, and r=-1 is negative perfect correlation;
while r=0 implies no correlation
 positive sign indicates direct relationship, while negative sign implies inverse
relationship

 Sample covariance Cov (x,y)=

 (x i  x )( yi  y )
n 1

 Pearson’s sample correlation coefficient r= or

5
r=

 The purpose of the simple linear regression equation is to quantify a linear relationship
between two variables (that are interval or ratio scale).
 In regression analysis, we estimate one variable based on another variable; the
variable being estimated is the dependent variable; and the variable used to make the
estimate or predict the value is the independent variable.
 The least squares criterion is used to determine the regression equation:

where a is the y intercept & b is the slope or coefficient of regression.

a and b are determined using the appropriate formula

 The slope of regression line is called the regression coefficient (byx)
 The coefficient of determination is a more objective measure of the degree of
relationship; it gives the percentage variation in the dependent variable that is
accounted for by the independent variable.
 The square of the correlation coefficient (r) is the coefficient of determination r 2; its
value ranges between 0 and 1 inclusive.
Wrap up Discussion Questions:

 Identify tools used to measure association?

 Does correlation imply cause and effect relationship?
 What is the difference between covariance and correlation?
 Explain how Pearsonian correlation coefficient is interpreted and what it measures
 What is the purpose of regression analysis?
 What is the difference between correlation coefficient and regression
 What is the coefficient of regression
 Explain the least square principle
 What is coefficient of determination? How is it measured?

Topic: Linear Correlation: Testing the Significance of Correlation Coefficient

Session Learning Objectives:

By the end of this session students are expected to:

6
 Test the significance of Correlation Coefficient of population

Reading Assignment Discussion:

 Why is test of significance of correlation coefficient conducted for the population?

Reading Text:

Typically, the null hypothesis of interest is that the population correlation =0, for if this
hypothesis is rejected at a specified level, we would conclude that there is a relationship
between the two variables in the population. The hypothesis can also be formulated as a one-tail
test. Given that the assumptions in Session 56’s reading text are satisfied, the following sampling
statistic involving r is distributed as the t distribution with degrees of freedom, df= n-2, when
=0:

Example

For a sample of n=10 loan recipients at a finance company, the correlation coefficient between
household income and amount of outstanding short-term debt is found to be r=+0:50.
a. Test the hypothesis that there is no correlation between these two variables for the entire
population of loan recipients, using the 5 percent level of significance.
b. Interpret the meaning of the correlation coefficient which was computed.
H 0: =0, H1: ≠0
Critical t (df = 10-2=8, =0.05) = ± 2:306

= =1.634

Because the computed t statistic of +1.634 is not in a region of rejection, the null hypothesis
cannot be rejected, and we continue to accept the assumption that there is no relationship
between the two variables. The observed sample relationship can be ascribed to chance at the 5
percent level of significance.
(b) Based on the correlation coefficient of r = 0:50, we might be tempted to conclude that
2
because r = 0.25, approximately 25 percent of the variance in short-term debt is explained
statistically by the amount of household income. This is true for the sample data. However,

7
because the null hypothesis in part (a) above was not rejected, a more appropriate interpretation
for the population is that none of the variance in Y can be assumed to be associated with
changes in X.

Exercise
1. A sample of 25 mayoral campaigns in medium-sized cities with populations between
50,000 and 250,000 showed that the correlation between the percent of the vote received
and the amount spent on the campaign by the candidate was 0.43. At the 0.05
significance level, is there a positive association between the variables?
2. Ethiopian Petroleum Corporation is studying the relationship between the pump price of
gasoline and the number of gallons sold. For a sample of 20 stations last Tuesday, the
correlation was 0.78. At the 0.01 significance level, is the correlation in the population
greater than zero?

Synopsis:

 Hypothesis test of the coefficient of correlation (r) can be made for making effective
generalizations.
 To test a hypothesis that a population correlation is different from 0, we use the following
statistic:

with n − 2 degrees of freedom

Wrap up Discussion Questions:

 How is test of significance of correlation conducted for the population?

M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
No ratings yet
M. Amir Hossain PHD: Course No: Emba 502: Business Mathematics and Statistics
31 pages
Chap013 Test Bank
No ratings yet
Chap013 Test Bank
7 pages
Correlation and Regression
100% (6)
Correlation and Regression
36 pages
Part 4 Forecasting BIS & ABA
No ratings yet
Part 4 Forecasting BIS & ABA
16 pages
Chapter Eight 8 Simple Linear Regression and Correlation: N XY X Y N X X
No ratings yet
Chapter Eight 8 Simple Linear Regression and Correlation: N XY X Y N X X
5 pages
Linear Regression and Correlation: Mcgraw Hill/Irwin
No ratings yet
Linear Regression and Correlation: Mcgraw Hill/Irwin
37 pages
Chapter 13 PowerPoint
No ratings yet
Chapter 13 PowerPoint
36 pages
Simple Linear Regression Guide
No ratings yet
Simple Linear Regression Guide
89 pages
Correlation and Regression Analysis - Updated
No ratings yet
Correlation and Regression Analysis - Updated
49 pages
1755253710regression Analysis
No ratings yet
1755253710regression Analysis
16 pages
Chap 013
No ratings yet
Chap 013
18 pages
K - PPT - Simple Regression and Correlation
No ratings yet
K - PPT - Simple Regression and Correlation
21 pages
GMATH Regression Analysis
No ratings yet
GMATH Regression Analysis
3 pages
Online Correlation and Regression
No ratings yet
Online Correlation and Regression
6 pages
09 - M & S - Corr+Regr
No ratings yet
09 - M & S - Corr+Regr
18 pages
Linear Regression and Correlation: Mcgraw-Hill/Irwin
No ratings yet
Linear Regression and Correlation: Mcgraw-Hill/Irwin
16 pages
Simple Linear Regression and Correlation 568a5ac2ce9b3
No ratings yet
Simple Linear Regression and Correlation 568a5ac2ce9b3
31 pages
Linear Regression and Correlation: Mcgraw-Hill/Irwin
No ratings yet
Linear Regression and Correlation: Mcgraw-Hill/Irwin
49 pages
Chap 013
No ratings yet
Chap 013
36 pages
Correlation and Regression
No ratings yet
Correlation and Regression
12 pages
Linear Regression and Correlation: Mcgraw-Hill/Irwin
No ratings yet
Linear Regression and Correlation: Mcgraw-Hill/Irwin
29 pages
Correlation and Regression Analyses
No ratings yet
Correlation and Regression Analyses
8 pages
Correlation & Regression Guide
100% (1)
Correlation & Regression Guide
19 pages
Correlation and Regression Analysis: BMT 1063 Business Statistics
No ratings yet
Correlation and Regression Analysis: BMT 1063 Business Statistics
42 pages
CHAP5.0 STA404 Bivariate Analysis
No ratings yet
CHAP5.0 STA404 Bivariate Analysis
7 pages
Econometrics For Finance
100% (1)
Econometrics For Finance
54 pages
Stat Chapter 6
No ratings yet
Stat Chapter 6
23 pages
Correlation and Regression
No ratings yet
Correlation and Regression
8 pages
Chapter 13 Correlation and Linear Regression
No ratings yet
Chapter 13 Correlation and Linear Regression
19 pages
Chapter 4 - Correlation and Linear Regression
No ratings yet
Chapter 4 - Correlation and Linear Regression
28 pages
Simple Regression Analysis Guide
No ratings yet
Simple Regression Analysis Guide
58 pages
15 MAY - NR - Correlation and Regression
No ratings yet
15 MAY - NR - Correlation and Regression
10 pages
To Find Correlation and Regression of The Following Data
No ratings yet
To Find Correlation and Regression of The Following Data
5 pages
Chap 13 - Correlation and Linear Regression
No ratings yet
Chap 13 - Correlation and Linear Regression
55 pages
Regression Analysis-Nw
No ratings yet
Regression Analysis-Nw
6 pages
Regression & Correlation 230224 221642
No ratings yet
Regression & Correlation 230224 221642
9 pages
Regression Analysis
No ratings yet
Regression Analysis
6 pages
Chapter 14 Simple Linear Regression .
No ratings yet
Chapter 14 Simple Linear Regression .
39 pages
Chapter-9-Simple Linear Regression & Correlation
No ratings yet
Chapter-9-Simple Linear Regression & Correlation
11 pages
Stat II Chapter 6
No ratings yet
Stat II Chapter 6
11 pages
10-Correlation and Linear Regression
No ratings yet
10-Correlation and Linear Regression
25 pages
Business Stats for Students
No ratings yet
Business Stats for Students
66 pages
Sta404 - Chapter 5 - Bivariate Analysis (Student)
No ratings yet
Sta404 - Chapter 5 - Bivariate Analysis (Student)
27 pages
Handout 5 Correlation and Regression (Recovered)
No ratings yet
Handout 5 Correlation and Regression (Recovered)
6 pages
L5 - Simple Linear Regression Students
No ratings yet
L5 - Simple Linear Regression Students
33 pages
Regression and Correlation
No ratings yet
Regression and Correlation
37 pages
ANALYTICAL TECHNIQUES LU4 Lecture Notes
No ratings yet
ANALYTICAL TECHNIQUES LU4 Lecture Notes
25 pages
Class Note II - 044242
No ratings yet
Class Note II - 044242
19 pages
Correlation-Regression 2019
No ratings yet
Correlation-Regression 2019
76 pages
REGRESSION and CORRELATION ANALYSIS STA 106 - DR. BASHIRU
No ratings yet
REGRESSION and CORRELATION ANALYSIS STA 106 - DR. BASHIRU
10 pages
Stat Chap013 - 2 KPP
No ratings yet
Stat Chap013 - 2 KPP
18 pages
DrSoomro - 2588 - 20292 - 1 - Lecture 9
No ratings yet
DrSoomro - 2588 - 20292 - 1 - Lecture 9
29 pages
Oe Statistics Notes
No ratings yet
Oe Statistics Notes
32 pages
06 - Simple Linear Regression
No ratings yet
06 - Simple Linear Regression
20 pages
Regression Analysis
No ratings yet
Regression Analysis
13 pages
Module 4 Advanced Data Analytics Techniques BRM
No ratings yet
Module 4 Advanced Data Analytics Techniques BRM
29 pages
FM I - Ch4 Work Sheet
No ratings yet
FM I - Ch4 Work Sheet
2 pages
Statistical Inference Assignment
No ratings yet
Statistical Inference Assignment
4 pages
Cost Volume Profit (CVP) Analysis
No ratings yet
Cost Volume Profit (CVP) Analysis
60 pages
Binomial Probability Basics
No ratings yet
Binomial Probability Basics
45 pages
Cost II Chap I-1
No ratings yet
Cost II Chap I-1
52 pages
Budget Planning for Small Businesses
No ratings yet
Budget Planning for Small Businesses
5 pages
What Is R
No ratings yet
What Is R
4 pages
Linear Regression and Correlation - 3232612
No ratings yet
Linear Regression and Correlation - 3232612
6 pages
Chapter Two-Four
No ratings yet
Chapter Two-Four
118 pages
Module 4.1 Point and Interval Estimates
100% (2)
Module 4.1 Point and Interval Estimates
4 pages
BTMMeeting25Nov2020 StatisticalLearning
No ratings yet
BTMMeeting25Nov2020 StatisticalLearning
49 pages
Econometrics Unit 2
No ratings yet
Econometrics Unit 2
21 pages
EC203 Tutorial 12 Time Series 16
No ratings yet
EC203 Tutorial 12 Time Series 16
4 pages
Regularized Linear Regression. Linear Regression Is A Widely Used - by Yahya Ansari - Medium
No ratings yet
Regularized Linear Regression. Linear Regression Is A Widely Used - by Yahya Ansari - Medium
12 pages
Regression Vs Bland-Altman
No ratings yet
Regression Vs Bland-Altman
37 pages
Basic Business Statistics Australian 4Th Edition Berenson Test Bank Full Chapter PDF
100% (25)
Basic Business Statistics Australian 4Th Edition Berenson Test Bank Full Chapter PDF
68 pages
Lampiran 3. Hasil Analisis Data Descriptive Statistics
No ratings yet
Lampiran 3. Hasil Analisis Data Descriptive Statistics
5 pages
Analysis of Covariance-ANCOVA-with Two Groups PDF
No ratings yet
Analysis of Covariance-ANCOVA-with Two Groups PDF
41 pages
Kernel Density Estimation
No ratings yet
Kernel Density Estimation
10 pages
Interaction Effects in MLR Guide
No ratings yet
Interaction Effects in MLR Guide
3 pages
Regression - What Is The Difference Between $/beta - 1$ and $/hat (/beta) - 1$? - Cross Validated
No ratings yet
Regression - What Is The Difference Between $/beta - 1$ and $/hat (/beta) - 1$? - Cross Validated
1 page
Statistical Analysis of Kekerasan, Susut Bobot, and TPT
No ratings yet
Statistical Analysis of Kekerasan, Susut Bobot, and TPT
4 pages
Missing Data
No ratings yet
Missing Data
71 pages
Pattern Recognition
No ratings yet
Pattern Recognition
3 pages
Chapter 4. Violation of Assumptions
No ratings yet
Chapter 4. Violation of Assumptions
51 pages
Introduction To Econometrics
No ratings yet
Introduction To Econometrics
37 pages
Econometrics II
100% (1)
Econometrics II
4 pages
DWJF 58 WF
No ratings yet
DWJF 58 WF
34 pages
Qs 2
No ratings yet
Qs 2
11 pages
Reg Lin
No ratings yet
Reg Lin
73 pages
Alhar Coba Sendiri 14 Jul 20.14
No ratings yet
Alhar Coba Sendiri 14 Jul 20.14
21 pages
Econometrics Assignment Answer Key
No ratings yet
Econometrics Assignment Answer Key
8 pages
SPSS Answers (Chapter 5)
No ratings yet
SPSS Answers (Chapter 5)
6 pages
Instant Download Multilevel Modeling Using R 1st Edition Edition W. Holmes Finch PDF All Chapters
100% (21)
Instant Download Multilevel Modeling Using R 1st Edition Edition W. Holmes Finch PDF All Chapters
81 pages
Problems in Uncertainty With Solutions Physics 1
No ratings yet
Problems in Uncertainty With Solutions Physics 1
13 pages
Trip Generation (Cont.)
No ratings yet
Trip Generation (Cont.)
36 pages

Ch. 8 Measures of Association

Uploaded by

Ch. 8 Measures of Association

Uploaded by

CHAPTER 8: MEASURES OF ASSOCIATION

Session Learning Objectives:

By the end of this session students are expected to:

 Explain the purpose of correlation analysis

Coefficient of Correlation: A measure of strength of linear relationship between 2 variables. It

As r gets closer to (approximates) -1 or 1 strong correlation is implied; 0 indicates no linear

Covariance: indicated direction of relationship; + = positive linear relationship (I,III)

-=negative relationship(II,IV) ; 0= no linear relationship (Even in all quadrants) It indicates the

Dependent variable: variable being predicted; Independent variable: Predictor variable.

Simple Linear Regression Model: explains how y is related to x and E.

a+bx+E=Y E=Error assuming E=0 Regression equation of Y on X.

The owner wants to forecast sales on the basis of advertising expense.

b) Draw a scatter diagram.

c) Determine the sample covariance and interpret

f) Interpret the values of a and b.

g) Draw the estimated regression line on the scatter diagram

h) Estimate sales when $3 million is spent on advertising.

i) Identify the coefficient of regression (byx)

a) Independent variable: Advertisement expense; Dependent variable: Sales

c) Covariance: Cov(x,y)= ∑(x- *(y- )/n-1 =11/4-1 = 11/3=3.67

Sx= = =1.2910 Sy= = =2.9439

e) Estimated regression equation y=a+bx: =1.5+2.2X

b= Sx and Sy are respective standard deviations; r is the correlation coefficient.

 Sample covariance Cov (x,y)=

 Pearson’s sample correlation coefficient r= or

where a is the y intercept & b is the slope or coefficient of regression.

a and b are determined using the appropriate formula

 Identify tools used to measure association?

Topic: Linear Correlation: Testing the Significance of Correlation Coefficient

Session Learning Objectives:

By the end of this session students are expected to:

Reading Assignment Discussion:

 Why is test of significance of correlation coefficient conducted for the population?

with n − 2 degrees of freedom

Wrap up Discussion Questions:

 How is test of significance of correlation conducted for the population?

You might also like