STOR 455/002 - Homework I
January 17, 2025
This homework is due at 11:59 pm (Eastern Time) on Jan 30, 2025, and is to be submitted on Gradescope.
Exercise 1 (13 points)
Consider a bivariate dataset consisting of $n$ observations on 2 numerical variables $x^{(1)}$ and $x^{(2)}$. Let $x_j^{(i)}$ denote the value of the variable $x^{(i)}$ for the $j$-th observation, where $i = 1, 2$ and $j = 1, \ldots, n$.
(a) (2 + 3 + 3 points)
Fix two real numbers $a$ and $b$. Consider the new variable $z := a x^{(1)} + b$. In other words, the value of the variable $z$ for the $j$-th observation is given by $z_j = a x_j^{(1)} + b$. Show that
$$\bar{z}_n = a \bar{x}_n^{(1)} + b, \qquad \operatorname{var}(z) = a^2 \operatorname{var}\big(x^{(1)}\big), \qquad \operatorname{cov}\big(z, x^{(2)}\big) = a \operatorname{cov}\big(x^{(1)}, x^{(2)}\big).$$
Use the definition of sample mean, variance and covariance.
$$\bar{z}_n = \frac{1}{n} \sum_{j=1}^{n} z_j = \frac{1}{n} \sum_{j=1}^{n} \big(a x_j^{(1)} + b\big)$$
$$\bar{z}_n = a \cdot \frac{1}{n} \sum_{j=1}^{n} x_j^{(1)} + \frac{1}{n} \sum_{j=1}^{n} b$$
Simplify using $\bar{x}_n^{(1)} = \frac{1}{n} \sum_{j=1}^{n} x_j^{(1)}$:
$$\bar{z}_n = a \bar{x}_n^{(1)} + b$$
$$\operatorname{var}(z) = \frac{1}{n-1} \sum_{j=1}^{n} \big(z_j - \bar{z}_n\big)^2$$
Substitute $z_j = a x_j^{(1)} + b$ and $\bar{z}_n = a \bar{x}_n^{(1)} + b$:
$$\operatorname{var}(z) = \frac{1}{n-1} \sum_{j=1}^{n} \Big(a x_j^{(1)} + b - \big(a \bar{x}_n^{(1)} + b\big)\Big)^2$$
$$\operatorname{var}(z) = \frac{1}{n-1} \sum_{j=1}^{n} a^2 \big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2$$
$$\operatorname{var}(z) = a^2 \cdot \frac{1}{n-1} \sum_{j=1}^{n} \big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2 = a^2 \operatorname{var}\big(x^{(1)}\big)$$
$$\operatorname{cov}\big(z, x^{(2)}\big) = \frac{1}{n-1} \sum_{j=1}^{n} \big(z_j - \bar{z}_n\big) \big(x_j^{(2)} - \bar{x}_n^{(2)}\big)$$
Substitute $z_j = a x_j^{(1)} + b$ and $\bar{z}_n = a \bar{x}_n^{(1)} + b$:
$$\operatorname{cov}\big(z, x^{(2)}\big) = \frac{1}{n-1} \sum_{j=1}^{n} a \big(x_j^{(1)} - \bar{x}_n^{(1)}\big) \big(x_j^{(2)} - \bar{x}_n^{(2)}\big)$$
$$\operatorname{cov}\big(z, x^{(2)}\big) = a \cdot \frac{1}{n-1} \sum_{j=1}^{n} \big(x_j^{(1)} - \bar{x}_n^{(1)}\big) \big(x_j^{(2)} - \bar{x}_n^{(2)}\big) = a \operatorname{cov}\big(x^{(1)}, x^{(2)}\big)$$
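As a quick numerical sanity check (not required by the problem), the three identities of Part (a) can be verified on arbitrary synthetic data; the sketch below uses Python/numpy with made-up values of $a$, $b$, and the data:

```python
import numpy as np

# Arbitrary synthetic data standing in for x^(1) and x^(2).
rng = np.random.default_rng(0)
x1 = rng.normal(size=50)
x2 = rng.normal(size=50)
a, b = 2.5, -1.0
z = a * x1 + b  # the transformed variable z = a*x^(1) + b

# Sample statistics with the same n-1 convention as the derivation.
assert np.isclose(z.mean(), a * x1.mean() + b)                    # mean identity
assert np.isclose(np.var(z, ddof=1), a**2 * np.var(x1, ddof=1))   # variance identity
assert np.isclose(np.cov(z, x2, ddof=1)[0, 1],
                  a * np.cov(x1, x2, ddof=1)[0, 1])               # covariance identity
```

Note that the additive constant $b$ drops out of both the variance and the covariance, exactly as the algebra shows.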
(b) (5 points)
Show that
$$\operatorname{var}\big(x^{(1)} + x^{(2)}\big) = \operatorname{var}\big(x^{(1)}\big) + \operatorname{var}\big(x^{(2)}\big) + 2 \operatorname{cov}\big(x^{(1)}, x^{(2)}\big).$$
Write down $\operatorname{var}\big(x^{(1)} + x^{(2)}\big)$ as
$$\operatorname{var}\big(x^{(1)} + x^{(2)}\big) = \frac{1}{n-1} \sum_{j=1}^{n} \big(x_j^{(1)} + x_j^{(2)} - \bar{x}_n^{(1)} - \bar{x}_n^{(2)}\big)^2.$$
Expand each summand in the following way:
$$\big(x_j^{(1)} + x_j^{(2)} - \bar{x}_n^{(1)} - \bar{x}_n^{(2)}\big)^2 = \big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2 + \big(x_j^{(2)} - \bar{x}_n^{(2)}\big)^2 + 2 \big(x_j^{(1)} - \bar{x}_n^{(1)}\big) \big(x_j^{(2)} - \bar{x}_n^{(2)}\big).$$
Summing over $j$ and dividing by $n-1$:
$$\operatorname{var}\big(x^{(1)} + x^{(2)}\big) = \frac{1}{n-1} \left[ \sum_{j=1}^{n} \big(x_j^{(1)} - \bar{x}_n^{(1)}\big)^2 + \sum_{j=1}^{n} \big(x_j^{(2)} - \bar{x}_n^{(2)}\big)^2 + 2 \sum_{j=1}^{n} \big(x_j^{(1)} - \bar{x}_n^{(1)}\big) \big(x_j^{(2)} - \bar{x}_n^{(2)}\big) \right]$$
$$\operatorname{var}\big(x^{(1)} + x^{(2)}\big) = \operatorname{var}\big(x^{(1)}\big) + \operatorname{var}\big(x^{(2)}\big) + 2 \operatorname{cov}\big(x^{(1)}, x^{(2)}\big)$$
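The variance-of-a-sum identity of Part (b) can likewise be checked numerically; the snippet below is an illustrative sketch on arbitrary synthetic data, not part of the graded solution:

```python
import numpy as np

# Arbitrary synthetic data for x^(1) and x^(2).
rng = np.random.default_rng(1)
x1 = rng.normal(size=40)
x2 = rng.normal(size=40)

# var(x1 + x2) should equal var(x1) + var(x2) + 2*cov(x1, x2),
# all computed with the n-1 (sample) convention.
lhs = np.var(x1 + x2, ddof=1)
rhs = np.var(x1, ddof=1) + np.var(x2, ddof=1) + 2 * np.cov(x1, x2, ddof=1)[0, 1]
assert np.isclose(lhs, rhs)
```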
Exercise 2 (7 points)
Consider a dataset of $n$ observations on a numerical variable $Y$. The value of $Y$ for the $i$-th observation is $Y_i$, for $i = 1, \ldots, n$. Establish the following sum-of-squares decomposition: for any real number $c$,
$$\sum_{i=1}^{n} (Y_i - c)^2 = \sum_{i=1}^{n} \big(Y_i - \bar{Y}_n\big)^2 + n \big(\bar{Y}_n - c\big)^2,$$
where $\bar{Y}_n$ is the sample mean of the variable $Y$. Observe that the above identity shows the LHS is uniquely minimized at $c = \bar{Y}_n$.
Write $(Y_i - c)$ on the left-hand side as
$$(Y_i - c) = \big(Y_i - \bar{Y}_n\big) + \big(\bar{Y}_n - c\big),$$
and use the expansion $(a+b)^2 = a^2 + b^2 + 2ab$ to expand $(Y_i - c)^2$.
$$(Y_i - c) = \big(Y_i - \bar{Y}_n\big) + \big(\bar{Y}_n - c\big), \qquad (Y_i - c)^2 = \big[\big(Y_i - \bar{Y}_n\big) + \big(\bar{Y}_n - c\big)\big]^2.$$
$$(Y_i - c)^2 = \big(Y_i - \bar{Y}_n\big)^2 + \big(\bar{Y}_n - c\big)^2 + 2 \big(Y_i - \bar{Y}_n\big) \big(\bar{Y}_n - c\big).$$
Summing over $i$:
$$\sum_{i=1}^{n} (Y_i - c)^2 = \sum_{i=1}^{n} \big(Y_i - \bar{Y}_n\big)^2 + \sum_{i=1}^{n} \big(\bar{Y}_n - c\big)^2 + 2 \sum_{i=1}^{n} \big(Y_i - \bar{Y}_n\big) \big(\bar{Y}_n - c\big).$$
Since $\sum_{i=1}^{n} \big(\bar{Y}_n - c\big)^2 = n \big(\bar{Y}_n - c\big)^2$ and $\sum_{i=1}^{n} \big(Y_i - \bar{Y}_n\big) = 0$,
$$\sum_{i=1}^{n} (Y_i - c)^2 = \sum_{i=1}^{n} \big(Y_i - \bar{Y}_n\big)^2 + n \big(\bar{Y}_n - c\big)^2.$$
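As an informal check of the decomposition and of the minimization claim, the identity can be evaluated for several values of $c$ on arbitrary synthetic data; the Python/numpy sketch below is illustrative only:

```python
import numpy as np

# Arbitrary synthetic sample for Y.
rng = np.random.default_rng(2)
Y = rng.normal(loc=3.0, size=30)
Ybar = Y.mean()

def lhs(c):
    # Left-hand side of the decomposition: sum of (Y_i - c)^2.
    return np.sum((Y - c) ** 2)

# The identity holds for every c, including c = Ybar.
for c in [-1.0, 0.0, Ybar, 5.0]:
    rhs = np.sum((Y - Ybar) ** 2) + len(Y) * (Ybar - c) ** 2
    assert np.isclose(lhs(c), rhs)

# Any c != Ybar adds the strictly positive term n*(Ybar - c)^2,
# so the LHS is smallest exactly at c = Ybar.
assert lhs(Ybar) < lhs(Ybar + 0.1)
```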
Exercise 3 (13 points)
Consider the bivariate dataset of $n$ observations
$$\{(Y_1, x_1), \ldots, (Y_n, x_n)\},$$
where $Y$ and $x$ are respectively the response and explanatory variables. We want to fit a simple linear regression model of $Y$ on $x$, given by
$$Y_i = \beta_0 + \beta_1 x_i + \varepsilon_i, \qquad i = 1, \ldots, n.$$
In this exercise we shall prove (without using calculus) the following algebraic expressions for the least-squares estimates:
$$\hat{\beta}_0 = \bar{Y}_n - \hat{\beta}_1 \bar{x}_n, \qquad \hat{\beta}_1 = \frac{r_{xY} s_Y}{s_x}.$$
In other words, the above quantities solve the following minimization problem:
$$\min_{(\beta_0, \beta_1) \in \mathbb{R}^2} \sum_{i=1}^{n} (Y_i - \beta_0 - \beta_1 x_i)^2.$$
(a) (2 points)
Use Exercise 2 to show that for any $(\beta_0, \beta_1)$,
$$\sum_{i=1}^{n} (Y_i - \beta_0 - \beta_1 x_i)^2 = \sum_{i=1}^{n} \big(Y_i - \bar{Y}_n - \beta_1 x_i + \beta_1 \bar{x}_n\big)^2 + n \big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)^2.$$
(b) (2 points)
Deduce from Part (a) that any solution $\big(\hat{\beta}_0, \hat{\beta}_1\big)$ to the minimization problem above satisfies the relation
$$\hat{\beta}_0 = \bar{Y}_n - \hat{\beta}_1 \bar{x}_n.$$
(c) (2 points)
Deduce from Part (b) that the solution $\hat{\beta}_1$ solves the following optimization problem:
$$\min_{\beta_1 \in \mathbb{R}} \sum_{i=1}^{n} \big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2.$$
(d) (4 points)
Show that
$$\frac{1}{n-1} \sum_{i=1}^{n} \big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2 = \beta_1^2 \operatorname{Var}(x) - 2 \beta_1 \operatorname{Cov}(x, Y) + \operatorname{Var}(Y).$$
(e) (3 points)
Use Part (d) to deduce that the problem in Part (c) is solved at
$$\hat{\beta}_1 = \frac{r_{xY} s_Y}{s_x}.$$
Use the fact that for any positive real number $a$ and real numbers $b, c$, the quadratic expression $a t^2 + b t + c$ is minimized at $t = -b/(2a)$.
a.
Apply Exercise 2 to the transformed variable $Z_i := Y_i - \beta_1 x_i$ with the constant $c = \beta_0$. By linearity of the sample mean (Exercise 1), $\bar{Z}_n = \bar{Y}_n - \beta_1 \bar{x}_n$. The sum-of-squares decomposition then gives
$$\sum_{i=1}^{n} (Y_i - \beta_0 - \beta_1 x_i)^2 = \sum_{i=1}^{n} \big(Z_i - \bar{Z}_n\big)^2 + n \big(\bar{Z}_n - \beta_0\big)^2 = \sum_{i=1}^{n} \big(Y_i - \bar{Y}_n - \beta_1 (x_i - \bar{x}_n)\big)^2 + n \big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)^2.$$
This matches the required decomposition. Equivalently, write
$$Y_i - \beta_0 - \beta_1 x_i = \big(Y_i - \bar{Y}_n - \beta_1 (x_i - \bar{x}_n)\big) + \big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)$$
and expand the square; the cross-term vanishes because $\sum_{i=1}^{n} (Y_i - \bar{Y}_n) = 0$ and $\sum_{i=1}^{n} (x_i - \bar{x}_n) = 0$.
b. The first term in Part (a) does not involve $\beta_0$, so the total sum of squares is minimized when the second term $n \big(\bar{Y}_n - \beta_1 \bar{x}_n - \beta_0\big)^2$ is zero. Thus:
$$\bar{Y}_n - \hat{\beta}_1 \bar{x}_n - \hat{\beta}_0 = 0, \qquad \text{i.e.} \qquad \hat{\beta}_0 = \bar{Y}_n - \hat{\beta}_1 \bar{x}_n.$$
c. Substitute $\beta_0 = \bar{Y}_n - \beta_1 \bar{x}_n$ into the original minimization problem:
$$\sum_{i=1}^{n} \big(Y_i - (\bar{Y}_n - \beta_1 \bar{x}_n) - \beta_1 x_i\big)^2 = \sum_{i=1}^{n} \big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2.$$
Hence $\hat{\beta}_1$ minimizes this reduced sum of squares over $\beta_1$.
d.
$$\big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2 = (Y_i - \bar{Y}_n)^2 - 2 \beta_1 (Y_i - \bar{Y}_n)(x_i - \bar{x}_n) + \beta_1^2 (x_i - \bar{x}_n)^2.$$
Summing over $i$ and dividing by $n-1$:
$$\frac{1}{n-1} \sum_{i=1}^{n} \big((Y_i - \bar{Y}_n) - \beta_1 (x_i - \bar{x}_n)\big)^2 = \beta_1^2 \operatorname{Var}(x) - 2 \beta_1 \operatorname{Cov}(x, Y) + \operatorname{Var}(Y).$$
e. $\beta_1^2 \operatorname{Var}(x) - 2 \beta_1 \operatorname{Cov}(x, Y) + \operatorname{Var}(Y)$ is a quadratic in $\beta_1$ with positive leading coefficient $\operatorname{Var}(x)$. Minimizing this quadratic at $t = -b/(2a)$ gives:
$$\hat{\beta}_1 = \frac{\operatorname{Cov}(x, Y)}{\operatorname{Var}(x)} = \frac{r_{xY} s_Y}{s_x},$$
where $r_{xY} = \frac{\operatorname{Cov}(x, Y)}{s_x s_Y}$, $s_x = \sqrt{\operatorname{Var}(x)}$, and $s_Y = \sqrt{\operatorname{Var}(Y)}$.
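The closed-form estimates derived above can be compared against a numerical least-squares fit; the sketch below uses Python/numpy on synthetic data (the data-generating values are arbitrary, loosely inspired by Exercise 4's fitted coefficients):

```python
import numpy as np

# Synthetic bivariate sample (x, Y) with a roughly linear relationship.
rng = np.random.default_rng(3)
x = rng.uniform(40, 60, size=100)
Y = -9.0 + 0.24 * x + rng.normal(scale=0.8, size=100)

# Closed-form estimates from Exercise 3:
#   beta1 = r_xY * s_Y / s_x,  beta0 = Ybar - beta1 * xbar.
sx, sY = np.std(x, ddof=1), np.std(Y, ddof=1)
r = np.corrcoef(x, Y)[0, 1]
beta1 = r * sY / sx
beta0 = Y.mean() - beta1 * x.mean()

# numpy's polynomial least-squares fit returns (slope, intercept) for deg=1.
fit_slope, fit_intercept = np.polyfit(x, Y, deg=1)
assert np.isclose(beta1, fit_slope)
assert np.isclose(beta0, fit_intercept)
```

Agreement between the two confirms that the algebraic formulas coincide with the numerical minimizer of the residual sum of squares.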
Exercise 4 (12 points)
The file nepal.csv contains data from a public health study on Nepalese children. The dataset has 877 observations on 3 variables: sex, height, and weight.
(a) (5 + 5 points)
Fit separate simple linear regression models with weight as the response variable and height as the predictor variable on the sub-datasets of male and female children. Report the estimated coefficients and a scatter plot (along with the fitted line) for each sub-population.
nepal <- read.csv("/Users/macbook/Desktop/STOR455/nepal.csv")
males <- subset(nepal, sex == 1)
females <- subset(nepal, sex == 2)
male_model <- lm(weight~height, data = males)
female_model <- lm(weight~height, data = females)
male_model$coefficients
## (Intercept) height
## -9.0869252 0.2393433
female_model$coefficients
## (Intercept) height
## -8.3712108 0.2281936
plot(males$height, males$weight, main = "Weight vs. Height (Males)", xlab =
"Height (cm)", ylab = "Weight (kg)", pch = 19)
abline(male_model, col = "red", lwd = 2)
plot(females$height, females$weight, main = "Weight vs. Height (Females)",
xlab = "Height (cm)", ylab = "Weight (kg)", pch = 19)
abline(female_model, col = "red", lwd = 2)
(b) (2 points)
Comment on the goodness-of-fit of the simple linear regression models for the two sub-populations. For which sub-population does the model fit the data better?
summary(male_model)
##
## Call:
## lm(formula = weight ~ height, data = males)
##
## Residuals:
## Min 1Q Median 3Q Max
## -2.7192 -0.5064 -0.0510 0.4496 3.2427
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -9.086925 0.288998 -31.44 <2e-16 ***
## height 0.239343 0.003341 71.63 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.8373 on 453 degrees of freedom
## Multiple R-squared: 0.9189, Adjusted R-squared: 0.9187
## F-statistic: 5131 on 1 and 453 DF, p-value: < 2.2e-16
summary(female_model)
##
## Call:
## lm(formula = weight ~ height, data = females)
##
## Residuals:
## Min 1Q Median 3Q Max
## -2.82127 -0.57982 -0.02652 0.50813 3.15115
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -8.371211 0.303580 -27.57 <2e-16 ***
## height 0.228194 0.003551 64.26 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.8916 on 420 degrees of freedom
## Multiple R-squared: 0.9077, Adjusted R-squared: 0.9075
## F-statistic: 4129 on 1 and 420 DF, p-value: < 2.2e-16
Examining the regression outputs for both sub-populations, the male model has a lower residual standard error (0.8373 vs. 0.8916), indicating that its predictions are slightly more precise than the female model's, and a higher R-squared value (0.9189 vs. 0.9077). These results suggest that the simple linear regression model provides a better fit for the male sub-population: it explains a slightly greater proportion of the variance in weight and produces more precise predictions.
Exercise 5 (3 + 2 points)
Consider a bivariate dataset consisting of $n$ observations on 2 numerical variables $x^{(1)}$ and $x^{(2)}$. We first fit a simple linear regression model with $x^{(1)}$ as the response variable and $x^{(2)}$ as the explanatory variable. Let $b_{12}$ denote the (least-squares) estimated slope coefficient. Next we fit a simple linear regression model with $x^{(2)}$ as the response variable and $x^{(1)}$ as the explanatory variable. Let $b_{21}$ denote the (least-squares) estimated slope of this new fitted line. Show that
$$b_{12} b_{21} = \big(r_{x^{(1)} x^{(2)}}\big)^2,$$
where $r_{x^{(1)} x^{(2)}}$ is the sample correlation coefficient between the variables $x^{(1)}$ and $x^{(2)}$. Argue that both estimated slopes have the same sign and that at most one of them can have absolute value greater than 1.
The least-squares slope estimates are:
$$b_{12} = \frac{\operatorname{Cov}\big(x^{(1)}, x^{(2)}\big)}{\operatorname{Var}\big(x^{(2)}\big)}, \qquad b_{21} = \frac{\operatorname{Cov}\big(x^{(1)}, x^{(2)}\big)}{\operatorname{Var}\big(x^{(1)}\big)}.$$
$$b_{12} \cdot b_{21} = \frac{\big[\operatorname{Cov}\big(x^{(1)}, x^{(2)}\big)\big]^2}{\operatorname{Var}\big(x^{(1)}\big) \operatorname{Var}\big(x^{(2)}\big)}.$$
The correlation coefficient is
$$r_{x^{(1)} x^{(2)}} = \frac{\operatorname{Cov}\big(x^{(1)}, x^{(2)}\big)}{s_{x^{(1)}} s_{x^{(2)}}},$$
and since $s_{x^{(1)}} = \sqrt{\operatorname{Var}\big(x^{(1)}\big)}$ and $s_{x^{(2)}} = \sqrt{\operatorname{Var}\big(x^{(2)}\big)}$,
$$\big(r_{x^{(1)} x^{(2)}}\big)^2 = \frac{\big[\operatorname{Cov}\big(x^{(1)}, x^{(2)}\big)\big]^2}{\operatorname{Var}\big(x^{(1)}\big) \operatorname{Var}\big(x^{(2)}\big)}.$$
Thus $b_{12} \cdot b_{21} = \big(r_{x^{(1)} x^{(2)}}\big)^2$.
Both slopes $b_{12}$ and $b_{21}$ share the sign of $\operatorname{Cov}\big(x^{(1)}, x^{(2)}\big)$, which matches the sign of $r_{x^{(1)} x^{(2)}}$. Moreover, since $|b_{12}| \cdot |b_{21}| = \big(r_{x^{(1)} x^{(2)}}\big)^2 \le 1$, if $|b_{12}| > 1$ then $|b_{21}| < 1$; hence at most one of $|b_{12}|$ and $|b_{21}|$ can have absolute value greater than 1.
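These conclusions can be checked numerically on arbitrary correlated data; the Python/numpy sketch below is illustrative only (the data-generating choices are assumptions, not from the exercise):

```python
import numpy as np

# Two correlated synthetic variables standing in for x^(1) and x^(2).
rng = np.random.default_rng(4)
x1 = rng.normal(size=60)
x2 = 0.5 * x1 + rng.normal(size=60)

# Slopes of the two reversed regressions, via the closed forms above.
cov = np.cov(x1, x2, ddof=1)[0, 1]
b12 = cov / np.var(x2, ddof=1)   # slope from regressing x^(1) on x^(2)
b21 = cov / np.var(x1, ddof=1)   # slope from regressing x^(2) on x^(1)
r = np.corrcoef(x1, x2)[0, 1]

assert np.isclose(b12 * b21, r ** 2)                 # product equals r^2
assert np.sign(b12) == np.sign(b21) == np.sign(cov)  # same sign as the covariance
assert min(abs(b12), abs(b21)) <= 1                  # at most one slope exceeds 1
```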