University Statistics Testing Material 2

Uploaded by

kashki67

Department of Mathematics and Statistics

Course STAT2910-2: Statistics for the Sciences


TEST 1: Winter 2021

Solution of Test 1

1. This test has 7 pages including the cover page and 9 questions

2. DO ALL OF THE QUESTIONS

3. Print your name, student number and section number on the first page. Write your last
name on each subsequent page.

4. It is important to be specific about what you are calculating, otherwise it is impossible to
award partial marks for solutions containing errors.

5. MUST ALWAYS SHOW YOUR WORK

Question Maximum mark Actual mark

1 18

2 10

3 10

4 18

5 10

6 10

7 10

8 10

9 4

Total 100 pts

Question 1. 18 pts
The following data represent the final grades of students in first year Statistics. The leaf unit is
1.

Stems Leaves

4 34

5 6799

6 123444

7 5667788

8 1223

9 789

a. (2 pts) Calculate the range of the data.

The range is given by
R = max − min = 99 − 43 = 56 (2 pts)

b. (3 pts) Find the median.


The position of the median is,

position = 0.5(n + 1) = 0.5 × (26 + 1) = 0.5 × 27 = 13.5 (1.5 pts)

Then, the median is

Median = 75 + 0.5(76 − 75) = 75.5 (1.5 pts)

c. (3 pts) Find the average grade.


Let x denote the grades of the students. Then the average is given by
\[ \bar{x} = \frac{1}{n}\sum_{i=1}^{26} x_i = \frac{43 + 44 + \cdots + 98 + 99}{26} = 71.34615 \] (3 pts)

d. (8 pts) Find the first and the third quartiles and the inter-quartile range of the data.
The position of Q1 is,

position = 0.25(n + 1) = 0.25 × (26 + 1) = 0.25 × 27 = 6.75 (1.5 pts)

Then, Q1 is
Q1 = 59 + 0.75(61 − 59) = 60.5 (1.5 pts)
The position of Q3 is,

position = 0.75(n + 1) = 0.75 × (26 + 1) = 0.75 × 27 = 20.25 (1.5 pts)

Then, Q3 is
Q3 = 81 + 0.25(82 − 81) = 81.25 (1.5 pts)
Finally,
IQR = Q3 − Q1 = 81.25 − 60.5 = 20.75 (2 pts)

e. (2 pts) What can you say about the shape of this distribution? Justify.
The mean (71.35) is less than the median (75.5), which indicates that the distribution is skewed left.
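
The (n + 1) position rule used for the median and quartiles above can be sketched in Python. This is a minimal illustration rather than part of the original solution: the helper name `value_at_position` is hypothetical, and the list of grades is read off the stem-and-leaf plot.

```python
import statistics

# Grades read off the stem-and-leaf plot (leaf unit = 1).
grades = [43, 44, 56, 57, 59, 59, 61, 62, 63, 64, 64, 64, 75,
          76, 76, 77, 77, 78, 78, 81, 82, 82, 83, 97, 98, 99]


def value_at_position(sorted_data, position):
    """Interpolate at a 1-based fractional position (hypothetical helper)."""
    lower = int(position)            # whole part of the position
    frac = position - lower          # fractional part of the position
    if lower < 1:                    # position before the first value
        return sorted_data[0]
    if lower >= len(sorted_data):    # position at or beyond the last value
        return sorted_data[-1]
    low_val = sorted_data[lower - 1]
    high_val = sorted_data[lower]
    return low_val + frac * (high_val - low_val)


data = sorted(grades)
n = len(data)
data_range = data[-1] - data[0]
median = value_at_position(data, 0.50 * (n + 1))
q1 = value_at_position(data, 0.25 * (n + 1))
q3 = value_at_position(data, 0.75 * (n + 1))
iqr = q3 - q1
mean = statistics.mean(data)
```

Other quartile conventions exist, so software defaults may give slightly different values for Q1 and Q3 than the position rule used in this course.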

Question 2. 10 pts
The following data represent the number of patients who took the flu-shot for a sample of 10
Ontario doctors: 15, 18, 7, 1, 12, 12, 9, 10, 15, and 22

a. (6 pts) Calculate mean, variance and standard deviation of the sample


Let x denote the number of patients who took the flu shot. Then the average is given by
\[ \bar{x} = \frac{1}{n}\sum_{i=1}^{10} x_i = \frac{15 + 18 + \cdots + 15 + 22}{10} = 12.1 \] (2 pts)

The variance can be obtained as
\[ S^2 = \frac{1}{n-1}\sum_{i=1}^{10} (x_i - \bar{x})^2 = 34.76667 \] (3 pts)

Therefore, the standard deviation is
\[ S = \sqrt{34.76667} = 5.896327 \] (1 pt)

b. (4 pts) Is the measurement 22 an outlier? Justify.

First, we find the z-score of 22:
\[ z = \frac{x - \bar{x}}{s} = \frac{22 - 12.1}{5.896327} = 1.679011 \] (2 pts)
The measurement 22 is not an outlier, since the absolute value of its z-score is less than 2 (2 pts).
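
As a cross-check, the same quantities can be computed with Python's standard `statistics` module. This is only a sketch; the variable names are illustrative, and the |z| < 2 cutoff is the one used in the solution above.

```python
import statistics

# Number of patients per doctor, as listed in the question.
patients = [15, 18, 7, 1, 12, 12, 9, 10, 15, 22]

mean = statistics.mean(patients)           # sample mean
variance = statistics.variance(patients)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(patients)       # square root of the sample variance

z_score = (22 - mean) / std_dev            # z-score of the measurement 22
is_outlier = abs(z_score) >= 2             # cutoff used in the solution
```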

Question 3. 10 pts
A distribution of measurements is relatively mound-shaped with mean 60 and standard deviation
15.

a. (3 pts) What proportion of the measurements will fall between 45 and 75?

Based on the empirical rule, 68% of the measurements will fall within one standard deviation of the mean. Thus A1 = 68% will fall in the interval

[µ − 1σ, µ + 1σ] = [45, 75]

[Figure 1: mound-shaped density curve over the range 15 to 105, with the area between 45 and 75 shaded green (A1 = 68%)]

b. (3 pts) What proportion of the measurements will fall between 30 and 90?

Based on the empirical rule, 95% of the measurements will fall within two standard deviations of the mean. Thus A2 = 95% will fall in the interval

[µ − 2σ, µ + 2σ] = [30, 90]

[Figure 2: the same density curve, with the area between 30 and 90 shaded red (A2 = 95%)]

c. (4 pts) What proportion of the measurements will fall between 75 and 90?

This proportion is given by

A3 = A2/2 − A1/2 = 47.5% − 34% = 13.5%

[Figure 3: the same density curve, with the area between 75 and 90 shaded yellow (A3 = 13.5%)]
Question 4. 18 pts
Research studies suggest that the likelihood a university student chooses to study Engineering
may depend on the person’s gender. The proportions of the total number of cases that fall into
four major/gender categories are shown in the table below:

Education Male (M) Female (F)

Engineering Major (E) 0.20 0.20

Non-engineering 0.25 0.35

a. (4 pts) Find P(E).

P(E) = P(E ∩ M) + P(E ∩ M^c) = 0.20 + 0.20 = 0.40

b. (4 pts) Find P(E ∪ M).

With P(M) = 0.20 + 0.25 = 0.45,

P(E ∪ M) = P(E) + P(M) − P(E ∩ M) = 0.40 + 0.45 − 0.20 = 0.65

c. (4 pts) Find P(E | M^c).

\[ P(E \mid M^c) = \frac{P(E \cap M^c)}{P(M^c)} = \frac{0.20}{0.55} = 0.3636 \]

d. (2 pts) Are the events E and M mutually exclusive? Justify.

Since P(E ∩ M) = 0.2 ≠ 0, we conclude that E and M are not mutually exclusive.

e. (4 pts) Are the events E and M independent? Justify.

Given that

P(E ∩ M) ≠ P(E) · P(M)
0.2 ≠ 0.4 × 0.45
0.2 ≠ 0.18,

we conclude that E and M are not independent.
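
The table lookups in this question can be checked with a few lines of Python. This is a sketch only: the dictionary layout and the label "N" for non-engineering are choices made here, not notation from the test.

```python
# Joint probabilities from the table, keyed by (major, gender).
# "N" is a label chosen here for the non-engineering row.
joint = {
    ("E", "M"): 0.20, ("E", "F"): 0.20,
    ("N", "M"): 0.25, ("N", "F"): 0.35,
}

p_E = joint[("E", "M")] + joint[("E", "F")]      # marginal P(E)
p_M = joint[("E", "M")] + joint[("N", "M")]      # marginal P(M)
p_E_and_M = joint[("E", "M")]                    # P(E and M)
p_E_or_M = p_E + p_M - p_E_and_M                 # addition rule
p_E_given_not_M = joint[("E", "F")] / (1 - p_M)  # conditional on M^c

mutually_exclusive = p_E_and_M == 0
# Independence requires P(E and M) = P(E) * P(M), up to rounding error.
independent = abs(p_E_and_M - p_E * p_M) < 1e-9
```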

Question 5. 10 pts
a. (4 pts) In how many ways can a committee of 4 members be selected from a group of 10
members?

\[ C^{10}_{4} = \frac{10!}{4!\,(10 - 4)!} = 210 \]

b. (6 pts) How many computer passwords of length 4 can you make out of 10 given characters

i. if repetition of characters is allowed?

10^4 = 10000

ii. if no character may be used more than once?

\[ P^{10}_{4} = \frac{10!}{(10 - 4)!} = 5040 \]
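
These three counts can be verified with the standard `math` module (Python 3.8 or later); the variable names are illustrative.

```python
import math

# Order does not matter when choosing committee members: combinations.
committees = math.comb(10, 4)

# Order matters and characters may repeat: 10 choices for each of 4 positions.
passwords_with_repetition = 10 ** 4

# Order matters and characters may not repeat: permutations.
passwords_without_repetition = math.perm(10, 4)
```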

Question 6. 10 pts
A food truck near the University of Windsor specializes in healthy sandwiches. The sandwiches
are prepared fresh each morning and any unsold sandwiches are donated to a food bank. Each
sandwich costs $6 to make and sells for $10. The daily number of sandwiches sold, X, has the
following distribution:

Table 1: Probability distribution function of X

x 0 1 2 3 4

p(x) 0.15 0.05 ? 0.25 0.35

a. (4 pts) Fill in the missing probability in the table above.


To find the missing probability p(2), we solve the following equation:

1 = p(0) + p(1) + p(2) + p(3) + p(4)
1 = 0.15 + 0.05 + p(2) + 0.25 + 0.35
1 = 0.80 + p(2)
⟹ p(2) = 1 − 0.80 = 0.20

b. (6 pts) What is the expected daily profit? (hint: find the mean of X and multiply it by profit
per sandwich)
First we find the mean µ of X:
\[ \mu = \sum_{x=0}^{4} x\,p(x) = 0 \times 0.15 + 1 \times 0.05 + 2 \times 0.2 + 3 \times 0.25 + 4 \times 0.35 = 2.6 \]

Thus, the expected daily profit is µ × (10 − 6) = 2.6 × 4 = 10.4 dollars.
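
The two steps above (complete the distribution, then take the expected value) can be sketched as follows. The code follows the hint in the question and multiplies the mean by the $4 margin per sandwich; the variable names are illustrative.

```python
# Known probabilities from Table 1; p(2) is the missing entry.
pmf = {0: 0.15, 1: 0.05, 3: 0.25, 4: 0.35}

# Probabilities must sum to 1, which determines the missing value.
pmf[2] = 1 - sum(pmf.values())

# Mean of X: sum of x * p(x) over all values of x.
mean_sold = sum(x * p for x, p in pmf.items())

# Following the hint: mean number sold times profit per sandwich ($10 - $6).
expected_profit = mean_sold * (10 - 6)
```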

Question 7. 10 pts
A statistical report suggests that 55% of Canadian families are homeowners. Suppose this is still true and
we pick a random sample of 60 Canadian families.
a. (6 pts) Find the probability that exactly 35 of the 60 will be homeowners.
Let X denote the number of families in the sample who are homeowners. Then X ∼ Binomial(n = 60, p = 0.55),
hence the desired probability is given by
\[ P(X = 35) = C^{60}_{35}\,(0.55)^{35}\,(0.45)^{60-35} = 0.0908 \]

b. (4 pts) Find the expected (mean) number of families in the sample that own their homes.
The mean is given by
µ = np = 60 × 0.55 = 33
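
A short sketch of the binomial calculation, using only the standard library. The function name `binomial_pmf` is a hypothetical helper introduced here for illustration.

```python
import math


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)


p_exactly_35 = binomial_pmf(35, 60, 0.55)   # part a
expected_owners = 60 * 0.55                 # part b: mean np
```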

Question 8. 10 pts
Suppose that typhoons in a given part of the world occur at an average rate of 3 per year.
If typhoon occurrence follows a Poisson distribution,
a. (6 pts) find P(X = 3).
Using the Poisson probability distribution with parameter µ = 3, we obtain
\[ P(X = 3) = \frac{\mu^{3} e^{-\mu}}{3!} = \frac{3^{3} e^{-3}}{3!} = 0.2240418 \]

b. (4 pts) what is the probability that in a given year there will be no typhoons in that part?
Similarly, we obtain
\[ P(X = 0) = \frac{\mu^{0} e^{-\mu}}{0!} = \frac{3^{0} e^{-3}}{0!} = 0.04978707 \]
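
The Poisson probabilities can be reproduced directly from the formula. The helper name `poisson_pmf` is introduced here for illustration only.

```python
import math


def poisson_pmf(k, mu):
    """P(X = k) for X ~ Poisson(mu)."""
    return mu ** k * math.exp(-mu) / math.factorial(k)


p_three = poisson_pmf(3, 3)   # part a
p_none = poisson_pmf(0, 3)    # part b
```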

Question 9. 4 pts
Before the last federal elections took place, a poll for Maclean’s magazine by Abacus reported
that, when the question “Who would be the most likely to stop and help if your car was stranded?”
was asked, 18% and 41% responded S=Andrew Scheer and J=Jagmeet Singh, respectively, while
the rest answered T=Justin Trudeau.

If 20%, 80% and 20% of those answering S, J and T, respectively, would vote for Singh in the
actual elections, what is the probability that any randomly selected individual who is similar to
the population in the poll would vote for Singh?

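Let V be the event that the individual would vote for Singh. Since P(T) = 1 − 0.18 − 0.41 = 0.41, the law of total probability gives
\[
\begin{aligned}
P(V) &= P(V \mid S)\,P(S) + P(V \mid J)\,P(J) + P(V \mid T)\,P(T) \\
     &= (0.2)(0.18) + (0.8)(0.41) + (0.2)(0.41) \\
     &= 0.446
\end{aligned}
\]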
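
The law of total probability used here amounts to a weighted sum, sketched below. The dictionary names are illustrative; the probability for T is taken as the remainder after S and J, as the question states.

```python
# Poll shares for each answer; T gets whatever remains after S and J.
priors = {"S": 0.18, "J": 0.41}
priors["T"] = 1 - sum(priors.values())

# Conditional probability of voting for Singh given each answer.
vote_given = {"S": 0.20, "J": 0.80, "T": 0.20}

# Law of total probability: sum of P(V | group) * P(group).
p_vote = sum(vote_given[group] * priors[group] for group in priors)
```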

Department of Mathematics and Statistics
Course STAT2910-2: Statistics for the Sciences
Quiz 2: Winter 2021

Last Name: Solution First Name: Solution ID: Solution

1. This test has 4 pages including the cover page and 4 questions

2. Stop working at 11:10 AM and start the submission process.

3. Submission must be completed before 11:30 AM. No extra time will be given.

4. DO ALL OF THE QUESTIONS

5. Print your name, student number and section number on the first page. Write your
last name on each subsequent page.

6. It is important to be specific about what you are calculating, otherwise it is impossible
to award partial marks for solutions containing errors.

7. MUST ALWAYS SHOW YOUR WORK

Question Maximum mark Actual mark

1 15

2 15

3 15

4 15

Total 60 pts

Question 1. 15 pts
a. Let Z denote a standard normal variable.

i. (5 pts) Find P (−0.44 < Z < 2.68).

P (−0.44 < Z < 2.68) = P (Z < 2.68) − P (Z < −0.44)


= 0.99632 − 0.32997
= 0.66635

ii. (5 pts) Determine the value of z0 which satisfies P (Z ≥ z0 ) = 0.5285.

P (Z ≥ z0 ) = 0.5285 =⇒ P (Z < z0 ) = 1 − 0.5285


=⇒ P (Z < z0 ) = 0.4715
=⇒ z0 = -0.0715

b. (5 pts) Suppose that 10% of the probability for a certain distribution that is N(µ, σ²)
is below 60 and that 5% is above 90. What are the values of µ and σ?
To find µ and σ, one can solve the following system of equations:
\[
\begin{cases} P(X < 60) = 0.10 \\ P(X > 90) = 0.05 \end{cases}
\implies
\begin{cases} P\left(Z < \frac{60-\mu}{\sigma}\right) = 0.10 \\ P\left(Z < \frac{90-\mu}{\sigma}\right) = 0.95 \end{cases}
\implies
\begin{cases} \frac{60-\mu}{\sigma} = -1.282 \\ \frac{90-\mu}{\sigma} = 1.645 \end{cases}
\implies
\begin{cases} 60 - \mu = -1.282\,\sigma \\ 90 - \mu = 1.645\,\sigma \end{cases}
\]

Subtracting the first equation from the second gives 30 = 2.927σ. These conditions require that µ = 73.14 and σ = 10.25 approximately.
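
The same system can be solved numerically with `statistics.NormalDist` (Python 3.8 or later). This is a sketch: the function name `normal_from_two_quantiles` is hypothetical, and because it uses unrounded z-values instead of table values, the results can differ from the hand calculation in the third decimal place.

```python
from statistics import NormalDist


def normal_from_two_quantiles(x1, p1, x2, p2):
    """Solve x = mu + z * sigma given P(X < x1) = p1 and P(X < x2) = p2."""
    std_normal = NormalDist()
    z1 = std_normal.inv_cdf(p1)
    z2 = std_normal.inv_cdf(p2)
    sigma = (x2 - x1) / (z2 - z1)   # subtract the two equations
    mu = x1 - z1 * sigma            # back-substitute into the first one
    return mu, sigma


# 10% below 60 and 5% above 90, i.e. 95% below 90.
mu, sigma = normal_from_two_quantiles(60, 0.10, 90, 0.95)
```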

Question 2. 15 pts
Assume that risk-of-falls scores for seniors are normally distributed
with mean µ = 67 and standard deviation σ = 10 units.

a. (2 pts) What is the sampling distribution of the sample mean x̄?

x̄ will have a normal distribution.

b. (5 pts) What will be the mean and standard deviation of the sampling distribution of
x̄ if n = 100?
x̄ will have mean \( \mu_{\bar{x}} = \mu = 67 \) and standard deviation given by
\[ \sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} = \frac{10}{\sqrt{100}} = 1 \]

c. (8 pts) What is the probability that a sample of 100 seniors would give a sample mean
that exceeds 70?

\[
\begin{aligned}
P(\bar{X} > 70) &= P\left(\frac{\bar{X} - \mu}{\sigma_{\bar{x}}} > \frac{70 - 67}{1}\right) \\
                &= P(Z > 3) \\
                &= 1 - P(Z \le 3) \\
                &= 1 - 0.9987 = 0.0013
\end{aligned}
\]
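
A brief sketch of the sampling-distribution calculation using `statistics.NormalDist`. The variable names are illustrative; the software value agrees with the table value to four decimal places.

```python
import math
from statistics import NormalDist

mu, sigma, n = 67, 10, 100

# Standard deviation of the sample mean (standard error).
standard_error = sigma / math.sqrt(n)

# Sampling distribution of the sample mean: N(mu, standard_error).
sampling_dist = NormalDist(mu, standard_error)

# Right-tail probability that the sample mean exceeds 70.
p_exceeds_70 = 1 - sampling_dist.cdf(70)
```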

Question 3. 15 pts
A random sample of size n = 80 is selected from a binomial distribution with population
proportion p = 0.25

a. (2 pts) What will be the approximate shape of the sampling distribution of p̂?

For n = 80 and p = 0.25, np = 20 and nq = 60 are both greater than 5, so the
normal approximation applies and the sampling distribution of p̂ will be approximately normal.

b. (5 pts) What will be the mean and standard deviation of the sampling distribution of
p̂?
p̂ will have mean \( \mu_{\hat{p}} = p = 0.25 \) and standard deviation given by
\[ \sigma_{\hat{p}} = \sqrt{\frac{pq}{n}} = \sqrt{\frac{0.25 \times 0.75}{80}} = 0.04841 \]

c. (8 pts) Find the probability that the sample proportion p̂ is between 0.18 and 0.44.
\[
\begin{aligned}
P(0.18 < \hat{p} < 0.44) &= P\left(\frac{0.18 - 0.25}{0.04841} < Z < \frac{0.44 - 0.25}{0.04841}\right) \\
                         &= P(-1.45 < Z < 3.92) \\
                         &\approx 1 - 0.0735 = 0.9265
\end{aligned}
\]

Question 4. 15 pts
a. (8 pts) A binomial experiment yielded 150 successes from 350 trials. Construct a 98% confidence
interval for the population proportion p.
A point estimate of p is the sample proportion p̂ = 150/350 = 0.4285. Then, a 98%
confidence interval for the population proportion p can be obtained as follows:
\[
\hat{p} \pm z_{\alpha/2}\sqrt{\frac{\hat{p}\hat{q}}{n}}
\implies 0.4285 \pm 2.33\sqrt{\frac{(0.4285)(0.5715)}{350}}
\implies 0.4285 \pm 0.0616
\implies 0.3669 < p < 0.4901
\]

b. (7 pts) A random sample of size 16 from the normal distribution N(µ, σ² = 25) yielded
x̄ = 73.8. Find a 95% confidence interval for µ.
A 95% confidence interval for the population mean µ can be obtained as follows:
\[
\bar{x} \pm z_{\alpha/2}\frac{\sigma}{\sqrt{n}}
\implies 73.8 \pm 1.96\,\frac{5}{\sqrt{16}}
\implies 73.8 \pm 2.45
\implies 71.35 < \mu < 76.25
\]
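
Both intervals follow the pattern "estimate ± critical value × standard error", which can be sketched as below. The function names are hypothetical, and the critical values come from `NormalDist().inv_cdf` rather than a table, so the endpoints may differ from the hand calculation in the fourth decimal place.

```python
import math
from statistics import NormalDist


def z_critical(confidence):
    """Two-sided critical value z_{alpha/2} for the given confidence level."""
    return NormalDist().inv_cdf(1 - (1 - confidence) / 2)


def proportion_ci(successes, n, confidence):
    """Large-sample confidence interval for a population proportion."""
    p_hat = successes / n
    margin = z_critical(confidence) * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin


def mean_ci_known_sigma(x_bar, sigma, n, confidence):
    """Confidence interval for a mean when sigma is known."""
    margin = z_critical(confidence) * sigma / math.sqrt(n)
    return x_bar - margin, x_bar + margin


p_low, p_high = proportion_ci(150, 350, 0.98)            # part a
m_low, m_high = mean_ci_known_sigma(73.8, 5, 16, 0.95)   # part b
```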

Department of Mathematics and Statistics
Course STAT2910-2: Statistics for the Sciences
Quiz 3: Winter 2021

Last Name: Solution First Name: ID:

1. This test has 4 pages including the cover page and 3 questions

2. Stop working at 11:10 AM and start the submission process.

3. Submission must be completed before 11:30 AM.

4. Late submission will not be marked

5. It is important to be specific about what you are calculating, otherwise it is impossible
to award partial marks for solutions containing errors.

6. Print your name on the blank line and handwritten signature below the statement:
I, , verify that the submitted work is my own
original work and adheres to all my Academic Rights and Responsibilities
as outlined in the Student Code of Conduct.
Signature: Date:

Question Maximum mark Actual mark

1 14

2 11

3 15

Total 40 pts

Question 1. 14 pts
A machine shop that manufactures toggle levers has both a day and a night shift. A
toggle lever is defective if a standard nut cannot be screwed onto the threads. Let p1
and p2 be the proportions of defective levers among those manufactured by the day and
night shifts, respectively. Two random samples, each of 1000 levers, yield x1 = 37
and x2 = 53 defective levers for the day and night shifts, respectively. Is there enough
evidence to claim a difference between p1 and p2? Use α = 0.05.

a. (1.5 pts) Null hypothesis:


H0 : p1 − p2 = 0.

b. (1.5 pts) Alternate hypothesis:

Ha : p1 − p2 ≠ 0.

c. (5 pts) Test Statistic:

Point estimates of p1 and p2 are given by p̂1 = x1/1000 = 37/1000 = 0.037 and
p̂2 = x2/1000 = 53/1000 = 0.053, respectively. The pooled estimate is
\[ \hat{p} = \frac{x_1 + x_2}{n_1 + n_2} = \frac{37 + 53}{1000 + 1000} = \frac{90}{2000} = 0.045. \]
Therefore, the test statistic is
\[
z = \frac{\hat{p}_1 - \hat{p}_2}{\sqrt{\hat{p}\hat{q}\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}}
  = \frac{0.037 - 0.053}{\sqrt{(0.045)(0.955)\left(\frac{2}{1000}\right)}} = -1.725
\]

d. Decision rule:

i. (2 pts) Rejection region:


Rejection region for two-tailed z−test with α = 0.05
Reject H0 if z ≤ −zα/2 = −1.96 or z ≥ zα/2 = 1.96
ii. (2 pts) P-value:

pv = P (z > 1.725) + P (z < −1.725) = 0.0845

e. (2 pts) Conclusion:
The p-value is greater than 0.05, also z = −1.725 > −1.96, thus we fail to reject H0
and conclude that there is not enough evidence to claim that there is a significant
difference between the two proportions.
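
The pooled two-proportion z-test can be sketched as follows. The function name `two_proportion_z_test` is hypothetical; the p-value is computed from `statistics.NormalDist` rather than a table, so it may differ slightly from the hand value in the fourth decimal place.

```python
import math
from statistics import NormalDist


def two_proportion_z_test(x1, n1, x2, n2):
    """Two-tailed z-test for H0: p1 - p2 = 0 using the pooled estimate."""
    p1_hat, p2_hat = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    std_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1_hat - p2_hat) / std_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


z, p_value = two_proportion_z_test(37, 1000, 53, 1000)
reject_h0 = p_value < 0.05   # decision at alpha = 0.05
```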

Question 2. 11 pts
A physician claims that joggers’ maximal volume oxygen uptake is greater than the average
of all adults. A random sample of 15 joggers has a mean of 40.6 milliliters per
kilogram (ml/kg) and a standard deviation of 6 ml/kg. If the average of all adults is 36.7
ml/kg, is there enough evidence to support the physician’s claim at α = 0.05? Assume
the variable is normally distributed.

a. (1.5 pts) Null hypothesis:


H0 : µ = 36.7

b. (1.5 pts) Alternate hypothesis:


Ha : µ > 36.7

c. (4 pts) Test Statistic:

\[ t = \frac{\bar{x} - \mu_0}{s/\sqrt{n}} = \frac{40.6 - 36.7}{6/\sqrt{15}} = 2.517 \]

d. (2 pts) Decision rule:

Reject H0 if \( t \ge t_{\alpha,\,n-1} = t_{0.05,\,14} = 1.761 \). Since t = 2.517 > 1.761, H0 is rejected.
To bracket the P-value, look across the row with d.f. = 14 in the t-table: 2.517
falls between 2.145 and 2.624, which correspond to right-tail areas of 0.025 and 0.01.
Since this is a right-tailed test, 0.01 < P-value < 0.025.

e. (2 pts) Conclusion:
There is enough evidence to support the claim that the joggers’ maximal volume
oxygen uptake is greater than 36.7 ml/kg.

Question 3. 15 pts
A random sample of nine local banks shows their deposits (in billions of dollars) 3 years
ago and their deposits (in billions of dollars) today. Can it be concluded that the average
deposit for the banks is greater today than it was 3 years ago? Use α = 0.05. Assume
the variable is normally distributed.

Bank 1 2 3 4 5 6 7 8 9

3 years ago 11.42 8.41 3.98 7.37 2.28 1.10 1.007 0.9 1.35

Today 16.69 9.44 6.53 5.58 2.92 1.88 1.78 1.5 1.22

a. (1.5 pts) Null hypothesis:


H0 : µd = 0

b. (1.5 pts) Alternate hypothesis:


Ha : µd < 0

c. (8 pts) Test Statistic:

i. Make a table

3 years ago (x1 ) Today (x2 ) d = x1 − x2 d2 = (x1 − x2 )2

11.42 16.69 -5.27 27.7729

8.41 9.44 -1.03 1.0609

3.98 6.53 -2.55 6.5025

7.37 5.58 1.79 3.2041

2.28 2.92 -0.64 0.4096

1.10 1.88 -0.78 0.6084

1.007 1.78 -0.773 0.5975

0.90 1.50 -0.6 0.3600

1.35 1.22 0.13 0.0169

\( \sum d_i = -9.723 \), \( \sum d_i^2 = 40.5328 \)

ii. Find the standard deviation of the differences.
\[
s_d = \sqrt{\frac{\sum d_i^2 - \frac{\left(\sum d_i\right)^2}{n}}{n - 1}}
    = \sqrt{\frac{40.5328 - \frac{(-9.723)^2}{9}}{9 - 1}} = 1.937
\]

iii. Find the t-test statistic. With \( \bar{d} = \sum d_i / n = -9.723/9 = -1.080 \),
\[ t = \frac{\bar{d} - \mu_d}{s_d/\sqrt{n}} = \frac{-1.080 - 0}{1.937/\sqrt{9}} = -1.673. \]

d. (2 pts) Decision rule:

Reject H0 if \( t < -t_{\alpha,\,n-1} = -t_{0.05,\,8} = -1.860 \).
Do not reject the null hypothesis, since the test value, t = −1.673, is greater than the
critical value, −1.860.

e. (2 pts) Conclusion:
There is not enough evidence to show that the deposits have increased over the last 3
years.
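
The paired-difference calculation can be reproduced with the standard library. This is a sketch: the variable names are illustrative, and the critical value 1.860 is taken from the t-table (d.f. = 8, α = 0.05) because the standard library has no t-distribution.

```python
import math
import statistics

# Deposits in billions of dollars, as given in the table.
three_years_ago = [11.42, 8.41, 3.98, 7.37, 2.28, 1.10, 1.007, 0.90, 1.35]
today = [16.69, 9.44, 6.53, 5.58, 2.92, 1.88, 1.78, 1.50, 1.22]

# Differences d = x1 - x2 (3 years ago minus today).
diffs = [before - after for before, after in zip(three_years_ago, today)]
n = len(diffs)

d_bar = statistics.mean(diffs)    # mean difference
s_d = statistics.stdev(diffs)     # sample standard deviation of differences
t_stat = (d_bar - 0) / (s_d / math.sqrt(n))

T_CRITICAL = 1.860                # table value t_{0.05, 8}
reject_h0 = t_stat < -T_CRITICAL  # left-tailed test, Ha: mu_d < 0
```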

UNIVERSITY OF WINDSOR
DEPARTMENT OF MATHEMATICS AND STATISTICS
STAT 2910-02; Winter 2021; Quiz 1
Maximum Time: 40 Minutes

LAST NAME:_______________________FIRST NAME:_______________________


I.D.#:_________________

1. This test has 4 pages including the cover page and 4 questions.
2. DO ALL OF THE QUESTIONS
3. Print your name on the first page.
4. It is important to be specific about what you are calculating, otherwise it is
impossible to award partial marks for solutions containing errors.
5. MUST ALWAYS SHOW YOUR WORK

Do not write anything on this sheet below this line.

Question Maximum mark Actual mark

1 5

2 24

3 6

4 5

Total 40

[5] 1. Identify whether each of the following statements is true (T) or false (F).
a) Rating of a newly elected politician (excellent, good, fair, poor) is a qualitative variable. T
b) A person’s body temperature is a discrete variable. F
c) The relative frequency of a class is computed by dividing the frequency of the class by the
total number of observations in the data set. T
d) A market share of 78.5 percent would be represented in a pie chart by a slice with a central
angle of 141.3 degrees. F
e) In a symmetric distribution the mean and median are equal. T

[24] 2. The maximum daily temperatures (in °C) recorded during sixteen days of March in a Canadian city
are given in the following stem-and-leaf plot:

Stem Leaf
3 9
4 55566799 Leaf unit = 0.1

5 000125
6 0

[2] a. What is the range of these temperatures?

6.0-3.9=2.1

[2] b. What was the mode of these temperatures?

4.5 and 5.0

[3] c. What is the median of these temperatures?

Position: 0.5(n + 1) = 0.5 × 17 = 8.5

Median: (4.9 + 4.9)/2 = 4.9

[2+2+1] d. Calculate the interquartile range (IQR).

Position of Q1: 0.25 × (n + 1) = 0.25 × 17 = 4.25

Q1 = 4.5 + 0.25 × (4.6 − 4.5) = 4.525

Position of Q3: 0.75 × (n + 1) = 0.75 × 17 = 12.75

Q3 = 5.0 + 0.75 × (5.1 − 5.0) = 5.075

Then
IQR = Q3 − Q1 = 5.075 − 4.525 = 0.55

[3] e. What is the average temperature?

\[ \bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{77.9}{16} = 4.87 \]

[4] f. What is the sample standard deviation of the temperatures?

\[ S^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1} = 0.23 \]

and S = √0.23 = 0.48.

[3] g. Is the lowest temperature an outlier?

\[ z = \frac{3.9 - 4.87}{0.48} = -2.02 \]
Since |z| < 3, the lowest temperature is not an outlier.

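[2] h. Comment on the shape of the distribution.

Since the mean (4.87) is less than the median (4.9), the distribution is left skewed.
Alternatively, since the two values are very close, the distribution is approximately symmetric.
Both answers are accepted.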

[6] 3. Assume the hourly dollar amount of food sold by a McDonald's franchise follows an approximately
mound-shaped distribution with a mean sales level of $500 per hour and a standard deviation of $80 per
hour. In what percentage of its working hours does this franchise sell between $420 and $660
worth of food per hour? Show your calculations and make a graph.

x̄ ± s = 500 ± 80, i.e. 420 to 580, contains 68% of the data, so 34% lies between 420 and 500.

x̄ ± 2s = 500 ± 2 × 80, i.e. 340 to 660, contains 95% of the data, so 47.5% lies between 500 and 660.

Hence (34 + 47.5)% = 81.5%.
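
For comparison, the empirical-rule answer can be set against the value from an exactly normal model. This is a sketch under the added assumption that sales are exactly normal, which the question does not state (it says only "approximately mound-shaped"); the variable names are illustrative.

```python
from statistics import NormalDist

mean, sd = 500, 80

# Empirical rule: 34% within one sd below the mean,
# 47.5% within two sd above the mean.
empirical_percent = 34.0 + 47.5

# Value under an exactly normal model (an assumption made here for comparison).
sales = NormalDist(mean, sd)
normal_percent = 100 * (sales.cdf(660) - sales.cdf(420))
```

The two values differ only in the first decimal place, which is why the empirical rule is adequate for this kind of question.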

[5] 4. Three city council members are to be selected (one as President, one as Treasurer and one as
Secretary) from a total of 15 members to form a subcommittee to study the city’s traffic problems.
[3] a) How many different subcommittees are possible?

\[ P^{15}_{3} = \frac{15!}{(15 - 3)!} = \frac{15!}{12!} = 2730 \]

[2] b) If all council members have an equal chance of being selected, what is the probability
that members Smith (as President), Ronald (as Treasurer) and Jones (as Secretary) are all
selected?

\[ \frac{1}{P^{15}_{3}} = \frac{1}{2730} \]
Department of Mathematics and Statistics
Course STAT2910-1: Statistics for the Sciences
TEST 2: Winter 2021; Working time: 1h10 min

Last Name: First Name: ID:

1. This test has 8 pages including the cover page and 8 questions

2. Stop working at 11:10 AM and start the submission process.

3. Submission must be completed before 11:30 AM.

4. Late submission will be penalized by minus 20 points

5. It is important to be specific about what you are calculating, otherwise it is impossible
to award partial marks for solutions containing errors.

6. Print your name on the blank line and handwritten signature below the statement:
I, , verify that the submitted work is my own
original work and adheres to all my Academic Rights and Responsibilities
as outlined in the Student Code of Conduct.
Signature: Date:

Question Maximum mark Actual mark

1 10

2 10

3 10

4 30

5 10

6 5

7 10

8 15

Total 100 pts

Question 1. 10 pts
a. Let X be a normally distributed random variable with mean µ = 2 and standard
deviation σ = 5.

i. (2.5 pts) Find P(−8 < X < 1).

\[
\begin{aligned}
P(-8 < X < 1) &= P\left(\frac{-8 - 2}{5} < Z < \frac{1 - 2}{5}\right) \\
              &= P(-2 < Z < -0.2) \\
              &= P(Z < -0.2) - P(Z < -2) \\
              &= 0.4207 - 0.0228 \\
              &= 0.3979
\end{aligned}
\]

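ii. (2.5 pts) Find x0 such that P(X < x0) = 0.121.

\[
\begin{aligned}
P(X \le x_0) = 0.121 &\implies P\left(Z \le \frac{x_0 - 2}{5}\right) = 0.121 \\
                     &\implies \frac{x_0 - 2}{5} = -1.17 \\
                     &\implies x_0 = 5 \times (-1.17) + 2 \\
                     &\implies x_0 = -3.85
\end{aligned}
\]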


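b. (5 pts) Let X be N(µ, σ²) so that P(X < 89) = 0.90 and P(X < 94) = 0.95. Find µ
and σ.
To find µ and σ, one can solve the following system of equations:
\[
\begin{cases} P(X < 89) = 0.90 \\ P(X < 94) = 0.95 \end{cases}
\implies
\begin{cases} P\left(Z < \frac{89-\mu}{\sigma}\right) = 0.90 \\ P\left(Z < \frac{94-\mu}{\sigma}\right) = 0.95 \end{cases}
\implies
\begin{cases} \frac{89-\mu}{\sigma} = 1.282 \\ \frac{94-\mu}{\sigma} = 1.645 \end{cases}
\implies
\begin{cases} 89 - \mu = 1.282\,\sigma \\ 94 - \mu = 1.645\,\sigma \end{cases}
\]

Subtracting the first equation from the second gives 5 = 0.363σ. These conditions require that µ = 71.341 and σ = 13.7741 approximately.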

Question 2. 10 pts
The lifetime X of a particular type of light bulb is normally distributed with a mean of
1100 days and a standard deviation of 80 days. The manufacturer randomly selects 400
bulbs of this type and ships them to a tire retailer.
a. (3 pts) What are the mean and standard deviation of the sampling distribution of X̄?

• the mean of X̄ is µ = 1100;
• the standard deviation of X̄ is σ/√n = 80/√400 = 4.

b. (7 pts) What is the probability that the average lifetime of these 400 bulbs is between
1097 and 1104 days?
\[
\begin{aligned}
P(1097 \le \bar{X} \le 1104) &= P\left(\frac{1097 - 1100}{\sigma/\sqrt{n}} \le Z \le \frac{1104 - 1100}{\sigma/\sqrt{n}}\right) \\
                             &= P(-0.75 \le Z \le 1) \\
                             &= P(Z \le 1) - P(Z \le -0.75) \\
                             &= 0.8413 - 0.2266 \\
                             &= 0.6147
\end{aligned}
\]

Question 3. 10 pts
The prevalence of cannabis use among Canadians 15 years of age and older was 9.1% in
2011(according to Statistics Canada).

a. (4 pts) If we were to sample n = 125 individuals, what would be the approximate
distribution of the sample proportion p̂? Give the mean, the standard deviation and
the name of the approximate distribution of such a sample proportion, as well as the
name of the theorem used to justify your claims.

• (3 pts) p̂ will be approximately normal with \( \mu_{\hat{p}} = 0.091 \) and
\[ \sigma_{\hat{p}} = \sqrt{\frac{pq}{n}} = \sqrt{\frac{0.091(0.909)}{125}} = 0.02572 \]

• (1 pt) The central limit theorem (CLT) justifies this approximation.

b. (6 pts) If we assume that the above proportion is still true in 2021, how likely is it
to find a random sample of n = 125 individuals that results in a sample proportion
exceeding 11%?
\[
\begin{aligned}
P(\hat{p} > 0.11) &= P\left(Z > \frac{0.11 - 0.091}{\sqrt{\frac{0.091(0.909)}{125}}}\right) \\
                  &= P(Z > 0.7386) \\
                  &= 1 - P(Z \le 0.7386) \\
                  &= 1 - 0.7699 \\
                  &= 0.2301
\end{aligned}
\]

Question 4. 30 pts
One of the major costs involved in planning a summer vacation is the cost of lodging.
Even within a particular chain of hotels, costs can vary substantially depending on the
type of room and the amenities offered. Suppose that we randomly select 50 billing
statements from each of the computer databases of the Marriott, Westin, and Doubletree
hotel chains, and record the nightly room rates.

                                  Marriott   Westin   Doubletree

Sample average ($)                150        165      125

Sample standard deviation ($)     17.2       22.5     12.8

a. (5 pts) Find a point estimate for the average room rate for the Marriott hotel chain.
Calculate the 95% margin of error.
• A point estimate of µ is x̄ = 150.
• The 95% margin of error in estimation with s = 17.2 and n = 50 is
\[ 1.96\,SE = 1.96\,\frac{\sigma}{\sqrt{n}} \approx 1.96\,\frac{s}{\sqrt{n}} = 1.96\,\frac{17.2}{\sqrt{50}} = 4.7676 \]

b. (10 pts) Find a 95% confidence interval for the difference in the average room rates for
the Marriott and the Doubletree hotel chains.
The 95% confidence interval for µ1 − µ2 is approximately
\[
(\bar{x}_1 - \bar{x}_2) \pm 1.96\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}
= (150 - 125) \pm 1.96\sqrt{\frac{17.2^2}{50} + \frac{12.8^2}{50}} = 25 \pm 5.94
\]
or
19.06 < µ1 − µ2 < 30.94.

c. (10 pts) Find a 99% confidence interval for the difference in the average room rates for
the Westin and the Doubletree hotel chains.
The 99% confidence interval for µ1 − µ2 is approximately
\[
(\bar{x}_1 - \bar{x}_2) \pm 2.58\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}
= (165 - 125) \pm 2.58\sqrt{\frac{22.5^2}{50} + \frac{12.8^2}{50}} = 40 \pm 9.45
\]
or
30.55 < µ1 − µ2 < 49.45.

d. (5 pts) Do the data indicate a difference in the average room rates between the Marriott
and the Doubletree chains? Between the Westin and the Doubletree chains?
Since neither confidence interval contains the value µ1 − µ2 = 0 it is not likely that the
means are equal. You should conclude that there is a difference in the average room
rates for the Marriott and Doubletree and also for the Westin and the Doubletree
chains.
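
The two large-sample intervals for a difference in means can be recomputed with a short helper. This is a sketch: the function name `diff_means_ci` is hypothetical, and the critical values 1.96 and 2.58 are passed in as used in the solution.

```python
import math


def diff_means_ci(x1, s1, n1, x2, s2, n2, z):
    """Large-sample confidence interval for mu1 - mu2 with critical value z."""
    margin = z * math.sqrt(s1 ** 2 / n1 + s2 ** 2 / n2)
    diff = x1 - x2
    return diff - margin, diff + margin


# Marriott vs Doubletree at 95%, Westin vs Doubletree at 99%.
marriott_vs_doubletree = diff_means_ci(150, 17.2, 50, 125, 12.8, 50, z=1.96)
westin_vs_doubletree = diff_means_ci(165, 22.5, 50, 125, 12.8, 50, z=2.58)

# An interval that excludes 0 points to a difference in average rates.
marriott_differs = not (marriott_vs_doubletree[0] <= 0 <= marriott_vs_doubletree[1])
westin_differs = not (westin_vs_doubletree[0] <= 0 <= westin_vs_doubletree[1])
```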

Question 5. 10 pts
A manufacturing plant has two assembly lines for producing glass bottles. The plant
manager was concerned about whether the proportion of defective bottles differs between
the two lines. Two independent random samples were selected and the following summary
data computed:

Line 1 Line 2

Number of defectives 10 13

Sample size 100 100

a. (8 pts) Find a 95% confidence interval for the true difference in proportion of defective
bottles produced by the two assembly lines.
A 95% confidence interval for the difference between the two proportions is
\[
(\hat{p}_1 - \hat{p}_2) \pm 1.96\sqrt{\frac{\hat{p}_1\hat{q}_1}{n_1} + \frac{\hat{p}_2\hat{q}_2}{n_2}}
= (0.10 - 0.13) \pm 1.96\sqrt{\frac{0.10(0.90)}{100} + \frac{0.13(0.87)}{100}} = -0.03 \pm 0.0883
\]
or
−0.118 < p1 − p2 < 0.058.

b. (2 pts) Based on this interval, can one conclude there is a difference in proportion of
defective bottles produced by the two lines? Justify your answer.
Since the value p1 − p2 = 0 is in the confidence interval, it is likely that p1 = p2 . You
should not conclude that there is a difference in the proportion of defective bottles
between Line 1 and Line 2.

Question 6. 5 pts
A process control engineer wishes to estimate the true proportion of defective computer
chips by using a 90% confidence interval whose half width is no more than B = 0.0001.
How many observations does the engineer need to include in the sample to achieve his
goal, if based on his subjective judgment, the true proportion is about 0.01?
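The half width of the desired confidence interval is
\[ z_{\alpha/2}\sqrt{\frac{pq}{n}}, \quad \text{with } z_{\alpha/2} = z_{0.1/2} = z_{0.05} = 1.645 \text{ and } p = 0.01. \]

Thus, we need
\[
\begin{aligned}
1.645\sqrt{\frac{pq}{n}} \le B &\implies 1.645\sqrt{\frac{0.01 \times 0.99}{n}} \le 0.0001 \\
                               &\implies n \ge \frac{(1.645)^2 (0.01 \times 0.99)}{(0.0001)^2} \\
                               &\implies n \ge 2678965
\end{aligned}
\]
Then, the engineer needs at least n = 2678965 observations.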
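
The sample-size inequality can be evaluated directly, rounding up to the next whole observation. The function name `sample_size_for_proportion` is hypothetical, and z = 1.645 is the table value used above.

```python
import math


def sample_size_for_proportion(p, half_width, z):
    """Smallest n with z * sqrt(p * (1 - p) / n) <= half_width."""
    return math.ceil(z ** 2 * p * (1 - p) / half_width ** 2)


n_required = sample_size_for_proportion(p=0.01, half_width=0.0001, z=1.645)
```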

Question 7. 10 pts
Social media has become extremely popular. According to the data provided by comScore
via a new mobile measurement report, the average time per month spent by users on
Twitter is 170 minutes. Suppose that a random sample is taken of 120 users whose
average time spent per month on Twitter is 173 minutes and standard deviation is 15
minutes. Do the data provide sufficient evidence to reject comScore’s claim? Test
using α = 0.05.
a. (1.5 pts) Null hypothesis:
H0 : µ = 170
b. (1.5 pts) Alternate hypothesis:
Ha : µ ≠ 170
c. (3 pts) Test statistic:
\[ z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}} \approx \frac{\bar{x} - \mu_0}{s/\sqrt{n}} = \frac{173 - 170}{15/\sqrt{120}} = 2.19 \]

d. (2 pts) Decision rule:

• Rejection region: reject H0 if \( |z| > z_{\alpha/2} = z_{0.05/2} = 1.96 \). Here z = 2.19 > 1.96, so z falls in the rejection region.
• The p-value:
P(z < −2.19) + P(z > 2.19) = 2(1 − P(z < 2.19))
= 2(1 − 0.9857) = 0.0286.

e. (2 pts) Conclusion: With α = 0.05, the p−value is less than α and H0 is rejected.
There is sufficient evidence to indicate that the average time per month spent by users
on Twitter is different from 170 minutes.

Question 8. 15 pts
Assume that IQ scores for a certain population are approximately N(µ, σ² = 100). To
test H0 : µ = 110 against the one-tailed alternative hypothesis Ha : µ > 110, we take a
random sample of size n = 16 from this population and observe x̄ = 113.5.
a. (9 pts) Do we accept or reject H0 at the 5% significance level?

• The test statistic is
\[ z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}} = \frac{113.5 - 110}{10/\sqrt{16}} = 1.4 \]
• Decision rule: Since \( z = 1.4 < z_{\alpha} = z_{0.05} = 1.645 \), we do not reject H0.

b. (3 pts) Do we accept or reject H0 at the 10% significance level?

• Decision rule: Since \( z = 1.4 > z_{\alpha} = z_{0.10} = 1.2815 \), H0 is rejected.

c. (3 pts) What is the p-value of this test?

• The p-value is given by
p = P(Z > z) = P(Z > 1.4) = 1 − P(Z ≤ 1.4) = 1 − 0.9192 = 0.0808
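
The three parts of this question reduce to one test statistic and one p-value, which can be sketched as below. The function name `right_tailed_z_test` is hypothetical; comparing the p-value with α is equivalent to comparing z with the critical value.

```python
import math
from statistics import NormalDist


def right_tailed_z_test(x_bar, mu0, sigma, n):
    """z statistic and p-value for H0: mu = mu0 against Ha: mu > mu0."""
    z = (x_bar - mu0) / (sigma / math.sqrt(n))
    p_value = 1 - NormalDist().cdf(z)
    return z, p_value


z, p_value = right_tailed_z_test(x_bar=113.5, mu0=110, sigma=10, n=16)
reject_at_5_percent = p_value < 0.05
reject_at_10_percent = p_value < 0.10
```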
