Question 1
Find the value 𝑘, 𝑘 < 0 , such that the area under the standard normal curve between 𝑘 and 0 is
0.27. If the value of 𝑘 is to be found using Microsoft Excel, which of the following
commands/functions can be used?
a) = 𝑁𝑂𝑅𝑀. 𝑆. 𝐼𝑁𝑉(0,27)
b) = 𝑁𝑂𝑅𝑀. 𝐼𝑁𝑉(0,23)
c) = 𝑁𝑂𝑅𝑀. 𝐼𝑁𝑉(0,27)
d) = 𝑁𝑂𝑅𝑀. 𝑆. 𝐼𝑁𝑉(0,23)
e) = 1 − 𝑁𝑂𝑅𝑀. 𝑆. 𝐼𝑁𝑉(0,27)
Question 2
The diagram below shows the probability distribution curves of three normal random variables: 𝑋1
, 𝑋2 and 𝑋3 .
Which one of the following statements is true?
a) The mean of 𝑋3 is smaller than the means of 𝑋1 and 𝑋2.
b) 𝑋2 has the largest variance.
c) The median of 𝑋1 is the same as that of 𝑋2.
d) All three random variables are only defined for values from –2.5 to 4.
e) The variances of 𝑋1 and 𝑋2 both equal 1.
Questions 3 to 7 are based on the following:
The weights of young giraffes are known to be normally distributed with a mean of 284 kg and a
standard deviation of 34 kg.
Question 3
What is the probability that a randomly chosen young giraffe will weigh less than 270 kg?
a) 0.3409
b) 0.5882
c) 0.4118
d) 0.6591
e) None of the above
Question 4
Which Excel function would you use to find the probability in Question 11 above?
a) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 270 ; 284 ; 34 ; 𝑇𝑅𝑈𝐸 )
b) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 270 ; 284 ; 34 ; 𝐹𝐴𝐿𝑆𝐸 )
c) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 270 ; 284 ; 34 ; 𝑇𝑅𝑈𝐸 )
d) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 284 ; 270 ; 34 ; 𝑇𝑅𝑈𝐸 )
e) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 270 ; 284 ; 34 ; 𝐹𝐴𝐿𝑆𝐸 )
Question 5
A random sample of 40 young giraffes is obtained. What is the probability that their mean weight is
between 280 kg and 290 kg?
a) 0.6390
b) 0.1428
c) 0.8686
d) 0.3800
e) None of the above
Question 6
When there are too many giraffes on game farms, some of the young giraffes are sent to zoos
(instead of being culled). For logistical reasons, only the lightest young giraffes can be transported.
The owner of a particular game farm needs to reduce the number of young giraffes by 5%. Young
giraffes of at most 𝑥 kg will be sent to zoos. The value of 𝑥 is:
a) 𝑥 = 221.41
b) 𝑥 = 282.30
c) 𝑥 = 339.93
d) 𝑥 = 269.80
e) 𝑥 = 228.07
Question 7
To determine the value of 𝑥 in Question 14, we would have used the following function in Excel:
a) = 𝑁𝑂𝑅𝑀. 𝑆. 𝐼𝑁𝑉 ( 0,05 ; 284 ; 34 )
b) = 𝑁𝑂𝑅𝑀. 𝐼𝑁𝑉 ( 0,05 ; 284 ; 34 )
c) = 𝑁𝑂𝑅𝑀. 𝐼𝑁𝑉 ( 0,95 ; 34 ; 284 )
d) = 𝑁𝑂𝑅𝑀. 𝑆. 𝐼𝑁𝑉 ( 0,95 ; 34 ; 284 )
e) = 1 − 𝑁𝑂𝑅𝑀. 𝐼𝑁𝑉 ( 0,05 ; 284 ; 34)
Questions 8 and 9 are based on the following information:
The engines made by Ford for speedboats have an average power of 220 horsepower (HP) and a
standard deviation of 15 HP. It is assumed that the power of the engines follows a normal
distribution.
Question 8
A sample of 30 engines is taken. Approximately how many could you expect to have more than 230
HP?
a) 22
b) 20
c) 10
d) 8
e) 4
Question 9
If a sample of 4 engines is taken, what is the probability that the sample mean differs from the
actual mean by at least 12 HP?
a) 0.0548
b) 0.8904
c) 0.1096
d) 0.9986
e) This cannot be calculated.
Question 10
At Stanleys Bakery the weights of (standard) loaves of bread follow a normal distribution with a
mean of 700 grams. If only 15% of the loaves are heavier than 718.72 grams, what is the value of
the standard deviation?
a) 13.37
b) 18.00
c) 1.40
d) 4.24
e) 23.00
Question 11
In a particular farming region, the average weight of pumpkins is known to be 6.4 kg with a
standard deviation of 1.7 kg. If a random sample of 400 pumpkins from this region is selected, the
variance of the sampling distribution of the sample mean will be
1.7
a)
400
1.72
b)
√400
1.7
c)
√400
1.72
d) √
400
1.72
e)
400
Question 12
Which one of the following statements is true?
a) For a t-distribution with 𝜐 degrees of freedom, 𝑡𝜐 ; 𝛼 = −𝑡𝜐 ; 1+𝛼
b) The variance of the standard normal distribution is smaller than that of the t-distribution.
c) As the degrees of freedom get smaller, the variance for the t-distribution gets smaller.
d) All normal distributions and t-distributions are symmetric around zero.
e) 𝑃(𝑍 > 0) ≠ 𝑃(𝑡 < 0)
Questions 13, 14 and 15 are based on the following information:
You work for a company that provides online travel guides. In a popular tourist city, health officials
reported that 14% of restaurants have unsatisfactory health conditions. You would like to confirm
that this amount is true. You randomly select 100 restaurants to check.
Question 13
You assume that the health officials reported proportion is correct. What is the probability that the
sample proportion of restaurants with unsatisfactory health conditions is more than 0.13?
a) 0.6103
b) 0.9509
c) 0.3859
d) 0.3897
e) 0.6141
Question 14
Select the correct Excel function to determine the probability in Question 21 above?
a) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,13 ; 0,14 ; 0,14 ∗ 0,86/100 ; 𝑇𝑅𝑈𝐸 )
b) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,13 ; 0,14 ; 𝑆𝑄𝑅𝑇(0,14 ∗ 0,86/100) ; 𝑇𝑅𝑈𝐸 )
c) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,14 ; 0,13 ; 14 ∗ 86/𝑆𝑄𝑅𝑇(100) ; 𝑇𝑅𝑈𝐸 )
d) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,13 ; 0,14 ; 𝑆𝑄𝑅𝑇(0,14 ∗ 0,86/100) ; 𝐹𝐴𝐿𝑆𝐸 )
e) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,13 ; 0,14 ; 0,14 ∗ 0,86/𝑆𝑄𝑅𝑇(100) ; 𝐹𝐴𝐿𝑆𝐸 )
Question 15
What is the probability that the sample proportion of restaurants with unsatisfactory health
conditions is more than that reported by the health officials?
a) 0
b) 0.5
c) 0.14
d) 0.86
e) This cannot be calculated.
Questions 16 and 17 are based on the following information:
It is known that 43% of Americans own an iPhone. A random sample of 50 Americans was surveyed.
Question 16
What is the probability that 45% to 50% of the sample owned an iPhone?
a) 0.0005
b) 0.8094
c) 0.2593
d) 0.2272
e) 0.7141
Question 17
What is the probability that the sample proportion of Americans that own iPhones differs from the
true proportion by at most 0.12?
a) 0.9128
b) 0.0090
c) 0.9910
d) 0.0436
e) 0.9994
Question 18
Which one of the following statements is true?
a) The chi-squared distribution is positively skewed with no values greater than zero.
b) The area under the curve of the chi-squared distribution is greater than 1.
c) The chi-squared distribution is symmetric and centered at a value determined by its degrees
of freedom.
d) The shape of the chi-squared distribution depend on its degrees of freedom.
e) All of the above are false.
Question 19
2
Find the following values: (i) 𝑡10; 0.05 and (ii) 𝜒12; 0.05
a) (i) 1.833 and (ii) 3.074
b) (i) – 3.169 and (ii) – 5.226
c) (i) –1.812 and (ii) 5.226
d) (i) –1.812 and (ii) 4.575
e) (i) 1.812 and (ii) 2.603
Question 20
Many distributions have the same properties. Consider the 4 properties below:
i) The distribution is symmetric around its mean.
ii) The distribution is bell-shaped.
iii) The variance of the distribution is larger than 1.
iv) The probability that the random variable is less than 0 (zero) is 0.5.
Which of the following distributions have all four qualities above?
a) 𝑁(−0.5 ;2)
2
b) 𝜒23
c) 𝑁(0; 1)
d) 𝑡23
e) 𝑁(5 ; 2)
Questions 21, 22 and 23 are based on the following information:
The weights of plums from a certain orchard are known to be normally distributed with a mean of 24
grams and a standard deviation of 5 grams.
Question 21
What proportion of plums weigh between 24g and 28g?
a) 0.8
b) 0.7881
c) 0.2119
d) 0.2881
e) This cannot be calculated.
Question 22
A sample of 40 plums is randomly selected. What is the probability that their average weight is less
than 23 g?
a) 0.2530
b) 0.4013
c) 0.8962
d) 0.0023
e) 0.1038
Question 23
Fifteen (15) percent of plums are put into the “extra large” category. What is the minimum weight that
a plum must be in order to be classed as extra large? If using Microsoft Excel to answer this question,
which one of the following commands will you use?
a) =𝑁𝑂𝑅𝑀.𝐼𝑁𝑉 ( 0,85 ;24 ;25 )
b) =𝑁𝑂𝑅𝑀.𝑆.𝐼𝑁𝑉 ( 0,15 ;24 ;5 )
c) =𝑁𝑂𝑅𝑀.𝐼𝑁𝑉 ( 0,15 ;24 ;5 )
d) =𝑁𝑂𝑅𝑀.𝑆.𝐼𝑁𝑉 ( 0,85 ;24 ;5 )
e) =𝑁𝑂𝑅𝑀.𝐼𝑁𝑉 ( 0,85 ;24 ;5 )
Question 24
The diagram below shows three normal distributions.
In the diagram we see that
a) Distribution A has the largest mean.
b) Distribution C and A have the same variance.
c) Distribution C has the largest mean.
d) Distribution B has the largest variance.
e) Distribution A has the smallest variance.
Questions 25 and 26 are based on the following information:
Protein is essential for the growth and maintenance of muscle. A good source of protein is red meat.
Fifteen (15) percent of households eat red meat regularly.
Question 25
A random sample of 80 households is selected. What is the probability that red meat is eaten
regularly in over 20% of these households?
a) 0.0002
b) 0.8944
c) 0.9998
d) 0.1056
e) None of the above
Question 26
Suppose a random sample of 45 households is selected. What is the probability that the difference
between the sample proportion of households that eat red regularly and the true proportion is at most
0.05?
a) 0.9398
b) 0.9393
c) 0.9914
d) 0.5933
e) 0.6528
Question 27
If 𝜒 2 follows a Chi-square distribution with 12 degrees of freedom, what function/formula will be
used in Microsoft Excel to find 𝑃( 𝜒 2 > 11.02) + 𝑃( 𝜒 2 < 2.27) ?
a) =𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇 ( 11,02 ;12 ;𝑇𝑅𝑈𝐸)+𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇 ( 2,27 ;12 ;𝑇𝑅𝑈𝐸)
b) =𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇.𝑅𝑇 ( 11,02 ;12)+𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇 ( 2,27 ;12 ;𝑇𝑅𝑈𝐸)
c) =𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇.𝑅𝑇 ( 11,02 ;12 ; 𝑇𝑅𝑈𝐸)+𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇 ( 2,27 ;12 ;𝑇𝑅𝑈𝐸)
d) =𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇.𝑅𝑇 ( 11,02 ;12)+𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇 ( 2,27 ;12 ;𝐹𝐴𝐿𝑆𝐸)
e) =𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇.𝑅𝑇 ( 11,02 ;12 ; 𝐹𝐴𝐿𝑆𝐸)+𝐶𝐻𝐼𝑆𝑄.𝐷𝐼𝑆𝑇 ( 2,27 ;12 ;𝐹𝐴𝐿𝑆𝐸)
Question 28
A bank surveyed all of its 60 employees to determine the proportion who participate in volunteer
activities. Which of the following statements is true?
a) The bank should not use the data from this survey because this is an observational study.
b) The bank can use the results of this survey to prove that working for the bank causes
employees to participate in volunteer activities.
c) The bank does not need to use an inference procedure to determine the proportion of
employees who participate in volunteer activities because the survey was a census of all
employees.
d) The bank did not select a random sample of employees, so the survey will not provide the
bank with useful information.
The bank would have to use the survey data to conduct a confidence interval in order to estimate the
proportion of employees who participate in volunteer activities.
Questions 29 and 30 are based on the following information:
It has been reported that 30% of people use long life milk. You randomly select 25 people and ask
them if they use long life milk.
Question 29
What is the probability that more than 3 but less than 8 of them say they use long life milk?
a) 0.4786
b) 0.6437
c) 0.4213
d) 0.6769
e) 0
Question 30
Which one of the following commands can you use in Microsoft Excel to calculate the probability
that at least 10 people say they do not use long life milk?
a) = 1 − 𝐵𝐼𝑁𝑂𝑀. 𝐷𝐼𝑆𝑇 ( 9 ; 25 ; 0,70 ; 𝐹𝐴𝐿𝑆𝐸 )
b) = 𝐵𝐼𝑁𝑂𝑀. 𝐷𝐼𝑆𝑇 ( 10 ; 25 ; 0,30 ; 𝑇𝑅𝑈𝐸 )
c) = 1 − 𝐵𝐼𝑁𝑂𝑀. 𝐷𝐼𝑆𝑇 ( 9 ; 25 ; 0,70 ; 𝑇𝑅𝑈𝐸 )
d) = 1 − 𝐵𝐼𝑁𝑂𝑀. 𝐷𝐼𝑆𝑇 ( 10 ; 25 ; 0,30 ; 𝐹𝐴𝐿𝑆𝐸)
e) = 𝐵𝐼𝑁𝑂𝑀. 𝐷𝐼𝑆𝑇 ( 10; 25; 0,70 ; 𝐹𝐴𝐿𝑆𝐸 )
Questions 31 and 32 are based on the following information:
At Sam’s Petrol Garage an average of 8 cars get their tanks filled every 30 mins. Sam’s petrol garage
is open from 06:00 to 20:00 every day.
Question 31
What is the probability that at least 3, but at most 7, cars get their tanks filled in 15 mins?
a) 0.1682
b) 0.2549
c) 0.7108
d) 0.5154
e) None of the above
Question 32
What is the probability that at least 210 cars get their tanks filled on any given day?
Which of the following commands can be used if this probability is to be found using Microsoft
Excel?
a) = 1 − 𝑃𝑂𝐼𝑆𝑆𝐼𝑁. 𝐷𝐼𝑆𝑇 ( 210 ; 224 ; 𝑇𝑅𝑈𝐸 )
b) = 1 − 𝑃𝑂𝐼𝑆𝑆𝐼𝑁. 𝐷𝐼𝑆𝑇 ( 210 ; 8 ; 𝑇𝑅𝑈𝐸 )
c) = 𝑃𝑂𝐼𝑆𝑆𝐼𝑁. 𝐷𝐼𝑆𝑇 ( 210 ; 224 ; 𝑇𝑅𝑈𝐸 )
d) = 𝑃𝑂𝐼𝑆𝑆𝐼𝑁. 𝐷𝐼𝑆𝑇 ( 210 ; 224 ; 𝐹𝐴𝐿𝑆𝐸 )
e) = 1 − 𝑃𝑂𝐼𝑆𝑆𝐼𝑁. 𝐷𝐼𝑆𝑇 ( 210 ; 384 ; 𝑇𝑅𝑈𝐸 )
Question 33
Which one of the following statements is false?
a) If 𝑋~𝑁(8 ; 0.7), then 0 < 𝑥 < ∞.
b) The standard normal distribution and the t-distribution are symmetric around their means.
c) 𝑃(𝑍 > 0) = 1 − 𝑃(𝑡 < 0)
d) As the degrees of freedom increase, the t-distribution tends towards the standard normal
distribution.
e) If 𝑋~𝑡10 and 𝑌~𝑡20 , then 𝑋 has a larger variance than 𝑌.
Questions 34, 35 and 36 are based on the following information:
The number of bricks laid by a construction worker follows a normal distribution with a mean of 120
bricks per hour and a standard deviation of 20 per hour.
Question 34
Which of the following Microsoft Excel commands can be used to calculate the probability that a
randomly selected worker lays more than 140 bricks in the next hour?
a) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 140 ; 120 ; 20 ; 𝑇𝑅𝑈𝐸 )
b) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 140 ; 120 ; 20^2 )
c) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 140 ; 120 ; 20^2; 𝑇𝑅𝑈𝐸 )
d) = 1 − 𝑁𝑂𝑅𝑀. 𝑆. 𝐷𝐼𝑆𝑇 ( 120 ; 140 ; 20 ; 𝑇𝑅𝑈𝐸 )
e) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 140 ; 120 ; 20 ; 𝑇𝑅𝑈𝐸 )
Questions 35
If a sample of 50 construction workers is selected, what is the probability that the average number
of bricks laid by them in an hour differs from the true average number of bricks laid per hour by less
than 6 bricks?
a) 0.0340
b) 0.9830
c) 0.9660
d) 0.7734
e) None of the above.
Questions 36
Only 14% of workers lay more than, approximately, 𝑥 bricks per hour.
Find the value of 𝑥.
a) 𝑥 = 142
b) 𝑥 = 156
c) 𝑥 = 98
d) 𝑥 = 84
e) 𝑥 = 137
Questions 37 and 38 are based on the following information:
It has been found that 70% of adults favour some kind of government control on the prices of
medicines. Suppose a random sample of 400 adults is selected.
Question 37
What is the probability that less than 65% of them favour some kind of government control of the
prices of medicines?
Which one of the following Microsoft Excel commands can be used to calculate this probability?
f) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,65 ; 0,70 ; 0,70 ∗ 0,30/400 ; 𝑇𝑅𝑈𝐸 )
g) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,65 ; 0,70 ; 𝑆𝑄𝑅𝑇(0,70 ∗ 0,30/400) ; 𝑇𝑅𝑈𝐸 )
h) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,70 ; 0,65 ; 70 ∗ 30/𝑆𝑄𝑅𝑇(400) ; 𝑇𝑅𝑈𝐸 )
i) = 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,70 ; 0,65 ; 𝑆𝑄𝑅𝑇(0,70 ∗ 0,30/400) ; 𝐹𝐴𝐿𝑆𝐸 )
j) = 1 − 𝑁𝑂𝑅𝑀. 𝐷𝐼𝑆𝑇 ( 0,65 ; 0,70 ; 0,70 ∗ 0,30/𝑆𝑄𝑅𝑇(400) ; 𝐹𝐴𝐿𝑆𝐸 )
Question 38
What is the probability that the proportion of adults who favour some kind of government control
on the prices of medicines is within 0.06 of the population proportion?
a) 0.9910
b) 0.9956
c) 0.0088
d) 0.9912
e) 0.0044
Question 39
Which of the following statements is correct?
a) The 𝜒 2 -distribution is skewed to the left.
b) As the degrees of freedom decrease, the χ2-distribution becomes more skewed.
c) The 𝜒 2 -distribution does not change its shape when there are changes in the number of
degrees of freedom.
d) 𝑃(χ2 < −5.12) = 𝑃(χ2 > 5.12)
e) The 𝜒 2 -distribution is one of the few symmetric distributions known.
Question 40
When comparing the standard error of the sampling distribution of the sample mean to the
standard deviation of the underlying population, the population standard deviation is
a) always smaller than the standard error.
b) always equal to the standard error.
c) always larger than the standard error.
d) either smaller or larger than the standard error depending on the mean.
e) either smaller or larger than the standard error depending on the data.
Question 41
Suppose 𝑌~𝑡16 . Which of the following Microsoft Excel commands can be used to find the tenth
(10th) percentile of 𝑌?
a) = 𝑇. 𝐼𝑁𝑉 ( 0,1 ; 16 )
b) = 𝑇. 𝐼𝑁𝑉 ( 10 ; 16 )
c) = 𝑇. 𝐼𝑁𝑉 ( 0,1 ; 16 ; 𝑇𝑅𝑈𝐸 )
d) = 𝑇. 𝐼𝑁𝑉 ( 10 ; 16 ; 𝑇𝑅𝑈𝐸 )
e) None of the above.
Question 42
Options (a) to (e) describe situations in which a claim is being made. If the claims are to be tested,
which of the situations can be tested using the hypotheses:
𝐻0 ∶ 𝜇 = 10 versus 𝐻1 ∶ 𝜇 > 10 ?
a) The owner of a trucking company claims that more than 10% of his employees are female.
b) A car-part manufacturer claims that the average size of a particular part differs from the
industry standard by no more than 10mm.
c) A school teacher claims that the variance of the marks for a biology test is greater than 10%.
d) An employee of a consumer watch claims that the average weight of 10kg bags of avocadoes
is more than what is indicated on the packaging.
e) In a recent article it says that the average man is 10kg heavier than the average woman.
Questions 43 to 46 are based on the following information:
Researchers are studying the effect of energy drinks on the ability of rats to complete a complex
maze. From previous studies it was found that, for rats that consumed no energy drinks, the
average time to complete the maze is 49.2 seconds. The researchers believe that the rats that are
given an energy drink will complete the maze faster. They select a random sample of 10 mice and
give them an energy drink before making them complete the maze. The sample gives a mean of
46.2 seconds with a standard deviation of 3.5 seconds. Using this data, the researchers conduct a
hypothesis test using a 5% level of significance.
Question 43
The test statistic that the researchers calculate for this test is –2.71. What critical value(s) will they
use and what decision will they make?
a) Critical value: –1.833.
Decision: Reject the null hypothesis in favour of the alternative hypothesis.
b) Critical values: 2.262 and –2.262
Decision: Do not reject the null hypothesis in favour of the alternative hypothesis.
c) Critical value: 1.833
Decision: Do not reject the null hypothesis in favour of the alternative hypothesis.
d) Critical value: –1.645
Decision: Reject the null hypothesis in favour of the alternative hypothesis.
e) Critical value: 1.645
Decision: Do not reject the null hypothesis in favour of the alternative hypothesis.
Question 44
Suppose the null hypothesis is not rejected in favour of the alternative hypothesis. The researchers
can conclude, at a 5% level of significance, that
a) there is insufficient evidence to indicate that the average time to complete the maze by rats
who have consumed energy drinks is 46.2 seconds.
b) there is insufficient evidence to indicate that the average time to complete the maze by rats
who have consumed energy drinks is more than 49.2 seconds.
c) there is sufficient evidence to indicate that the average time to complete the maze by rats
who have consumed energy drinks is 49.2 seconds.
d) there is insufficient evidence to indicate that the average time to complete the maze by rats
who have consumed energy drinks is less than 49.2 seconds.
e) there is insufficient evidence to indicate that the average time to complete the maze by rats
who have consumed energy drinks is less than 46.2 seconds.
Question 45
The researchers would like conduct a hypothesis test, using a 10% level of significance, to find out
whether the population variance of the time taken to do the maze by rats who have consumed
energy drinks is longer than 9.61 seconds2. What is the value of the test statistic that the
researchers will use for this test?
a) 4.168
b) 3.278
c) 1.194
d) 11.472
e) None of the above.
Question 46
The researchers decide to construct a 95% confidence interval for the true variance of the time
taken to do the maze by rats that have consumed energy drink. The confidence interval construct is
a) ( 5.981 ; 37.727 )
b) ( 6.516 ; 33.158 )
c) ( 1.912 ; 8.883 )
d) ( 1.656 ; 11.667 )
e) ( 5.796 ; 40.833 )
Questions 47, 48 and 49 are based on the following information:
KwaZulu-Natal (KZN) often has periods of very hot weather. You believe that in a particular wealthy
area in KZN, more than half the houses have central air conditioning. You randomly select 150
houses from this area and find that 65 have central air conditioning. Using this information, you do
a hypothesis test to see if your belief could be true. You use a 1% level of significance.
Question 47
For this test, a Type II error will occur if
a) 𝐻0 ∶ 𝑝 = 0.5 is rejected when it is false.
b) 𝐻0 ∶ 𝑝 = 0.5 is not rejected when it is true.
c) 𝐻1 ; 𝑝 > 0.5 is accepted when it is true.
d) 𝐻0 ∶ 𝑝 = 0.5 is rejected when it is true.
e) 𝐻0 ∶ 𝑝 = 0.5 is not rejected when it is false.
Question 48
What is the value of the test statistic that you use for your hypothesis test?
a) 2.326
b) –1.633
c) –2.308
d) –1.648
e) None of the above
Question 49
You construct the following 95% confidence interval for the true proportion of houses in this area
that have central air conditioning:
( 0.35 ; 0.51 ).
Thus, with 95% confidence you can say that
a) 35% to 51% of the time houses in this area have central air conditioning.
b) 35% to 51% of the houses in this area have central air conditioning 95% of the time.
c) 35% to 51% of houses in this area have central air conditioning.
d) 5% of houses in this area have central air conditioning.
e) 35 to 51 houses in this area have central air conditioning.
Question 50
You want to estimate the true average cost of a road repair job. If you want to be 90% confident
that the error of your estimate is at most R900, how many road repair jobs should you include in
your sample? Based on past reports, you assume the standard deviation is R3000.
a) 31
b) 19
c) 6
d) 5
e) 82
Question 51
Using a sample of 100 nurses, you construct a 99% confidence interval for the variance associated
with the amount (in mililitres) of sanitizer used by nurses. Your sample gives 𝑠 = 2100 millilitres.
Using a separate sample of 100 nurses, a friend of yours also constructs a 99% confidence interval
for the variance of the amount of sanitizer. Your friends sample gives 𝑠 = 2400. The confidence
interval your friend constructs is
a) narrower than yours.
b) the same as yours.
c) to the left of yours.
d) wider than yours.
e) The confidence intervals cannot be compared because different samples are used.
Question 52
Shown below is a scatterplot of Y versus X.
Which of the options below is most likely to be the approximate value of 𝑟 2 , associated with a
regression of Y on X?
a) –99.5%
b) 2.0%
c) 50.0%
d) 99.5%
e) –50.0%
Questions 53 and 54 are based on the information below:
In an attempt to predict a student’s final statistics mark from their year mark (DP mark), a linear
regression was performed on 259 students’ marks using the Excel Data Analysis package. The table
below gives a partial summary of the output from Excel.
Question 53
The “R Square” value of 0.648 (rounded to 3-decimal places) means that
a) 0.648% of the variation in the year mark is explained by the final mark.
b) 35.2% of the variation in the final mark is explained by the year mark.
c) the correlation between final mark and year mark is 0.648.
d) the correlation between year mark and final mark is –0.648
e) 64.8% of the variation in the final mark is explained by the year mark.
Question 54
Using the regression line constructed from this output, what final mark can be predicted for a
student with a class mark of 65%.
a) 50%
b) 43%
c) 57%
d) 65%
e) 53%
Questions 55 and 56 are based on the following information:
The training manager of a company that assembles and exports pool pumps wants to find out if a
worker’s productivity (number of pumps assembled) can be predicted by the number of hours
he/she spent in training. A random sample of 10 assembly workers
was selected and their performances evaluated. The data was captured in Microsoft Excel. A
portion of the worksheet is shown below.
Question 55
Which one of the following is the regression line for the data?
a) 𝑦̂ = −17.481 − 2.63 𝑥
b) 𝑦̂ = 16.391 − 3.24 𝑥
c) 𝑦̂ = −12.713 + 5.72 𝑥
d) 𝑦̂ = 18.916 + 1.11 𝑥
e) 𝑦̂ = 21.161 + 2.96 𝑥
Question 56
The owner of the company asks the training manager for an estimate of the average productivity of
a worker. The training manager decides to construct a 95% confidence interval. Which of the
following commands can he use to calculate the upper confidence limit of the confidence interval?
a) = 𝐸7 + 𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸. 𝑇 ( 0,05 ; 𝐸8 ; 10 )
b) = 𝐸3 + 𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸. 𝑁𝑂𝑅𝑀 ( 0,95 ; 𝐸4 ; 10 )
c) = 𝐸7 + 𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸. 𝑇 ( 0,95 ; 𝐸8 ; 10 )
d) = 𝐸3 + 𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸. 𝑇 ( 0,05 ; 𝐸4 ; 10 )
e) = 𝐸7 + 𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸. 𝑁𝑂𝑅𝑀 ( 0,05 ; 𝐸8 ; 10 )
Questions 57 to 60 are based on the following information:
Laura would like to purchase a second-hand car. She surveyed the ages (years) and the prices (thousands of
Rands) of ten second-hand vehicles of a particular type. The data was captured in an Excel workbook, a
portion of which is shown below.
Question 57
Laura would like to construct a 95% confidence interval for the true average price of second-hand
vehicles of this type. If Laura uses Microsoft Excel to construct the construct the confidence interval,
which of the following commands will she use to find the upper confidence limit?
a) =𝐼4+𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸.𝑇 ( 0,05 ;𝐼5 ;10 )
b) =𝐼4+𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸.𝑍 ( 0,95 ;𝐼5 ;10 )
c) =𝐼4+𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸.𝑇 ( 0,95 ;𝐼5 ;10 )
d) =𝐼4+𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸.𝑇 (0,05 ;𝐼5^2 ;10)
e) =𝐼4+𝐶𝑂𝑁𝐹𝐼𝐷𝐸𝑁𝐶𝐸.𝑍 ( 0,05 ;𝐼5 ;10 )
Question 58
The correlation coefficient and its interpretation in ‘real life’ terms are:
a) 𝑟 2 = – 0.815 , There is a strong negative linear correlation implying as the age of a vehicle
increases, its price decreases.
b) 𝑟 = – 0.9028 , There is a strong negative linear correlation implying as the age of a vehicle
increases, its price decreases.
c) 𝑟 = – 0.9028 , There is a strong negative linear correlation implying as the age of a vehicle
decreases, its price decreases.
d) 𝑟 = 0.9028 , There is a strong positive linear correlation implying as the age of a vehicle
increases, its price also increases.
e) 𝑟 2 = 0.815 There is a strong positive linear correlation implying as the age of a vehicle
increases, its price also increases.
Question 59
Laura will be using Microsoft Excel to calculate the average age of these 10 cars. Which one of the
following commands can she use to do the calculation?
a) =𝐴𝑉𝐸𝑅𝐴𝐺𝐸 ( 𝐵1∶𝐾1 )
b) =𝐴𝑉𝐸 ( 𝐵1∶𝐾1 )
c) =𝑀𝐸𝐴𝑁 ( 𝐵1 ;𝐾2 )
d) =𝐴𝑉𝐸𝑅𝐴𝐺𝐸 ( 𝐵1:𝐾2 )
e) =𝑀𝐸𝐴𝑁 ( 𝐵1∶𝐾1 )
Question 60
What is the average age of the 10 cars in the sample?
a) 6.7 years
b) 8 years
c) 7 years
d) 8.5 years
e) 9.1 years
Questions 61 and 62 are based on the following information:
As part of a class project at a large university, Mthobi selected a random sample of 12 students in her major
field of study. All students in the sample were asked to report their number of hours spent studying for the
final exam and their score on the final exam. A regression analysis on the data produced the following partial
computer output.
Question 61
The correlation coefficient and coefficient of determination for the data are:
a) 𝑟 = 0.45 and 𝑟2=0.56
b) 𝑟 = 0.75 and 𝑟2=0.56
c) 𝑟 = 0.56 and 𝑟2=3.55
d) 𝑟 = 0.75 and 𝑟2=12
e) 𝑟 = 0.75 and 𝑟2=0.45
Question 62
Which one of the following is the correct regression line for this data?
a) 𝑦 = 3.558 + 63.97 𝑥
b) 𝑦 = 3.558 + 9.788 𝑥
c) 𝑦 = 1.571 + 8.57 𝑥
d) 𝑦 = 8.57 + 1.571 𝑥
e) 𝑦 = 63.97 + 3.558 𝑥