STATISTICS & PROBABILITY

1. All of the following increase the width of a confidence interval except:
a. Increased confidence level
b. Increased variability
c. Increased sample size
d. Decreased sample size

2. The p-value in hypothesis testing represents which of the following:
a. The probability of failing to reject the null hypothesis, given the observed results.
b. The probability that the null hypothesis is true, given the observed results.
c. The probability that the observed results are statistically significant, given that the null hypothesis is true.
d. The probability of observing results as extreme or more extreme than currently observed, given that the null hypothesis is true.

3. A sociologist focusing on popular culture and media believes that the average number of hours per week (hrs/week) spent using social media is greater for women than for men. Examining two independent simple random samples of 100 individuals each, the researcher calculates sample standard deviations of 2.3 hrs/week and 2.5 hrs/week for women and men respectively. If the average number of hrs/week spent using social media for the sample of women is 1 hour greater than that for the sample of men, what conclusion can be made from a hypothesis test where:
H0: µW − µM = 0
H1: µW − µM > 0
a. The observed difference in average number of hrs/week spent using social media is not significant.
b. The observed difference in average number of hrs/week spent using social media is significant.
c. A conclusion is not possible without knowing the average number of hrs/week spent using social media in each sample.
d. A conclusion is not possible without knowing the population sizes.

4. A 99% t-based confidence interval for the average number of hours per day students spend studying their lessons is calculated using a simple random sample of 50 students. Given that the 99% confidence interval is 3.32 hrs < µ < 3.98 hrs, what is the sample mean number of hours students spend studying in a day?
a. 2.33 hours
b. 3.65 hours
c. Not Enough Information; we would need to know the variation in the sample of number of hours students spent in studying in a day.
d. Not Enough Information; we would need to know the variation in the population of hours students spent in studying in a day.

For #s 5-6 refer to the illustration below.

5. The histogram above represents the hospital length of stay (in days) for patients at a nearby medical facility. How many patients are included in the histogram?
a. 5 b. 21 c. 17 d. 9

6. Using the histogram to the right that represents the hospital lengths of stay (in days) for patients at a nearby medical facility, determine the relationship between the mean and the median.
a. Mean = Median
b. Mean ≈ Median
c. Mean < Median
d. Mean > Median

7. Green sea turtles have normally distributed weights, measured in kilograms, with a mean of 134.5 and a variance of 49.0. A particular green sea turtle’s weight has a z-score of −2.4. What is the weight of this green sea turtle? Round to the nearest whole number.
a. 17 kg b. 151 kg c. 118 kg d. 252 kg

8. Which of the following exam scores is better relative to other students enrolled in the course?
A psychology exam grade of 85; the mean grade for the psychology exam is 92 with a standard deviation of 3.5.
An economics exam grade of 67; the mean grade for the economics exam is 79 with a standard deviation of 8.
A chemistry exam grade of 62; the mean grade for the chemistry exam is 62 with a standard deviation of 5.
a. The psychology exam score is relatively better.
b. The economics exam score is relatively better.
c. The chemistry exam score is relatively better.
d. All of the exam scores are relatively equivalent.

9. The statement “If there is sufficient evidence to reject a null hypothesis at the 10% significance level, then there is sufficient evidence to reject it at the 5% significance level” is: Please select the best answer of those provided below.
a. Always True
b. Never True
c. Sometimes True; the p-value for the statistical test needs to be provided for a conclusion.
d. Not Enough Information; this would depend on the type of statistical test used

10. Which of the following statements best describes the relationship between a parameter and a statistic?
a. A parameter has a sampling distribution with the statistic as its mean.
b. A parameter has a sampling distribution that can be used to determine what values the statistic is likely to have in repeated samples.
c. A parameter is used to estimate a statistic.
d. A statistic is used to estimate a parameter.

11. A randomly selected sample of 400 students at a university with 15-week semesters was asked whether or not they think the semester should be shortened to 14 weeks (with longer classes). Forty-six percent (46%) of the 400 students surveyed answered "yes." Which one of the following statements about the number 46% is correct?
a. It is a sample statistic.
b. It is a population parameter.
c. It is a margin of error.
d. It is a standard error.

12. Which of the following examples involves paired data?
a. A study compared the average number of courses taken by a random sample of 100 freshmen at a university with the average number of courses taken by a separate random sample of 100 freshmen at a community college.
b. A group of 100 students were randomly assigned to receive vitamin C (50 students) or a placebo (50 students). The groups were followed for 2 weeks and the proportions with colds were compared.
c. A group of 50 students had their blood pressures measured before and after watching a movie containing violence. The mean blood pressure before the movie was compared with the mean pressure after the movie.
d. None of the above.

13. The expected value of a random variable is the
a. value that has the highest probability of occurring.
b. mean value over an infinite number of observations of the variable.
c. largest value that will ever occur.
d. most common value over an infinite number of observations of the variable.

14. Which one of these variables is a continuous random variable?
a. The time it takes a randomly selected student to complete an exam.
b. The number of tattoos a randomly selected person has.
c. The number of women taller than 68 inches in a random sample of 5 women.
d. The number of correct guesses on a multiple choice test.

15. Suppose that vehicle speeds at an interstate location have a normal distribution with a mean equal to 70 mph and standard deviation equal to 8 mph. What is the z-score for a speed of 64 mph?
a. −0.75 b. +0.75 c. −6 d. +6

16. Pulse rates of adult men are approximately normal with a mean of 70 and a standard deviation of 8. Which choice correctly describes how to find the proportion of men that have a pulse rate greater than 78?
a. Find the area to the left of z = 1 under a standard normal curve.
b. Find the area between z = −1 and z = 1 under a standard normal curve.
c. Find the area to the right of z = 1 under a standard normal curve.
d. Find the area to the right of z = −1 under a standard normal curve.

17. The probability is p = 0.80 that a patient with a certain disease will be successfully treated with a new medical treatment. Suppose that the treatment is used on 40 patients. What is the "expected value" of the number of patients who are successfully treated?
a. 40 b. 20 c. 8 d. 32

18. Suppose there is a correlation of r = 0.9 between number of hours per day students study and GPAs. Which of the following is a reasonable conclusion?
a. 90% of students who study receive high grades.
b. 90% of the variation in GPAs can be explained by variation in number of study hours.
c. 10% of the variation in GPAs cannot be explained by variation in number of study hours per day.
d. 81% of the variation in GPAs can be explained by variation in number of study hours per day.

19. Which of the following are true statements?
I. If there is sufficient evidence to reject a null hypothesis at the 10% level, then there is sufficient evidence to reject it at the 5% level.
II. Whether to use a one- or two-sided test is typically decided after the data are gathered.
III. If a hypothesis test is conducted at the 1% level, there is a 1% chance of rejecting the null hypothesis.
a. I & II only
b. II & III only
c. I, II, & III
d. None are true.

20. A historian believes that the average height of soldiers in World War II was greater than that of soldiers in World War I. She examines a random sample of records of 100 men in each war and notes standard deviations of 2.5 and 2.3 inches in World War I and World War II, respectively. If the average height from the sample of World War II soldiers is 1 inch greater than from the sample of World War I soldiers, what conclusion is justified from a two-sample hypothesis test where H0: µ1 − µ2 = 0 and Ha: µ1 − µ2 < 0?
a. The observed difference in average height is significant.
b. The observed difference in average height is not significant.
c. A conclusion is not possible without knowing the mean height in each sample.
d. A conclusion is not possible without knowing both the sample means and the two original population sizes.

21. The death rate from a particular form of cancer is 23% during the first year. When treated with an experimental drug, only 15 out of 84 patients die during the initial year. Is this strong evidence to claim that the new medication reduces the mortality rate?
a. Yes, because the P-value is .0459.
b. Yes, because the P-value is .1314.
c. No, because the P-value is only .0459.
d. No, because the P-value is above .10.

22. A teacher believes that giving her students a practice quiz every week will motivate them to study harder, leading to a greater overall understanding of the course material. She tries this technique for a year, and everyone in the class achieved a grade of at least C. Is this an experiment or an observational study?
a. An experiment, but with no reasonable conclusion possible about cause and effect.
b. An experiment, thus making cause and effect a reasonable conclusion.
c. An observational study, because there was no use of a control group.
d. An observational study, but a poorly designed one because randomization was not used.

23. In a simple random survey of 89 teachers of high school AP Statistics, 73 said that it was the most satisfying, most enjoyable course they had ever taught. Establish a 98% confidence interval estimate of the proportion of all high school AP Statistics teachers who feel this way.
a. 0.820 ± 0.004
b. 0.820 ± 0.041
c. 0.820 ± 0.084
d. 0.820 ± 0.095

24. To survey the opinions of the students at your high school, a researcher plans to select every twenty-fifth student entering the school in the morning. Assuming there are no absences, will this result in a simple random sample of students attending your school?
a. Yes, because every student has the same chance of being selected.
b. Yes, but only if there is a single entrance to the school.
c. Yes, because the 24 out of every 25 students who are not selected will form a control group.
d. Yes, because this is an example of systematic sampling, which is a special case of simple random sampling.
e. No, because not every sample of the intended size has an equal chance of being selected.

25. Following is a histogram of ages of people applying for a particular high-school teaching position.
Which of the following statements are true?
I. The median age is between 24 and 25.
II. The mean age is between 22 and 23.
III. The mean age is greater than the median age.
a. I only
b. II only
c. III only
d. All are true

26. To test the hypothesis H0: µ ≤ 24; H1: µ > 24 at α = 0.05. If the test value = 2.10 and the P-value = 0.0179, then the decision is
a. reject H1
b. reject H0
c. cannot make a decision
d. do not reject H0

27. What type of sampling is being used if USTP students are divided into different groups according to their colleges and a sample is chosen from each college to be surveyed?
a. stratified
b. cluster
c. random
d. systematic

28. Find the probability of getting a number greater than 4 when a die is rolled one time.
a. 2/3 b. 1/3 c. 1/6 d. 1/2

29. What is one of the distinctions between a population parameter and a sample statistic?
a. A population parameter is only based on conceptual measurements, but a sample statistic is based on a combination of real and conceptual measurements.
b. A sample statistic changes each time you try to measure it, but a population parameter remains fixed.
c. A population parameter changes each time you try to measure it, but a sample statistic remains fixed across samples.
d. The true value of a sample statistic can never be known but the true value of a population parameter can be known.

30. Past data has shown that the regression line relating the final exam score and the midterm exam score for students who take statistics from a certain professor is:
final exam = 50 + 0.5 × midterm
One interpretation of the slope is
a. a student who scored 0 on the midterm would be predicted to score 50 on the final exam.
b. a student who scored 0 on the final exam would be predicted to score 50 on the midterm exam.
c. a student who scored 10 points higher than another student on the midterm would be predicted to score 5 points higher than the other student on the final exam.
d. students only receive half as much credit (.5) for a correct answer on the final exam compared to a correct answer on the midterm exam.

31. The length of time a traffic signal stays green (nicknamed the "green time") at a particular intersection follows a normal probability distribution with a mean of 200 seconds and a standard deviation of 10 seconds. Use this information to answer the following questions. Which of the following describes the derivation of the sampling distribution of the sample mean?
a. The means of a large number of samples of size n randomly selected from the population of "green times" are calculated and their
probabilities are plotted.
b. The standard deviations of a large number of samples of size n randomly selected from the population of "green times" are calculated and their probabilities are plotted.
c. The mean and median of a large randomly selected sample of "green times" are calculated. Depending on whether or not the population of "green times" is normally distributed, either the mean or the median is chosen as the best measurement of center.
d. A single sample of sufficiently large size is randomly selected from the population of "green times" and its probability is determined.

32. The Central Limit Theorem states that the sampling distribution of the sample mean is approximately normal under certain conditions. Which of the following is a necessary condition for the Central Limit Theorem to be used?
a. The sample size must be large (e.g., at least 30).
b. The population size must be large (e.g., at least 30).
c. The population from which we are sampling must be normally distributed.
d. The population from which we are sampling must not be normally distributed.

33. The Central Limit Theorem is important in statistics because _____.
a. for any size sample, it says the sampling distribution of the sample mean is approximately normal
b. for any population, it says the sampling distribution of the sample mean is approximately normal, regardless of the sample size
c. for a large n, it says the sampling distribution of the sample mean is approximately normal, regardless of the population
d. for a large n, it says the population is approximately normal

34) A local eat-in pizza restaurant wants to investigate the possibility of starting to deliver pizzas. The owner of the store has determined that home delivery will be successful only if the average time spent on a delivery does not exceed 40 minutes. The owner has randomly selected 17 customers and delivered pizzas to their homes in order to test whether the mean delivery time actually exceeds 40 minutes. Suppose the p-value for the test was found to be .0293. State the correct conclusion.
a. At α = .03, we fail to reject H0.
b. At α = .05, we fail to reject H0.
c. At α = .025, we fail to reject H0.
d. At α = .02, we reject H0.

35) A bottling company produces bottles that hold 12 ounces of liquid. Periodically, the company gets complaints that their bottles are not holding enough liquid. To test this claim, the bottling company randomly samples 64 bottles and finds the average amount of liquid held by the bottles is 11.9155 ounces with a standard deviation of 0.40 ounce. Suppose the p-value of this test is 0.0455. State the proper conclusion.
a. At α = 0.025, reject the null hypothesis.
b. At α = 0.05, accept the null hypothesis.
c. At α = 0.05, reject the null hypothesis.
d. At α = 0.10, fail to reject the null hypothesis.

36) Given H0: µ = 25, Ha: µ ≠ 25, and p = 0.029. Do you reject or fail to reject H0 at the 0.01 level of significance?
a. fail to reject H0
b. reject H0
c. not sufficient information to decide
d. accept both H0 and Ha based on the p-value

37) An insurance company sets up a statistical test with a null hypothesis that the average time for processing a claim is 7 days, and an alternative hypothesis that the average time for processing a claim is greater than 7 days. After completing the statistical test, it is concluded that the average time exceeds 7 days. However, it is eventually learned that the mean process time is really 7
days. What type of error occurred in the statistical test?
a. Type II error
b. Type III error
c. No error occurred in the statistical sense.
d. Type I error

38) In the past, the mean battery life for a certain type of flashlight battery has been 9.4 hours. The manufacturer has introduced a change in the production method and wants to perform a hypothesis test to determine whether the mean battery life has increased as a result. The hypotheses are:
H0: µ = 9.4 hours
HA: µ > 9.4 hours
Explain the result of a Type II error.
a. The manufacturer will decide the mean battery life is greater than 9.4 hours when in fact it is greater than 9.4 hours.
b. The manufacturer will decide the mean battery life is 9.4 hours when in fact it is 9.4 hours.
c. The manufacturer will decide the mean battery life is less than 9.4 hours when in fact it is greater than 9.4 hours.
d. The manufacturer will decide the mean battery life is 9.4 hours when in fact it is greater than 9.4 hours.
e. The manufacturer will decide the mean battery life is greater than 9.4 hours when in fact it is 9.4 hours.

39) Which statement best describes a parameter?
a. A parameter is a level of confidence with an interval about a sample mean or proportion.
b. A parameter is a numerical measure of a population that is almost always unknown and must be estimated.
c. A parameter is a sample size that guarantees the error in estimation is within acceptable limits.
d. A parameter is an unbiased estimate of a statistic found by experimentation or polling.

40) Which one of these variables is a continuous random variable?
a. The time it takes a randomly selected student to complete an exam.
b. The number of tattoos a randomly selected person has.
c. The number of women taller than 68 inches in a random sample of 5 women.
d. The number of correct guesses on a multiple choice test.