Mathematics: Data Analysis and Probability
50 items, 30 minutes

Instructions: Choose the letter of the best answer.

1. What is the mean of the following data set: 2, 4, 6, 8,
10?
(a) 4
(b) 5
(c) 6
(d) 7

2. A bag contains 5 red marbles and 3 blue marbles.
What is the probability of drawing a red marble, then
another red marble, without replacement?
(a) 5/14
(b) 25/64
(c) 1/4
(d) 5/8

3. Which of the following is NOT a measure of central
tendency?
(a) Mean
(b) Median
(c) Mode
(d) Range

4. A normal distribution is symmetrical about its:
(a) Mean
(b) Median
(c) Mode
(d) All of the above

5. The correlation coefficient between two variables
measures:
(a) The strength of their linear relationship
(b) The difference in their means
(c) The ratio of their standard deviations
(d) The probability of one variable given the other

6. A box plot displays:
(a) The mean and standard deviation
(b) The five-number summary
(c) The frequency distribution
(d) The correlation coefficient

7. What is the probability of flipping a coin three times
and getting tails all three times?
(a) 1/2
(b) 1/4
(c) 1/8
(d) 1/16

8. The standard deviation of a data set measures:
(a) The average distance of data points from the
mean
(b) The range of the data
(c) The most frequent data point
(d) The difference between the highest and lowest
data points

9. If two events are mutually exclusive, then:
(a) They cannot occur at the same time
(b) They are independent events
(c) The probability of both occurring is 1
(d) The probability of one event affects the
probability of the other

10. A sample is considered biased if:
(a) It is not representative of the population
(b) It is too small
(c) It contains outliers
(d) It is randomly selected

11. What is the median of the following data set: 3, 7, 2,
9, 5, 4, 8?
(a) 4
(b) 5
(c) 6
(d) 7

12. A scatter plot shows a strong negative linear
relationship between two variables. Which correlation
coefficient is most likely?
(a) -0.9
(b) -0.2
(c) 0.5
(d) 0.9

13. If P(A) = 0.4 and P(B) = 0.6, and A and B are
independent events, what is P(A and B)?
(a) 0.24
(b) 1
(c) 0.2
(d) 0.64

14. A normally distributed data set has a mean of 50 and
a standard deviation of 10. Approximately what
percentage of the data falls between 40 and 60?
(a) 34%
(b) 68%
(c) 95%
(d) 99.7%

15. A random variable X follows a normal distribution
with mean 100 and standard deviation 15. What is P(X
> 130)?
(a) 0.0228
(b) 0.1587
(c) 0.8413
(d) 0.9772

16. Which of the following is NOT a type of sampling
method?
(a) Simple random sampling
(b) Convenience sampling
(c) Normal sampling
(d) Stratified sampling

17. A study found a correlation between ice cream
sales and crime rates. The most plausible explanation
is:
(a) Ice cream causes crime
(b) Crime causes people to buy ice cream
(c) There is a lurking variable that affects both
(d) There is no relationship between the two

18. The interquartile range (IQR) is calculated by:
(a) Subtracting the minimum from the maximum
(b) Subtracting the first quartile from the third
quartile
(c) Dividing the standard deviation by the mean
(d) Squaring the variance

19. A probability distribution lists all possible values of
a random variable and their corresponding:
(a) Frequencies
(b) Probabilities
(c) Z-scores
(d) Percentiles

20. A binomial distribution describes the probability
of:
(a) The number of successes in a fixed number of
trials
(b) The time until the first success
(c) The average value of a continuous variable
(d) The spread of data around the mean

21. What is the probability of rolling a 6 on a standard
die and flipping heads on a fair coin?
(a) 1/12
(b) 1/6
(c) 1/3
(d) 1/2

22. A researcher wants to study the opinions of
students at a university. They randomly select 100
students from the university's directory. This is an
example of:
(a) Cluster sampling
(b) Stratified sampling
(c) Simple random sampling
(d) Convenience sampling

23. If a data set is skewed right, then its mean is
typically:
(a) Less than the median
(b) Equal to the median
(c) Greater than the median
(d) Cannot be determined

24. A researcher surveys customers at a restaurant to
determine their satisfaction levels. This is an example
of what type of study?
(a) Observational study
(b) Experimental study
(c) Simulation
(d) Census

25. The complement of an event A is the event that:
(a) A occurs
(b) A does not occur
(c) A and another event both occur
(d) A or another event occurs

26. What is the mode of the following data set: 10, 12,
15, 12, 10, 13, 12, 14?
(a) 10
(b) 12
(c) 13
(d) There is no mode

27. If the variance of a data set is 25, what is its
standard deviation?
(a) 5
(b) 10
(c) 12.5
(d) 625

28. A z-score indicates:
(a) How many standard deviations a data point is
from the mean
(b) The probability of a certain event occurring
(c) The correlation between two variables
(d) The spread of data around the median

29. A sample space is:
(a) A subset of the population
(b) The set of all possible outcomes of an experiment
(c) The probability of a certain event occurring
(d) The mean of a data set

30. The probability of an event is always between:
(a) -1 and 1
(b) 0 and 1
(c) 0 and 100
(d) -infinity and infinity

31. A researcher wants to know the average height of
high school students in a certain state. They randomly
select 50 high schools in the state and measure the
height of every student in those schools. This is an
example of:
(a) Simple random sampling
(b) Stratified sampling
(c) Cluster sampling
(d) Systematic sampling

32. The line of best fit in a scatter plot is the line that:
(a) Passes through all the data points
(b) Minimizes the sum of the squared residuals
(c) Maximizes the correlation coefficient
(d) Is always horizontal

33. If the probability of event A is 0.3 and the
probability of event B is 0.5, and events A and B are
mutually exclusive, what is the probability of either A
or B occurring?
(a) 0.15
(b) 0.8
(c) 0.2
(d) 0

34. A researcher conducts a hypothesis test and
obtains a p-value of 0.02. This means:
(a) There is a 2% chance the null hypothesis is true
(b) There is a 98% chance the alternative hypothesis
is true
(c) The results are statistically significant at the 0.05
level
(d) The results are not statistically significant

35. A confidence interval is:
(a) A range of values that is likely to contain the true
population parameter
(b) The probability of rejecting the null hypothesis
when it is true
(c) The difference between the sample mean and the
population mean
(d) The standard deviation of the sampling
distribution

36. A researcher wants to compare the effectiveness of
two different teaching methods. They randomly assign
students to one of two groups, one using each method,
and then compare their test scores. This is an example
of:
(a) An observational study
(b) An experimental study
(c) A survey
(d) A simulation

37. The standard normal distribution has a mean of ____
and a standard deviation of ____.
(a) 0, 1
(b) 1, 0
(c) 1, 1
(d) 0, 0

38. A Type I error occurs when:
(a) We reject the null hypothesis when it is true
(b) We fail to reject the null hypothesis when it is
false
(c) We correctly reject the null hypothesis
(d) We correctly fail to reject the null hypothesis

39. The central limit theorem states that:
(a) The sampling distribution of the mean will be
approximately normal, regardless of the shape of the
population distribution, as the sample size increases
(b) The mean of the sampling distribution of the
mean is equal to the population mean
(c) The standard deviation of the sampling
distribution of the mean is equal to the population
standard deviation divided by the square root of the
sample size
(d) All of the above

40. A researcher wants to estimate the proportion of
voters who support a particular candidate. They
survey a random sample of voters and find that 55%
support the candidate. The margin of error for the poll
is ±3%. This means:
(a) The true proportion of voters who support the
candidate is definitely between 52% and 58%
(b) There is a 95% chance that the true proportion of
voters who support the candidate is between 52% and
58%
(c) The sample proportion is 55% ± 3%
(d) The sample size is too small

41. A random variable X follows a binomial
distribution with n = 10 and p = 0.3. What is the
expected value of X?
(a) 3
(b) 7
(c) 0.3
(d) 10

42. A researcher wants to study the relationship
between hours of sleep and exam performance. They
collect data from a group of students and create a
scatter plot. The correlation coefficient is 0.75. This
suggests:
(a) There is a strong positive linear relationship
between hours of sleep and exam performance
(b) There is a weak positive linear relationship
between hours of sleep and exam performance
(c) There is a strong negative linear relationship
between hours of sleep and exam performance
(d) There is no linear relationship between hours of
sleep and exam performance

43. The probability of drawing a heart from a standard
deck of cards, given that the card is red, is:
(a) 1/2
(b) 1/4
(c) 1/52
(d) 13/52

44. If a fair coin is flipped 100 times, the expected
number of tails is:
(a) 25
(b) 50
(c) 75
(d) 100

45. A researcher conducts a survey and finds that 80%
of respondents prefer brand A over brand B. However,
the margin of error is ±10%. This means:
(a) The true proportion of people who prefer brand
A is definitely between 70% and 90%
(b) The sample size is too small to draw any
conclusions
(c) The survey was poorly designed
(d) There is a 95% chance that the true proportion of
people who prefer brand A is between 70% and 90%

46. A random variable X has a normal distribution with
a mean of 50 and a standard deviation of 10. What is
P(40 < X < 60)?
(a) 0.3413
(b) 0.6826
(c) 0.9544
(d) 0.9974

47. A researcher wants to study the effect of a new
drug on blood pressure. They randomly assign
participants to one of two groups: a treatment group
that receives the drug and a control group that receives
a placebo. The researcher then measures the blood
pressure of both groups after a certain period. What is
the dependent variable in this study?
(a) The new drug
(b) Blood pressure
(c) The placebo
(d) The control group

48. A researcher wants to determine if there is a
relationship between income level and level of
education. They collect data on both variables from a
random sample of individuals. What statistical test
would be most appropriate to analyze this data?
(a) T-test
(b) Chi-square test
(c) Correlation coefficient
(d) ANOVA

49. A box contains 5 red balls, 3 blue balls, and 2 green
balls. Two balls are drawn at random without
replacement. What is the probability that both balls are
red?
(a) 1/9
(b) 1/15
(c) 2/9
(d) 5/9

50. A study found that people who eat breakfast are
more likely to be physically active than those who skip
breakfast. However, the study cannot conclude that
eating breakfast causes increased physical activity.
This is because:
(a) Correlation does not equal causation
(b) The sample size was too small
(c) The study was not double-blind
(d) There was no control group

Answer Key

Note: This is a right-minus-wrong test. For each correct
answer, you will earn 1 point. For each incorrect
answer, 0.25 points will be deducted from your total
score.

Score Calculation Formula:
Total Score = number of correct answers - (0.25 × number of incorrect answers)

1. C
2. A
3. D
4. D
5. A
6. B
7. C
8. A
9. A
10. A
11. B
12. A
13. A
14. B
15. A
16. C
17. C
18. B
19. B
20. A
21. A
22. C
23. C
24. A
25. B
26. B
27. A
28. A
29. B
30. B
31. C
32. B
33. B
34. C
35. A
36. B
37. A
38. A
39. D
40. B
41. A
42. A
43. A
44. B
45. D
46. B
47. B
48. B
49. C
50. A
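
The right-minus-wrong scoring rule stated with the answer key can be sketched in Python as follows. This is an illustrative sketch, not part of the original test: the function name total_score, the dictionary layout, and the treatment of blank items (no points earned or deducted) are assumptions, since the note covers only correct and incorrect answers. The assertions after the function spot-check several keyed computational items (1, 2, 7, 11, 15, 26, 46, and 49) using exact fractions and the standard library's statistics module.

```python
from fractions import Fraction
from statistics import NormalDist, mean, median, mode

PENALTY = Fraction(1, 4)  # 0.25 points deducted per incorrect answer


def total_score(key, responses):
    """Return correct - 0.25 * incorrect for one answer sheet.

    key and responses map item numbers to letters (hypothetical layout).
    Assumption: a blank item (missing or None) is neither correct nor
    incorrect, so it earns nothing and costs nothing.
    """
    correct = incorrect = 0
    for item, answer in key.items():
        given = responses.get(item)
        if given is None:
            continue  # blank: no credit, no penalty (assumed)
        if given.strip().upper() == answer:
            correct += 1
        else:
            incorrect += 1
    return float(correct - PENALTY * incorrect)


# Spot checks of keyed computational items.
assert mean([2, 4, 6, 8, 10]) == 6                           # item 1: C
assert Fraction(5, 8) * Fraction(4, 7) == Fraction(5, 14)    # item 2: A
assert Fraction(1, 2) ** 3 == Fraction(1, 8)                 # item 7: C
assert median([3, 7, 2, 9, 5, 4, 8]) == 5                    # item 11: B
assert mode([10, 12, 15, 12, 10, 13, 12, 14]) == 12          # item 26: B
assert Fraction(5, 10) * Fraction(4, 9) == Fraction(2, 9)    # item 49: C

# Item 15: P(X > 130) for X ~ N(100, 15) is the upper tail beyond z = 2.
tail = 1 - NormalDist(mu=100, sigma=15).cdf(130)
assert round(tail, 4) == 0.0228                              # item 15: A

# Item 46: the exact value is about 0.6827; the keyed 0.6826 comes from
# doubling the rounded table entry 0.3413, so compare with a tolerance.
x = NormalDist(mu=50, sigma=10)
assert abs((x.cdf(60) - x.cdf(40)) - 0.6826) < 1e-3          # item 46: B
```

Under these assumptions, a sheet with 40 correct, 8 incorrect, and 2 blank items scores 40 - (0.25 × 8) = 38 points.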