Prob Worksheet 3
Prob Worksheet 3
Worksheet-ch5-9
First, let's list the sample space of two dice throws. Each outcome is an ordered pair (d1
,d2) where d1 is the result of the first die and d2 is the result of the second die. There are
6×6=36 possible outcomes.
o Event E: The sum of the dice is odd. The sum is odd if one die is odd and the
other is even.
E={(1,2),(1,4),(1,6),(2,1),(2,3),(2,5),(3,2),(3,4),(3,6),(4,1),(4,3),(4,5),(5,2),(5,4),(5
,6),(6,1),(6,3),(6,5)}
o Event F: The first die lands on 1. F={(1,1),(1,2),(1,3),(1,4),(1,5),(1,6)}
o Event G: The sum is 7. G={(1,6),(2,5),(3,4),(4,3),(5,2),(6,1)}
o EF (E and F): The intersection of E and F. The sum is odd AND the first die is 1.
EF=E∩F={(1,2),(1,4),(1,6)}
o E U F (E or F): The union of E and F. The sum is odd OR the first die is 1.
E∪F=E+F−(E∩F)
E∪F={(1,1),(1,2),(1,3),(1,4),(1,5),(1,6),(2,1),(2,3),(2,5),(3,2),(3,4),(3,6),(4,1),(4,3)
,(4,5),(5,2),(5,4),(5,6),(6,1),(6,3),(6,5)}
o FG (F and G): The intersection of F and G. The first die is 1 AND the sum is 7.
FG=F∩G={(1,6)}
o EFc (E and not F): The intersection of E and the complement of F. The sum is
odd AND the first die is NOT 1. First, find Fc (complement of F): The first die is
NOT 1. Fc={(d1,d2)∣d1∈{2,3,4,5,6},d2∈{1,2,3,4,5,6}}
EFc=E∩Fc={(2,1),(2,3),(2,5),(3,2),(3,4),(3,6),(4,1),(4,3),(4,5),(5,2),(5,4),(5,6),(6,
1),(6,3),(6,5)}
o EFG (E and F and G): The intersection of E, F, and G. The sum is odd AND the
first die is 1 AND the sum is 7. From EF={(1,2),(1,4),(1,6)}. Now intersect with
G. EFG=(E∩F)∩G={(1,2),(1,4),(1,6)}∩{(1,6),(2,5),(3,4),(4,3),(5,2),(6,1)}
EFG={(1,6)}
5. A class in probability theory consists of 6 men and 4 women. An exam is given and
the students are ranked according to their performance. Assuming that no two
students obtain the same score,
o a) how many different rankings are possible? There are a total of 6+4=10
students. Since no two students obtain the same score, every permutation of these
10 students represents a unique ranking. The number of different rankings is the
number of permutations of 10 distinct items. Number of rankings =
10!=10×9×8×7×6×5×4×3×2×1=3,628,800
o b) If all rankings are considered equally likely, what is the probability that
women receive the top 4 scores? For women to receive the top 4 scores, the first
4 positions in the ranking must be occupied by the 4 women. The number of ways
to arrange the 4 women in the top 4 positions is 4!. The number of ways to
arrange the remaining 6 men in the remaining 6 positions is 6!. Number of
favorable rankings = (ways to arrange 4 women in top 4 positions) × (ways to
arrange 6 men in remaining 6 positions) Number of favorable rankings =
4!×6!=(4×3×2×1)×(6×5×4×3×2×1) =24×720=17,280
The total number of possible rankings is 10! (from part a). Probability that women
receive the top 4 scores = (Number of favorable rankings) / (Total number of
rankings) P(women get top 4 scores)=10!4!×6!=3,628,80017,280=2101 (This can
also be simplified as 10×9×8×7×6!4!×6!=10×9×8×74!=504024=2101)
The event that the system works is the union of these two sets: System works
outcomes = {(1,1,0,0),(1,1,0,1),(1,1,1,0),(1,1,1,1),(0,0,1,1),(0,1,1,1),(1,0,1,1)}
(Note: (1,1,1,1) is common to both and is listed only once.)
o c) Let E be the event that components 1 and 3 are both failed. How many
outcomes are contained in event E? If components 1 and 3 are both failed, then
x1=0 and x3=0. The states of components 2 (x2) and 4 (x4) can be either 0 or 1.
Possible outcomes for x2: 0, 1 (2 options) Possible outcomes for x4: 0, 1 (2
options) The outcomes in event E are of the form (0,x2,0,x4). Number of
outcomes in E = 1×2×1×2=4 The outcomes are:
(0,0,0,0),(0,0,0,1),(0,1,0,0),(0,1,0,1)
7. A group of 5 boys and 10 girls is lined up in random order that is, each of the 15!
permutations is assumed to be equally likely.
There are a total of 5+10=15 people. The total number of permutations is 15!.
o a) What is the probability that the person in the 4th position is a boy? To
calculate this, imagine fixing a boy in the 4th position. Number of choices for the
4th position: 5 (any of the 5 boys). Number of ways to arrange the remaining 14
people in the remaining 14 positions: 14!. Number of favorable permutations =
5×14! Probability =
Total number of permutationsNumber of favorable permutations=15!5×14!
=15×14!5×14!=155=31
o b) What about the person in the 12th position? This is analogous to part (a).
The specific position doesn't change the probability, assuming random order.
Number of choices for the 12th position: 5 (any of the 5 boys). Number of ways
to arrange the remaining 14 people in the remaining 14 positions: 14!. Probability
= 15!5×14!=155=31
o c) What is the probability that a particular boy is in the 3rd position? Let's
pick a specific boy, say Boy A. Number of choices for the 3rd position: 1 (Boy
A). Number of ways to arrange the remaining 14 people in the remaining 14
positions: 14!. Number of favorable permutations = 1×14! Probability = 15!1×14!
=151
8. A committee of size 5 is to be selected from a group of 6 men and 9 women. If the
selection is made randomly, what is the probability that the committee consists of 3
men and 2 women?
Let F be the event that the student is female. Let C be the event that the student is
majoring in computer science.
Table of salaries:
| Wife | Husband < $25,000 | Husband > $25,000 | Total (Husband) | | :--------- | :-----------
------ | :----------------- | :-------------- | | < $25,000 | 212 | 198 | 410 | | > $25,000 | 36 | 54 |
90 | | Total (Wife) | 248 | 252 | 500 |
Let H<25 be the event that the husband earns less than $25,000. Let H>25 be the event
that the husband earns more than $25,000. Let W<25 be the event that the wife earns less
than $25,000. Let W>25 be the event that the wife earns more than $25,000.
o a) The probability that the husband earns less than $25,000; Total couples =
500. Number of couples where husband earns < $25,000 = 212 + 36 = 248.
P(H<25)=Total couplesNumber of H<25 couples=500248=12562=0.496
o b) The probability that the wife earns more than $25,000 given that the
husband earns more than this amount; We need to find P(W>25 | H>25).
Number of couples where husband earns > $25,000 = 198 + 54 = 252. (This is the
new sample space for the conditional probability). Among these 252 couples, the
number where the wife earns > $25,000 is 54.
P(W>25 | H>25)=Number of H>25Number of (W>25 and H>25)=25254 To
simplify 25254: Divide by 2: 12627 Divide by 9: 143 So, P(W>25 | H>25)=143
≈0.2143
o c) The conditional probability that the wife earns more than $25,000 given
that the husband earns less than this amount? We need to find
P(W>25 | H<25). Number of couples where husband earns < $25,000 = 212 + 36
= 248. (This is the new sample space). Among these 248 couples, the number
where the wife earns > $25,000 is 36.
P(W>25 | H<25)=Number of H<25Number of (W>25 and H<25)=24836 To
simplify 24836: Divide by 4: 629 So, P(W>25 | H<25)=629≈0.1452
11. A man is known to speak the truth in 2 out of 3 times. He throws a balanced die and
reported that the result is five. What is the probability that it was actually five?
Let T be the event that the man speaks the truth. P(T)=32. Let F be the event that the man
lies. P(F)=1−P(T)=1−32=31.
Let R5 be the event that the man reported the result is five. Let A5 be the event that the
actual result on the die is five. Let not A5 be the event that the actual result on the die is
not five.
For a balanced die, the probability of rolling a five is P(A5)=61. The probability of not
rolling a five is P(not A5)=1−61=65.
First, let's find P(R5∣A5): This means the die was actually five, and the man reported
five. This happens if he tells the truth. P(R5∣A5)=P(T)=32.
Next, let's find P(R5∣not A5): This means the die was not five, but the man reported five.
This happens if he lies. If he lies when the die is not five, he reports one of the other 5
numbers (1, 2, 3, 4, 6). Assuming he randomly picks a false number, the probability he
reports "five" is 51. So,
P(R5∣not A5)=P(F)×(probability of reporting 5 given he lies and it’s not 5)=31×51=151.
Let's check condition 1: For x∈[0,1], x≥0 and (1−x)≥0. Therefore, 6x(1−x)≥0. For
x outside [0,1], f(x)=0, which is ≥0. So, condition 1 is satisfied.
Let's check condition 2: ∫−∞∞f(x)dx=∫016x(1−x)dx =∫01(6x−6x2)dx
=[3x2−2x3]01 =(3(1)2−2(1)3)−(3(0)2−2(0)3) =(3−2)−0=1 So, condition 2 is
satisfied.
Since both conditions are met, yes, f(x) is a probability density function.
15. The annual rainfall (in inches) in a certain region is normally distributed with μ=40,
σ=4. What is the probability that in 2 of the next 4 years the rainfall will exceed 50
inches? Assume that the rainfalls in different years are independent.
Let R be the annual rainfall. R∼N(μ=40,σ=4). First, find the probability that the rainfall
in a single year will exceed 50 inches. P(R>50)=P(Z>σ50−μ) P(R>50)=P(Z>450−40
)=P(Z>410)=P(Z>2.5)
Now, we are interested in the probability that in 2 of the next 4 years the rainfall will
exceed 50 inches. This is a binomial probability problem, where: n=4 (number of
trials/years) k=2 (number of successes/years with rainfall > 50 inches) p=0.0062
(probability of success in a single trial) The binomial probability formula is P(X=k)=(kn
)pk(1−p)n−k.
P(X=2)=(24)(0.0062)2(1−0.0062)4−2 P(X=2)=2!2!4!(0.0062)2(0.9938)2
P(X=2)=6×(0.00003844)×(0.98763844) P(X=2)≈6×3.844×10−5×0.9876
P(X=2)≈0.000227
16. If each voter is for Proposition A with probability .7, what is the probability that
exactly 7 of 10 voters are for this proposition?
This is a binomial probability problem. n=10 (total number of voters) k=7 (number of
voters for Proposition A) p=0.7 (probability that a single voter is for Proposition A)
1−p=1−0.7=0.3 (probability that a single voter is not for Proposition A)
P(X=7)=120×0.0823543×0.027 P(X=7)≈0.2668
17. Suppose that the average number of accidents occurring weekly on a particular
stretch of a highway equals 3. Calculate the probability that there is at least one
accident this week.
This is a Poisson distribution problem, as it deals with the number of events (accidents)
occurring in a fixed interval of time (weekly) at a given average rate. The average
number of accidents, λ=3 per week. The Poisson probability mass function is
P(X=k)=k!e−λλk.
We want to find the probability of at least one accident this week, i.e., P(X≥1). It's easier
to calculate P(X≥1) as 1−P(X=0). P(X=0)=0!e−330=1e−3×1=e−3
e−3≈0.049787 P(X≥1)=1−e−3=1−0.049787=0.950213
18. It is known that bacteria of a certain kind occur in water at a rate 2 bacteria per
cubic centimeter of water. Assuming that the phenomenon obeys a Poisson
probability law, what is the probability that a sample of two cube centimeters of
water will contain
The rate of bacteria is 2 bacteria per cubic centimeter. We are considering a sample of
two cubic centimeters. So, the average number of bacteria in a 2 cm$^3$ sample,
λ=2 bacteria/cm3×2 cm3=4 bacteria. This is a Poisson distribution with λ=4.
This problem seems to be missing information required to directly calculate the expected
payment. The problem states that "on the average 10 students receive marks of at least
75% in one or more of these classes". However, the scholarship is awarded to students
who achieve at least 75% in all three courses. Without knowing the probability or
average number of students who achieve marks of at least 75% in all three courses, we
cannot determine the expected payment.
If we assume (which is not stated in the problem and would be an inference) that the "10
students who receive marks of at least 75% in one or more classes" is somehow related to
the number of students getting 75% in all three classes, it would be an unjustified leap.
Self-correction: If the problem implicitly means that on average 10 students get at least
75% in each of the classes independently and we are to infer a proportion, that's not
clearly stated. If the "10 students receive marks of at least 75% in one or more of these
classes" meant the intersection (all three), then the answer would be
10 students×$500/student=$5000. But "one or more" implies the union.
20. If the average number of claims handled daily by an insurance company is 5, what
proportion of days have less than 3 claims? What is the probability that there will
be 4 claims in exactly 3 of the next 5 days? Assume that the number of claims on
different days is independent.
This involves two parts, both using the Poisson distribution for the number of claims
daily, and then a binomial distribution for the "days" part.
Part 1: Proportion of days with less than 3 claims Let X be the number of claims on a
given day. X∼Poisson(λ=5). We need to find P(X<3)=P(X=0)+P(X=1)+P(X=2).
P(X=k)=k!e−λλk
P(X=0)=0!e−550=e−5≈0.006738 P(X=1)=1!e−551=5e−5≈5×0.006738=0.03369
P(X=2)=2!e−552=225e−5≈12.5×0.006738=0.084225
Part 2: Probability that there will be 4 claims in exactly 3 of the next 5 days First,
find the probability of a single day having exactly 4 claims (P(X=4)). P(X=4)=4!e−554
=24e−5×625 P(X=4)=24625×0.006738≈244.21125≈0.17546875 Let
p′=P(X=4)≈0.17546875.
Now, we are looking at the next 5 days. This is a binomial experiment: n=5 (number of
days) k=3 (number of days with 4 claims) p′≈0.17546875 (probability of success - 4
claims - on a single day) 1−p′≈1−0.17546875=0.82453125
21. It is known that disks produced by a certain company will be defective with
probability .01 independently of each other. The company sells the disks in packages
of 10 and offers a money-back guarantee that at most 1 of the 10 disks is defective.
What proportion of packages is returned? If someone buys three packages, what is
the probability that exactly one of them will be returned?
Let p=0.01 be the probability that a single disk is defective. Let n=10 be the number of
disks in a package. Let X be the number of defective disks in a package.
X∼Binomial(n=10,p=0.01).
P(X≤1)=P(X=0)+P(X=1)≈0.90438+0.09135=0.99573
P(package returned)=1−P(X≤1)≈1−0.99573=0.00427 So, approximately 0.427% of
packages are returned.
Part 2: If someone buys three packages, what is the probability that exactly one of
them will be returned? This is another binomial problem. Let Y be the number of
returned packages. N=3 (total packages bought) K=1 (exactly one returned package)
Preturn=P(package returned)≈0.00427 (from Part 1)
P(Y=1)=(13)(Preturn)1(1−Preturn)3−1 P(Y=1)=3×(0.00427)×(1−0.00427)2
P(Y=1)=3×0.00427×(0.99573)2 P(Y=1)≈3×0.00427×0.99147
P(Y=1)≈0.01281×0.99147≈0.01269
So, the probability that exactly one of the three packages will be returned is
approximately 0.01269.
22. Women's shoes are manufactured in sizes 2, 3, 4, ..., 8. Size 5 is suitable for a foot of
length ranging from 9.25 inches to 9.5 inches. If length of women's foot is normally
distributed with mean 9.4 inches and standard deviation 0.25 inches, how many
pairs of size 5 are required out of every 10,000 pairs manufactured?
Let L be the length of a woman's foot. L∼N(μ=9.4,σ=0.25). Size 5 is suitable for lengths
between 9.25 and 9.5 inches. We need to find P(9.25<L<9.5).
P(9.25<L<9.5)=0.6554−0.2743=0.3811
This is the proportion of women who would fit size 5. Out of every 10,000 pairs
manufactured, the number of pairs required for size 5 would be: Number of pairs =
0.3811×10,000=3811 pairs.
o b) Find the probability that the mean of a sample of size 36 will be within 10
units of the population mean, that is, between 118 and 138. We need to find
P(118<Xˉ<138). Since the sample size n=36 is large (typically n≥30), by the
Central Limit Theorem, the sampling distribution of the sample mean Xˉ can be
approximated by a normal distribution, regardless of the shape of the population
distribution. So, Xˉ∼N(μXˉ=128,σXˉ=11/3).
P(118<Xˉ<138)≈0.9968−0.0032=0.9936
24. Suppose the mean length of time that a caller is placed on hold when telephoning a
customer service center is 23.8 seconds, with standard deviation 4.6 seconds. Find
the probability that the mean length of time on hold in a sample of 1,200 calls will be
within 0.5 second of the population mean.
Population mean μ=23.8 seconds. Population standard deviation σ=4.6 seconds. Sample
size n=1200 calls.
We need to find the probability that the sample mean Xˉ is within 0.5 seconds of the
population mean. This means: 23.8−0.5<Xˉ<23.8+0.5 23.3<Xˉ<24.3
First, calculate the mean and standard deviation of the sample means: E(Xˉ)=μ=23.8 σXˉ
Since n=1200 is a large sample, by the Central Limit Theorem, the distribution of Xˉ is
approximately normal. Now, convert the values to Z-scores: For Xˉ=23.3: Z1
=0.1327923.3−23.8=0.13279−0.5≈−3.765 For Xˉ=24.3: Z2=0.1327924.3−23.8
=0.132790.5≈3.765
We need to find P(−3.765<Z<3.765). P(−3.765<Z<3.765)=P(Z<3.765)−P(Z<−3.765)
Using a Z-table or calculator, P(Z<3.765)≈0.99992
P(Z<−3.765)=1−P(Z<3.765)≈1−0.99992=0.00008
25. Of a large group of men, 5% are less than 60 inches in height and 40% are between
60 & 65 inches. Assuming a normal distribution, find the mean and standard
deviation of heights.
Given information:
1. P(H<60)=0.05
2. P(60<H<65)=0.40
From (1), convert to Z-score: P(Z<σ60−μ)=0.05. Looking up 0.05 in the Z-table (or 1-
0.05 = 0.95 and finding -Z value), the Z-score for 0.05 is approximately -1.645. So,
σ60−μ=−1.645 (Equation 1) 60−μ=−1.645σ⇒μ=60+1.645σ
3. μ=60+1.645σ
4. μ=65+0.125σ
So, the mean height is approximately 65.41 inches and the standard deviation is
approximately 3.29 inches.
26. Let X be normally distributed with mean 3 and variance 9 and Y=1−2X
P(−1<X<2)≈0.3707−0.0918=0.2789.
So, P(X>3 AND Y<2) means P(X>3 AND X>−0.5). If X>3, it automatically
satisfies X>−0.5. Therefore, P(X>3 AND Y<2)=P(X>3).
Given: Population standard deviation σ=0.1 mm Sample size n=50 Sample mean xˉ=5.11
mm Confidence level = 95%
Since the population standard deviation σ is known and the sample size n≥30, we use the
Z-distribution to construct the confidence interval. The formula for a confidence interval
For a 95% confidence level, α=1−0.95=0.05. So, α/2=0.025. Zα/2=Z0.025. Look up the
Z-score that leaves 0.025 in the upper tail (or 0.975 in the lower tail). This is Z0.025
=1.96.
Now, calculate the margin of error: Margin of Error (ME) = Zα/2n σ=1.96×50
28. In a random sample of 250 employed people, 61 said that they bring work home
with them at least occasionally.
Sample size n=250. Number of successes (people who bring work home) x=61.
o a. Give a point estimate of the proportion of all employed people who bring
work home with them at least occasionally. The point estimate of the
population proportion (p) is the sample proportion (p^). p^=nx=25061=0.244
o b. Construct a 99% confidence interval for that proportion. For a confidence
interval for a proportion, since np^=61≥10 and n(1−p^)=250−61=189≥10, we can
use the normal approximation. The formula for a confidence interval for a
=0.000737856 ≈0.02716
The 99% confidence interval for the proportion of all employed people who bring
work home is (0.174,0.314) or (17.4%,31.4%).
29. The amount of a particular biochemical substance related to bone breakdown was
measured in 30 healthy women. The sample mean and standard deviation were 3.3
nanograms per milliliter (ng/mL) and 1.4 ng/mL. Construct an 80% confidence
interval for the mean level of this substance in all healthy women.
Given: Sample size n=30 Sample mean xˉ=3.3 ng/mL Sample standard deviation s=1.4
ng/mL Confidence level = 80%
Since the population standard deviation is unknown and the sample size is n=30 (which is
generally considered large enough for the Central Limit Theorem to apply, but also just at
the threshold where t-distribution is preferred when σ is unknown), we use the t-
distribution for constructing the confidence interval. Degrees of freedom
df=n−1=30−1=29.
For an 80% confidence level, α=1−0.80=0.20. So, α/2=0.10. We need to find tα/2,df
=t0.10,29. From a t-distribution table, t0.10,29≈1.311.
The formula for a confidence interval for the population mean when σ is unknown is:
xˉ±tα/2,dfn s
Calculate the standard error of the mean: SExˉ=n s=30 1.4 30 ≈5.477 SExˉ
=5.4771.4≈0.2556
The 80% confidence interval for the mean level of this substance in all healthy women is
(2.9649 ng/mL,3.6351 ng/mL).
30. A physical therapist studying muscular strength assumed muscle strength scores are
normally distributed with σ=12. A sample of 15 individuals demonstrates a mean
muscular strength score of 84.3. Calculate a 95% confidence interval for μ.
Interpret the meaning of this confidence interval.
Given: Population standard deviation σ=12 Sample size n=15 Sample mean xˉ=84.3
Confidence level = 95%
Since the population standard deviation σ is known, we use the Z-distribution to construct
the confidence interval, even though the sample size is small (n<30). The problem also
states that scores are normally distributed. The formula for a confidence interval for the
Calculate the standard error of the mean: SExˉ=n σ=15 12 15 ≈3.873 SExˉ
=3.87312≈3.098
Interpretation of the confidence interval: We are 95% confident that the true average
muscular strength score for all individuals in the population falls between 78.228 and
90.372. This means that if we were to take many samples and construct a 95% confidence
interval for each, about 95% of these intervals would contain the true population mean.
31. Last year the government made a claim that the average income of the American
people was $33,950. However, a sample of 50 people taken recently showed an
average income of $34,076 with a population standard deviation of $324. Is the
government's estimate too low? Conduct a significance test to see if the true mean is
more than the reported average. Use an alpha =0.01.
This is a hypothesis testing problem for a population mean, with known population
standard deviation.
Given: Claimed population mean (from government) μ0=$33,950 Sample size n=50
Sample mean xˉ=$34,076 Population standard deviation σ=$324 Significance level
α=0.01
1. Formulate Hypotheses: Null Hypothesis H0: The true mean income is equal to or less
than the government's claim. H0:μ≤33,950 Alternative Hypothesis HA: The true mean
income is more than the government's claim. (This addresses "Is the government's
estimate too low?") HA:μ>33,950 This is a one-tailed (right-tailed) test.
2. Choose the Test Statistic: Since σ is known and n≥30, we use the Z-test for the mean.
4. Determine the Critical Value: For a one-tailed (right-tailed) test with α=0.01, we
need to find Zα=Z0.01. Look up the Z-score that leaves 0.01 in the upper tail (or 0.99 in
the lower tail). Z0.01≈2.33.
5. Make a Decision: Compare the calculated Z-statistic to the critical value. Calculated
Z=2.749 Critical value Zcritical=2.33 Since 2.749>2.33, the calculated Z-statistic falls in
the rejection region.
6. Conclusion: Reject the null hypothesis (H0). There is sufficient evidence at the 0.01
significance level to conclude that the true mean income of the American people is
significantly more than $33,950, suggesting that the government's estimate was too low.
32. A company sells exercise clothing and equipment on the Internet. To design the
clothing, the company collects data on the physical characteristics of different types
of customers. It takes a sample of 24 male runners found a mean weight of 61.79
kilograms. Assume that the population standard deviation of weight is σ=4.5
kilograms.
Given: Sample size n=24 Sample mean xˉ=61.79 kg Population standard deviation σ=4.5
kg
o a) Calculate a 95% confidence interval for the mean weight of all such
runners: Since σ is known, we use the Z-distribution. (The sample size is small,
but if the population is normally distributed, which is often assumed in such
problems, the Z-interval is appropriate). Confidence level = 95%. So α=0.05,
α/2=0.025. Zα/2=Z0.025=1.96.
The 95% confidence interval for the mean weight of all such runners is
(59.99 kg,63.59 kg).
For a two-tailed test, if the hypothesized mean μ0 falls within the confidence
interval, we do not reject H0. If it falls outside the confidence interval, we reject
H0. Our 95% confidence interval is (59.99 kg,63.59 kg). The hypothesized mean
μ0=61.3 kg. Since 61.3 kg is within the interval (59.99,63.59), we do not reject
H0 at the 5% significance level.
2. Choose the Test Statistic: Z-test, since σ is known. Test statistic Z=σ/n
xˉ−μ0
3. Calculate the Test Statistic: Z=4.5/24 61.79−61.3 Z=4.5/4.8990.49
=0.91850.49≈0.5335
4. Determine the Critical Values: For a two-tailed test with α=0.05, the critical
values are Zα/2=±Z0.025. Z0.025=1.96. So, the critical values are ±1.96.
6. Conclusion: Do not reject the null hypothesis (H0). There is not enough
evidence at the 0.05 significance level to conclude that the true mean weight of
male runners is different from 61.3 kg. This confirms the answer from part b.
33. Use the following data and fit a linear regression line to predict the price of used
car. The data is the about Asking price of some used Toyota Corollas advertised on
newspaper on October 2006.
Data: | Model Year (X) | Asking Price (Y) | | :------------- | :--------------- | | 2004 | $10950
| | 2003 | $9400 | | 2001 | $8990 | | 1998 | $5800 | | 1997 | $5850 | | 1994 | $3800 | | 1989 |
$1500 |
Let's define a new variable for 'Age of the car' (X) relative to 2006 (or the newest model
year). It's easier to work with smaller numbers for X, so let X be "Years since 1989".
Alternatively, let's use "Age" = 2006 - Model Year. | Model Year | Age (X) | Price (Y) | |
:--------- | :------ | :-------- | | 2004 | 2 | 10950 | | 2003 | 3 | 9400 | | 2001 | 5 | 8990 | | 1998 | 8
| 5800 | | 1997 | 9 | 5850 | | 1994 | 12 | 3800 | | 1989 | 17 | 1500 |
∑X=2+3+5+8+9+12+17=56 ∑Y=10950+9400+8990+5800+5850+3800+1500=46290
∑X2=22+32+52+82+92+122+172=4+9+25+64+81+144+289=616
∑Y2=109502+94002+89902+58002+58502+38002+15002=119902500+88360000+808
20100+33640000+34222500+14440000+2250000=373635100
∑XY=(2×10950)+(3×9400)+(5×8990)+(8×5800)+(9×5850)+(12×3800)+(17×1500)
=21900+28200+44950+46400+52650+45600+25500=265200
a) Write the equation of the fitted line The linear regression equation is Y^=a+bX,
where: b=n(∑X2)−(∑X)2n(∑XY)−(∑X)(∑Y) a=n∑Y−b(∑X)
b=7(616)−(56)27(265200)−(56)(46290) b=4312−31361856400−2592240=1176−735840
≈−625.714
a=746290−(−625.714)(56) a=746290+35040=781330≈11618.57
The equation of the fitted line is Y^=11618.57−625.714X, where X is the age of the car
(in years) and Y is the predicted asking price.
b) What is the slope of the line? What does it tell you? The slope of the line is
b=−625.714. This means that for every additional year of age, the asking price of a used
Toyota Corolla is predicted to decrease by approximately $625.71.
c) What is the correlation coefficient value? The correlation coefficient r is given by:
r=[n(∑X2)−(∑X)2][n(∑Y2)−(∑Y)2] n(∑XY)−(∑X)(∑Y)
d) What proportion of the variability of the asking price explained by the age of the
car? The proportion of variability explained is given by the coefficient of determination,
R2, which is the square of the correlation coefficient (r2). R2=r2=(−0.9872)2≈0.9746
So, approximately 97.46% of the variability in the asking price can be explained by the
age of the car. This indicates a very strong linear relationship.
34. It has been observed that the amount of soil eroded (in Kg) per day (Y) is
determined by the wind velocity (X) in that day (Km/Hr) Data obtained from the
ministry of agriculture for a certain area gave the following regression line:
Y=0.25+0.1123 X
The given regression line is Y^=0.25+0.1123X. Here, Y^ is the predicted amount of soil
eroded (in Kg) and X is the wind velocity (in Km/Hr).