Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
2K views151 pages

Edexcel AS Math S1 Practice Book

The document contains a series of statistics questions and exercises from the Edexcel curriculum, covering topics such as data representation, probability, correlation, regression, and distributions. It includes various problems related to real-world data analysis, requiring calculations of mean, variance, standard deviation, and probability. The document serves as a resource for students to practice and enhance their statistical skills.

Uploaded by

Nisreen ElZahar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2K views151 pages

Edexcel AS Math S1 Practice Book

The document contains a series of statistics questions and exercises from the Edexcel curriculum, covering topics such as data representation, probability, correlation, regression, and distributions. It includes various problems related to real-world data analysis, requiring calculations of mean, variance, standard deviation, and probability. The document serves as a resource for students to practice and enhance their statistical skills.

Uploaded by

Nisreen ElZahar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 151

“Knowing the path is good but not enough, walking the

path with determination leads to destiny”

NA
TU
RAAS Edexcel
L
Statistics
SC (WST01)
IE
ClassifiedN Questions
CE
SO
2019 – 2023T LU
IO
N

Naturalsciencesolution – Accelerating academic skills


Page 1 of 151
PAGE
S. NO TOPICS
NUMBER
1.1 REPRESENTATION AND ANALYSING THE DATA 3 - 38
1.2 PROBABILITY 39 – 80
1.3 CORRELATION AND REGRESSION 81 – 103
104 – 127
NA
1.4
1.5
DISCRETE RANDOM VARIABLE
THE NORMAL DISTRIBUTION 128 - 149
1.6
1.7
TU
NORMAL DISTRIBUTION FUNCTION (TABLE)
PERCENTAGE POINTS OF THE NORMAL
150
151
RA DISTRIBUTION

L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 2 of 151
1.1 Representation and analysing the data:

Reference Notes:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 3 of 151
Reference Notes:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 4 of 151
Q1.

Jim records the length, l mm, of 81 salmon. The data are coded using x = l – 600 and the following
summary statistics are obtained.

(a) Find the mean length of these salmon.


(3)
(b) Find the variance of the lengths of these salmon.
(2)
The weight, w grams, of each of the 81 salmon is recorded to the nearest gram.
The recorded results for the 81 salmon are summarised in the box plot below.

NA
TU
RA
(c) Find the maximum number of salmon that have weights in the interval

L 4600 < w ≤ 7700


(1)
SC
Raj says that the box plot is incorrect as Jim has not included outliers.

IE
For these data an outlier is defined as a value that is more than
1.5 × IQR above the upper quartile or 1.5 × IQR below the lower quartile
(d) Show that there are no outliers. NC
E (3)
(Total for question = 9 marks)

SO (Q03 WST01/01, June 2023)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 5 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 6 of 151
Q2.

Morgan is investigating the body length, b centimetres, of squirrels.


A random sample of 8 squirrels is taken and the data for each squirrel is coded using

The results for the coded data are summarised below

(a) Find the mean of b


(3)
(b) Find the standard deviation of b
(3)
NA
A 9th squirrel is added to the sample. Given that for all 9 squirrels
(c) find
TU
(i) the body length of the 9th squirrel,
RA
(ii) the standard deviation of x for all 9 squirrels.
(2)

L (2)
(Total for question = 10 marks)

SC (Q03 WST01/01, Oct 2022)

IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 7 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 8 of 151
Q3.

The histogram shows the times taken, t minutes, by each of 100 people to swim 500 metres.

NA
TU
RA
(a) Use the histogram to complete the frequency table for the times taken by the 100 people to swim 500
metres.
L
SC
IE (1)

NC
(b) Estimate the number of people who took less than 16 minutes to swim 500 metres.
(2)

E
(c) Find an estimate for the mean time taken to swim 500 metres.
(2)
Given that ∑ ft2 = 41 033 SO
LU
(d) find an estimate for the standard deviation of the times taken to swim 500 metres.
(2)
Given that Q3 = 23 TI
ON
(e) use linear interpolation to estimate the interquartile range of the times taken to swim 500 metres.
(3)
(Total for question = 10 marks)

(Q01 WST01/01, Jan 2023)

Naturalsciencesolution – Accelerating academic skills


Page 9 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 10 of 151
Q4.

The stem lengths of a sample of 120 tulips are recorded in the grouped frequency table below.

NA
A histogram is drawn to represent these data.
The area of the bar representing the class is 16.5 cm2

TU
(a) Calculate the exact area of the bar representing the class.
(2)
RA
The height of the tallest bar in the histogram is 10 cm.

L
(b) Find the exact height of the second tallest bar.
(3)
Q1 for these data is 45 cm. SC
IE
(c) Use linear interpolation to find an estimate for
(i) Q2
(ii) the interquartile range.
NC (4)
One measure of skewness is given by
E
(d) By calculating this measure, describe the skewness of these data.
SO
LU (2)
(Total for question = 11 marks)
TI (Q01 WST01/01, Oct 2022)

ON

Naturalsciencesolution – Accelerating academic skills


Page 11 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 12 of 151
Q5.

The histogram shows the distances, in km, that 274 people travel to work.

NA
TU
RA
L
SC
Given that 60 of these people travel between 10 km and 20 km to work, estimate

IE
(a) the number of people who travel between 22 km and 45 km to work,
(3)
NC
(b) the median distance travelled to work by these 274 people,

(c) the mean distance travelled to work by these 274 people.


E (2)

SO (3)
(Total for question = 8 marks)
LU (Q01 WST01/01, June 2023)

TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 13 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 14 of 151
Q6.

A disc of radius 1 cm is rolled onto a horizontal grid of rectangles so that the disc is equally likely to land
anywhere on the grid. Each rectangle is 5 cm long and 3 cm wide. There are no gaps between the
rectangles and the grid is sufficiently large so that no discs roll off the grid.
If the disc lands inside a rectangle without covering any part of the edges of the rectangle then a prize is
won.
By considering the possible positions for the centre of the disc,

(a) show that the probability of winning a prize on any particular roll is
(3)
A group of 15 students each roll the disc onto the grid twenty times and record the number of times, x, that
each student wins a prize. Their results are summarised as follows
NA
TU
(b) Find the standard deviation of the number of prizes won per student.
(2)

RA
A second group of 12 students each roll the disc onto the grid twenty times and the mean number of prizes
won per student is 3.5 with a standard deviation of 2
L
(c) Find the mean and standard deviation of the number of prizes won per student for the whole group of
27 students.
SC (7)

IE
The 27 students also recorded the number of times that the disc covered a corner of a rectangle and
estimated the probability to be 0.2216 (to 4 decimal places).

NC
(d) Explain how this probability could be used to find an estimate for the value of π and state the value of
your estimate.

E (3)
(Total for question = 15 marks)

SO (Q06 WST01/01, Jan 2021)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 15 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 16 of 151
Q7.

The time taken to complete a puzzle, in minutes, is recorded for each person in a club. The times are
summarised in a grouped frequency distribution and represented by a histogram.
One of the class intervals has a frequency of 20 and is shown by a bar of width 1.5 cm and height 12 cm on
the histogram. The total area under the histogram is 94.5 cm2
Find the number of people in the club.

(Total for question = 3 marks)

(Q02 WST11/01, Specimen papers )

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 17 of 151
Q8.

A researcher recorded the time, t minutes, spent using a mobile phone during a particular afternoon, for
each child in a club.

The researcher coded the data using and the results are summarised in the table below.

NA
TU
(a) Write down the value of a and the value of b.
(1)

RA
(b) Calculate an estimate of the mean of v.
(1)
L
(c) Calculate an estimate of the standard deviation of v.

SC
(d) Use linear interpolation to estimate the median of v.
(2)

IE (2)

NC
(e) Hence describe the skewness of the distribution. Give a reason for your answer.
(2)

E
(f) Calculate estimates of the mean and the standard deviation of the time spent using a mobile phone
during the afternoon by the children in this club.

SO (4)
(Total for question = 12 marks)
LU
(Q04 WST11/01, Specimen papers )

TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 18 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 19 of 151
Q9.

A random sample of 100 carrots is taken from a farm and their lengths, L cm, recorded.
The data are summarised in the following table.

NA
A histogram is drawn to represent these data.
The bar representing the class 5 ≤ L < 8 is 1.5 cm wide and 1 cm high.

TU
(a) Find the width and height of the bar representing the class 15 ≤ L < 20
(3)
RA
(b) Use linear interpolation to estimate the median length of these carrots.

(c) Estimate
L (2)

(i) the mean length of these carrots,SC (2)


IE
(ii) the standard deviation of the lengths of these carrots.
(3)

NC
A supermarket will only buy carrots with length between 9 cm and 22 cm.
(d) Estimate the proportion of carrots from the farm that the supermarket will buy.
E (2)

SO
Any carrots that the supermarket does not buy are sold as animal feed.
The farm makes a profit of 2.2 pence on each carrot sold to the supermarket, a profit of 0.8 pence on each
LU
carrot longer than 22 cm and a loss of 1.2 pence on each carrot shorter than 9 cm.
(e) Find an estimate of the mean profit per carrot made by the farm.
TI (2)
(Total for question = 14 marks)
ON
(Q03 WST01/01, June 2021)

Naturalsciencesolution – Accelerating academic skills


Page 20 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 21 of 151
Q10.

The company Seafield requires contractors to record the number of hours they work each week. A random
sample of 38 weeks is taken and the number of hours worked per week by contractor Kiana is summarised
in the stem and leaf diagram below.

The quartiles for this distribution are summarised in the table below.

NA
TU
(a) Find the values of w, x and y
RA (3)

L
Kiana is looking for outliers in the data. She decides to classify as outliers any observations greater than
Q3 + 1.0 × (Q3 - Q1)
SC
(b) Showing your working clearly, identify any outliers that Kiana finds.

IE
(c) Draw a box plot for these data in the space provided on the grid opposite.
(2)

(d) Use the formula


NC (3)

E
SO
to find the skewness of these data. Give your answer to 2 significant figures.

LU
Kiana's new employer, Landacre, wishes to know the average number of hours per week she worked
(2)

during her employment at Seafield to help calculate the cost of employing her. TI
(e) Explain why Landacre might prefer to know Kiana's mean, rather than median, number of hours worked
per week. ON (1)

(Total for question = 11 marks)

(Q01 WST01/01, June 2022)

Naturalsciencesolution – Accelerating academic skills


Page 22 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 23 of 151
Q11.

Gill buys a bag of logs to use in her stove. The lengths, l cm, of the 88 logs in the bag are summarised in
the table below.

A histogram is drawn to represent these data.


NA
The bar representing logs with length has a width of 1.5 cm and a height of 4 cm.

TU
(a) Calculate the width and height of the bar representing log lengths of
(3)

RA
(b) Use linear interpolation to estimate the median of l
(2)
L
The maximum length of log Gill can use in her stove is 26 cm.

SC
Gill estimates, using linear interpolation, that x logs from the bag will fit into her stove.
(c) Show that x = 62
IE (1)
Gill randomly selects 4 logs from the bag.
NC
(d) Using x = 62 , find the probability that all 4 logs will fit into her stove.

E
The weights, W grams, of the logs in the bag are coded using y = 0.5w - 255 and summarised by
(2)

SO
n = 88 ∑ y = 924 ∑ y2 = 12 862
(e) Calculate
(i) the mean of W
LU
(ii) the variance of W TI (3)

ON (3)

(Total for question = 14 marks)

(Q03 WST01/01, June 2022)

Naturalsciencesolution – Accelerating academic skills


Page 24 of 151
Q12.

The production cost, £c million, of a film and the total ticket sales, £t million, earned by the film are recorded
for a sample of 40 films.
Some summary statistics are given below.

(a) Find the exact value of Stt and the exact value of Sct
(3)
(b) Calculate the value of the product moment correlation coefficient for these data.
(2)
(c) Give an interpretation of your answer to part (b)

NA
(d) Show that the equation of the linear regression line of t on c can be written as
(1)

TU t = –5.84 + 0.976c
where the values of the intercept and gradient are given to 3 significant figures.
RA
(e) Find the expected total ticket sales for a film with a production cost of £90 million.
(3)

L (2)
Using the regression line in part (d)
SC
(f) find the range of values of the production cost of a film for which the total ticket sales are less than 80%
of its production cost.
IE (2)
NC
E (Total for question = 13 marks)

SO (Q02 WST01/01, Oct 2022)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 25 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 26 of 151
Q13.

The stem and leaf diagram below shows the ages (in years) of the residents in a care home.

(a) Find the median age of the residents.


(1)

NA
(b) Find the interquartile range (IQR) of the ages of the residents.
(2)
TU
An outlier is defined as a value that is either

RA
more than 1.5 × (IQR) below the lower quartile or
more than 1.5 × (IQR) above the upper quartile.

L
(c) Determine any outliers in these data. Show clearly any calculations that you use.
(3)
SC
(d) On the grid below, draw a box plot to summarise these data.

IE (3)

NC
E
SO
LU (Total for question = 9 marks)

(Q02 WST01/01, Jan 2021)


TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 27 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 28 of 151
Q14.

The weights, to the nearest kilogram, of a sample of 33 red kangaroos taken in December are summarised
in the stem and leaf diagram below.

(a) Find
NA
TU
(i) the value of the median

RA
(ii) the value of Q1 and the value of Q3
for the weights of these red kangaroos.
(3)
L
For these data an outlier is defined as a value that is
greater than Q3 + 1.5 × (Q3 − Q1)
SC
or smaller than Q1 − 1.5 × (Q3 − Q1)

IE
(b) Show that there are 2 outliers for these data.
(3)
NC
Figure 1 below shows a box plot for the weights of the same 33 red kangaroos taken in February, earlier in
the year.
E
(c) In the space on Figure 1, draw a box plot to represent the weights of these red kangaroos in December.

SO
(d) Compare the distribution of the weights of red kangaroos taken in February with the distribution of the
(4)

LU
weights of red kangaroos taken in December of the same year. You should interpret your comparisons in
the context of the question.

TI (3)

ON

(Total for question = 13 marks)

(Q02 WST01/01, Oct 2023)

Naturalsciencesolution – Accelerating academic skills


Page 29 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 30 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 31 of 151
Q15.

The stem and leaf diagram shows the ages of the 35 male passengers on a cruise.

(a) Find the median age of the male passengers.


(1)
NA
(b) Show that the interquartile range (IQR) of these ages is 16

TU
An outlier is defined as a value that is more than
(2)

RA
1.5 × IQR above the upper quartile
or
L
1.5 × IQR below the lower quartile

SC
(c) Show that there are 3 outliers amongst these ages.
(3)
IE
(d) On the grid in Figure 1 below, draw a box plot for the ages of the male passengers on the cruise.

NC
Figure 1 below also shows a box plot for the ages of the female passengers on the cruise.
(4)

E
(e) Comment on any difference in the distributions of ages of male and female passengers on the cruise.

SO
State the values of any statistics you have used to support your comment.
(1)
Anja, along with her 2 daughters and a granddaughter, now join the cruise.
Anja's granddaughter is younger than both of Anja's daughters.
Anja had her 23rd birthday on the day her eldest daughter was born.
LU
TI
When their 4 ages are included with the other female passengers on the cruise, the box plot does not
change.
(f) State, giving reasons, what you can say about
(i) the granddaughter's age
ON
(ii) Anja's age.
(3)

Naturalsciencesolution – Accelerating academic skills


Page 32 of 151
NA
TU
RA (Total for question = 14 marks)

L
SC (Q03 WST01/01, Oct 2021)

IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 33 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 34 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 35 of 151
Q16.

The stem and leaf diagram shows the number of deliveries made by Pat each day for 24 days

where a, b and c are positive integers with a < b < c


An outlier is defined as any value greater than 1.5 × interquartile range above the upper quartile.

NA
Given that there is only one outlier for these data,
(a) show that c = 9
TU (3)

RA
The number of deliveries made by Pat each day is represented by d
The data in the stem and leaf diagram are coded using
L x = d – 125

SC
and the following summary statistics are obtained

(b) Find the mean number of deliveries. IE


NC
(c) Find the standard deviation of the number of deliveries.
(3)

E
One of these 24 days is selected at random. The random variable D represents the number of deliveries
(2)

made by Pat on this day.


The random variable X = D – 125
SO
(d) Find P(D > 118 |X < 0) LU
TI (2)

ON
(Total for question = 10 marks)

(Q03 WST01/01, Jan 2022)

Naturalsciencesolution – Accelerating academic skills


Page 36 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 37 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 38 of 151
1.2 Probability:

Reference Notes:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 39 of 151
Q1.

Sally plays a game in which she can either win or lose.


A turn consists of up to 3 games. On each turn Sally plays the game up to 3 times. If she wins the first 2
games or loses the first 2 games, then she will not play the 3rd game.

• The probability that Sally wins the first game in a turn is 0.7

• If Sally wins a game the probability that she wins the next game is 0.6

• If Sally loses a game the probability that she wins the next game is 0.2
(a) Use this information to complete the tree diagram below.
(3)
NA
(b) Find the probability that Sally wins the first 2 games in a turn.

TU
(c) Find the probability that Sally wins exactly 2 games in a turn.
(2)

RA
Given that Sally wins 2 games in a turn,
(2)

L
(d) find the probability that she won the first 2 games.

SC
Given that Sally won the first game in a turn,
(2)

(e) find the probability that she won 2 games. IE (2)


NC
E
SO
LU
TI
ON

(Total for question = 11 marks)

(Q01 WST01/01, Oct 2023)

Naturalsciencesolution – Accelerating academic skills


Page 40 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 41 of 151
Q2.

The Venn diagram shows the events A, B, C and D, where p, q, r and s are probabilities.

(a) Write down the value of


(i) P(A)
(ii) P(AΙB)
NA
(iii) P(AΙC)
(3)

TU
RA
(b) find the exact value of q and the exact value of r
(6)
L
(c) find the exact value of s
SC (2)
IE
NC (Total for question = 11 marks)

E (Q06 WST01/01, Oct 2022)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 42 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 43 of 151
Q3.

Three events A, B and C are such that


P(A ) = 0.1 P(B|A ) = 0.3 P(A ∪ B ) = 0.25 P(C ) = 0.5
Given that A and C are mutually exclusive
(a) find P(A ∪ C )
(1)
(b) Show that P(B ) = 0.18
(3)
Given also that B and C are independent,
(c) draw a Venn diagram to represent the events A, B and C and the probabilities associated with each
region.
NA (5)

TU (Total for question = 9 marks)


RA
L (Q06 WST01/01, June 2023)

SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 44 of 151
Q4.

(i) Bob shops at a market each week. The event that


Bob buys carrots is denoted by C
Bob buys onions is denoted by O
At each visit, Bob may buy neither, or one, or both of these items. The probability that
Bob buys carrots is 0.65
Bob does not buy onions is 0.3
Bob buys onions but not carrots is 0.15
The Venn diagram below represents the events C and O

NA
TU
RA
where w, x, y and z are probabilities.

L
(a) Find the value of w, the value of x, the value of y and the value of z
(4)
For one visit to the market, SC
IE
(b) find the probability that Bob buys either carrots or onions but not both.
(1)
(c) Show that the events C and O are not independent.
NC (2)
E
(ii) F, G and H are 3 events. F and H are mutually exclusive. F and G are independent.
Given that
SO
(a) find P(F ∪ H) LU
(b) find P(G)
TI (1)

ON (3)
(c) find P(F ∩ G)
(1)

(Total for question = 12 marks)

(Q03 WST01/01, Oct 2023)

Naturalsciencesolution – Accelerating academic skills


Page 45 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 46 of 151
Q5.

Three events A, B and C are such that

Given that A and C are mutually exclusive find


(a) P(A ∪ C)
(1)
Given that A and B are independent

(b) show that


(4)

NA
(c) Find P(A | B)
(1)
TU
Given that P(C′ ∩ B′) = 0.3

RA
(d) draw a Venn diagram to represent the events A, B and C
(5)

L (Total for question = 11 marks)


SC
IE (Q06 WST11/01, Specimen papers )

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 47 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 48 of 151
Q6.

There are 7 red counters, 3 blue counters and 2 yellow counters in a bag. Gina selects a counter at random
from the bag and keeps it. If the counter is yellow she does not select any more counters. If the counter is
not yellow she randomly selects a second counter from the bag.
(a) Complete the tree diagram.

NA
TU
RA
L
SC
IE (2)
Given that Gina has selected a yellow counter, NC
(b) find the probability that she has 2 counters.
E (3)

SO (Total for question = 5 marks)


LU
TI (Q01 WST01/01, June 2021)

ON

Naturalsciencesolution – Accelerating academic skills


Page 49 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 50 of 151
Q7.

In the Venn diagram below, A, B and C are events and p, q, r and s are probabilities.
The events A and C are independent and P(A) = 0.65

(a) State which two of the events A, B and C are mutually exclusive.
(1)

NA
(b) Find the value of r and the value of s.
(5)

TU
The events (A ∩ C ' ) and (B ∪ C ) are also independent.
(c) Find the exact value of p and the exact value of q. Give your answers as fractions.
RA (6)

L (Total for question = 12 marks)


SC
IE (Q02 WST01/01, June 2021)

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 51 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 52 of 151
Q8.

The events H and W are such that

Given that H and W are independent,

(a) show that P(W) =


(4)
The event N is such that

(b) Find P
NA (2)
TU
Given that W and N are mutually exclusive,

RA
(c) draw a Venn diagram to represent the events H, W and N giving the exact probabilities of each region
in the Venn diagram.

L (5)

SC (Total for question = 11 marks)

IE
NC (Q04 WST01/01, June 2022)

E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 53 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 54 of 151
Q9.

The Venn diagram shows the events A, B and C and their associated probabilities.

Find
(a) P(B')
NA
(b) P(A ∪ C)
TU (1)

RA (2)
(c) P(A | B')
L (2)
SC
IE (Total for question = 5 marks)

NC (Q01 WST01/01, Jan 2021)

E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 55 of 151
Q10.

The Venn diagram shows the events A, B and C and their associated probabilities, where p and q are
probabilities.

(a) Find P(B)NA


TU
(b) Determine whether or not A and B are independent.
(1)

RA (2)
Given that P(C | B) = P(C)
L
(c) find the value of p and the value of q

The event D is such that


SC (3)

• A and D are mutually exclusive


IE
• P(B ∩ D) > 0 NC
E
(d) On the Venn diagram show a possible position for the event D
(1)

SO (Total for question = 7 marks)


LU
TI (Q01 WST01/01, Oct 2021)

ON

Naturalsciencesolution – Accelerating academic skills


Page 56 of 151
Q11.

(i) In the Venn diagram below, A and B represent events and p, q, r and s are probabilities.

NA
(a) Use algebra to show that 2p + 2q + 2r =
(4)

TU
(b) Find the value of p, the value of q, the value of r and the value of s
(5)

RA
(ii) Two events, C and D, are such that

L
where x is a positive constant.

SC
By considering P(C) + P(D) show that C and Dcannot be mutually exclusive.
(4)

IE (Total for question = 13 marks)


NC
E (Q04 WST01/01, Jan 2023)

SO
LU
TI
ON

Q12.
Naturalsciencesolution – Accelerating academic skills
Page 57 of 151
A bag contains a large number of coloured counters. Each counter is labelled A, B or C
30% of the counters are labelled A
45% of the counters are labelled B
The rest of the counters are labelled C
It is known that
2% of the counters labelled A are red
4% of the counters labelled B are red
6% of the counters labelled C are red
One counter is selected at random from the bag.
(a) Complete the tree diagram below to illustrate this information.
(2)

NA
(b) Calculate the probability that the counter is labelled A and is not red.
(2)
TU
(c) Calculate the probability that the counter is red.

RA
(d) Given that the counter is red, find the probability that it is labelled C
(2)

L (3)

SC
IE
NC
E
SO
LU
TI
ON
(Total for question = 9 marks)

(Q04 WST01/01, June 2023)

Naturalsciencesolution – Accelerating academic skills


Page 58 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 59 of 151
Q13.

Three bags A, B and C each contain coloured balls.


Bag A contains 4 red balls and 2 yellow balls only.
Bag B contains 4 red balls and 1 yellow ball only.
Bag C contains 6 red balls only.
In a game
Mike takes a ball at random from bag A, records the colour and places it in bag C.
He then takes a ball at random from bag B, records the colour and places it in bag C.
Finally, Mike takes a ball at random from bag C and records the colour.
(a) Complete the tree diagram below, to illustrate the game by adding the remaining branches and all
probabilities.
(3)

NA
(b) Show that the probability that Mike records a yellow ball exactly twice is

TU
Given that Mike records exactly 2 yellow balls,
(3)

RA
(c) find the probability that the ball drawn from bag A is red.

L
Mike plays this game a large number of times, each time starting with the bags containing balls as
(2)

SC
described above. The random variable X represents the number of yellow balls recorded in a single game.
(d) Find the probability distribution of X
IE (3)
(e) Find E(X )
NC (2)

E
SO
LU
TI
ON

(Total for question = 13 marks)

(Q04 WST01/01, Oct 2021)

Naturalsciencesolution – Accelerating academic skills


Page 60 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 61 of 151
Q14.

Two bags, X and Y, each contain green marbles (G) and blue marbles (B) only.

• Bag X contains 5 green marbles and 4 blue marbles


• Bag Y contains 6 green marbles and 5 blue marbles
A marble is selected at random from bag X and placed in bag Y
A second marble is selected at random from bag X and placed in bag Y
A third marble is then selected, this time from bag Y
(a) Use this information to complete the tree diagram shown below.
(3)
(b) Find the probability that the 2 marbles selected from bag X are of different colours.

NA
(c) Find the probability that all 3 marbles selected are the same colour.
(2)

TU (2)

RA
Given that all three marbles selected are the same colour,
(d) find the probability that they are all green.

L (3)

SC
IE
NC
E
SO
LU
TI
ON

(Total for question = 10 marks)

(Q02 WST01/01, Jan 2023)

Naturalsciencesolution – Accelerating academic skills


Page 62 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 63 of 151
Q15.

The lengths, L mm, of housefly wings are normally distributed with L ~ N(4.5, 0.42)
(a) Find the probability that a randomly selected housefly has a wing length of less than 3.86 mm.
(3)
(b) Find
(i) the upper quartile (Q3) of L
(ii) the lower quartile (Q1) of L
(4)
A value that is greater than Q3 + 1.5 × (Q3 – Q1) or smaller than Q1 – 1.5 × (Q3 – Q1) is defined as an
outlier.
(c) Find these two outlier limits.

NA
A housefly is selected at random.
(3)

TU
(d) Using standardisation, show that the probability that this housefly is not an outlier is 0.993 to 3 decimal
places.
RA (3)

L
Given that this housefly is not an outlier,
(e) showing your working, find the probability that the wing length of this housefly is greater than 5 mm.
SC (4)

IE (Total for question = 17 marks)

NC
E (Q05 WST01/01, Jan 2023)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 64 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 65 of 151
Q16.

A disc of radius 1 cm is rolled onto a horizontal grid of rectangles so that the disc is equally likely to land
anywhere on the grid. Each rectangle is 5 cm long and 3 cm wide. There are no gaps between the
rectangles and the grid is sufficiently large so that no discs roll off the grid.
If the disc lands inside a rectangle without covering any part of the edges of the rectangle then a prize is
won.
By considering the possible positions for the centre of the disc,

(a) show that the probability of winning a prize on any particular roll is
(3)
A group of 15 students each roll the disc onto the grid twenty times and record the number of times, x, that
each student wins a prize. Their results are summarised as follows
NA
TU
(b) Find the standard deviation of the number of prizes won per student.
(2)

RA
A second group of 12 students each roll the disc onto the grid twenty times and the mean number of prizes
won per student is 3.5 with a standard deviation of 2
L
(c) Find the mean and standard deviation of the number of prizes won per student for the whole group of
27 students.
SC (7)

IE
The 27 students also recorded the number of times that the disc covered a corner of a rectangle and
estimated the probability to be 0.2216 (to 4 decimal places).

NC
(d) Explain how this probability could be used to find an estimate for the value of π and state the value of
your estimate.

E (3)

SO (Total for question = 15 marks)

LU
TI (Q06 WST01/01, Jan 2021)

ON

Naturalsciencesolution – Accelerating academic skills


Page 66 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 67 of 151
Q17.

Xiang is designing shelves for a bookshop. The height, H cm, of books is modelled by the normal
distribution with mean 25.1 cm and standard deviation 5.5 cm
(a) Show that P(H > 30.8) = 0.15
(3)
Xiang decided that the smallest 5% of books and books taller than 30.8 cm would not be placed on the
shelves. All the other books will be placed on the shelves.
(b) Find the range of heights of books that will be placed on the shelves.
(3)
The books that will be placed on the shelves have heights classified as small, medium or large.
The numbers of small, medium and large books are in the ratios 2 : 3 : 3

NA
(c) The medium books have heights x cm where m < x < d
(i) Show that d = 25.8 to 1 decimal place.
TU
(ii) Find the value of m
(3)

RA
Xiang wants 2 shelves for small books, 3 shelves for medium books and 3 shelves for large books.
(4)

L
These shelves will be placed one above another and made of wood that is 1 cm thick.
(d) Work out the minimum total height needed.
SC (2)

IE (Total for question = 15 marks)

NC
E (Q06 WST01/01, Oct 2021)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 68 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 69 of 151
Q18.

A biased tetrahedral die has faces numbered 0, 1, 2 and 3. The die is rolled and the number face down on
the die, X, is recorded. The probability distribution of X is

If X = 3 then the final score is 3


If X ≠ 3 then the die is rolled again and the final score is the sum of the two numbers.
The random variable T is the final score.
(a) Find P(T = 2)
(2)
NA
(b) Find P(T = 3)

TU
(c) Given that the die is rolled twice, find the probability that the final score is 3
(3)

RA (3)
(Total for question = 8 marks)
L (Q05 WST11/01, Specimen papers )

SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 70 of 151
Q19.

Kris works in the mailroom of a large company and is responsible for all the letters sent by the company.
The weights of letters sent by the company, W grams, have a normal distribution with mean 165 g and
standard deviation 35 g.
(a) Estimate the proportion of letters sent by the company that weigh less than 120 g.
(3)

Kris splits the letters to be sent into 3 categories: heavy, medium and light, with of the letters in each
category.
(b) Find the weight limits that determine medium letters.
(4)

NA
A heavy letter is chosen at random.
(c) Find the probability that this letter weighs less than 200 g.

TU
Kris chooses a random sample of 3 letters from those in the mailroom one day.
(3)

RA
(d) Find the probability that there is one letter in each of the 3 categories.

L (3)

SC (Total for question = 13 marks)

IE
NC (Q04 WST01/01, June 2021)

E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 71 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 72 of 151
Q20.

A factory produces shoes.


A quality control inspector at the factory checks a sample of 120 shoes for each of three types of defect.
The Venn diagram represents the inspector's results.
A represents the event that a shoe has defective stitching
B represents the event that a shoe has defective colouring
C represents the event that a shoe has defective soles

NA
TU
RA
One of the shoes in the sample is selected at random.
L
(a) Find the probability that it does not have defective soles.

(b) Find P(A ∩ B ∩ C′)


SC (1)

IE (1)
(c) Find P(A ∪ B ∪ C′)
NC (2)

E
(d) Find the probability that the shoe has at most one type of defect.
(2)
SO
(e) Given the selected shoe has at most one type of defect, find the probability it has defective stitching.

LU
The random variable X is the number of the events A, B, C that occur for a randomly selected shoe.
(2)

(f) Find E(X) TI


ON (3)

(Total for question = 11 marks)

(Q01 WST01/01, Jan 2022)

Naturalsciencesolution – Accelerating academic skills


Page 73 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 74 of 151
Q21.

Gill buys a bag of logs to use in her stove. The lengths, l cm, of the 88 logs in the bag are summarised in
the table below.

A histogram is drawn to represent these data.


NA
The bar representing logs with length has a width of 1.5 cm and a height of 4 cm.

TU
(a) Calculate the width and height of the bar representing log lengths of
(3)

RA
(b) Use linear interpolation to estimate the median of l
(2)
L
The maximum length of log Gill can use in her stove is 26 cm.

SC
Gill estimates, using linear interpolation, that x logs from the bag will fit into her stove.
(c) Show that x = 62
IE (1)
Gill randomly selects 4 logs from the bag.
NC
(d) Using x = 62 , find the probability that all 4 logs will fit into her stove.

E
The weights, W grams, of the logs in the bag are coded using y = 0.5w - 255 and summarised by
(2)

SO
n = 88 ∑ y = 924 ∑ y2 = 12 862
(e) Calculate
(i) the mean of W
LU
(ii) the variance of W TI (3)

ON (3)

(Total for question = 14 marks)

(Q03 WST01/01, June 2022)

Naturalsciencesolution – Accelerating academic skills


Page 75 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 76 of 151
Q22.

The stem and leaf diagram shows the number of deliveries made by Pat each day for 24 days

where a, b and c are positive integers with a < b < c


An outlier is defined as any value greater than 1.5 × interquartile range above the upper quartile.

NA
Given that there is only one outlier for these data,
(a) show that c = 9
TU (3)

RA
The number of deliveries made by Pat each day is represented by d
The data in the stem and leaf diagram are coded using
L x = d – 125

SC
and the following summary statistics are obtained

(b) Find the mean number of deliveries. IE


NC
(c) Find the standard deviation of the number of deliveries.
(3)

E
One of these 24 days is selected at random. The random variable D represents the number of deliveries
(2)

made by Pat on this day.


The random variable X = D – 125
SO
(d) Find P(D > 118 |X < 0) LU
TI (2)

ON
(Total for question = 10 marks)

(Q03 WST01/01, Jan 2022)

Naturalsciencesolution – Accelerating academic skills


Page 77 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 78 of 151
Q23.

A red spinner is designed so that the score R is given by the following probability distribution.

(a) Show that E(R2) = 15.8


(1)
Given also that E(R) = 3.7
(b) find the standard deviation of R, giving your answer to 2 decimal places.
(2)

NA
A yellow spinner is designed so that the score Y is given by the probability distribution in the table below.
The cumulative distribution function F(y) is also given.

TU
RA
(c) Write down the value of d L
Given that E(Y) = 4.55
SC (1)

(d) find the value of c IE (5)


Pabel and Jessie play a game with these two spinners. NC
Pabel uses the red spinner.
Jessie uses the yellow spinner.
E
They take turns to spin their spinner. SO
LU
The winner is the first person whose spinner lands on the number 2 and the game ends.
Jessie spins her spinner first.
(e) Find the probability that Jessie wins on her second spin. TI (2)
ON
(f) Calculate the probability that, in a game, the score on Pabel's first spin is the same as the score on
Jessie's first spin.
(3)

(Total for question = 14 marks)

(Q05 WST01/01, June 2022)

Naturalsciencesolution – Accelerating academic skills


Page 79 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 80 of 151
1.3 Correlation and Regression:

Reference Notes:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 81 of 151
Q1.

The production cost, £c million, of a film and the total ticket sales, £t million, earned by the film are recorded
for a sample of 40 films.
Some summary statistics are given below.

(a) Find the exact value of Stt and the exact value of Sct
(3)
(b) Calculate the value of the product moment correlation coefficient for these data.
(2)
(c) Give an interpretation of your answer to part (b)

NA
(d) Show that the equation of the linear regression line of t on c can be written as
(1)

TU t = –5.84 + 0.976c
where the values of the intercept and gradient are given to 3 significant figures.
RA
(e) Find the expected total ticket sales for a film with a production cost of £90 million.
(3)

L (2)
Using the regression line in part (d)
SC
(f) find the range of values of the production cost of a film for which the total ticket sales are less than 80%
of its production cost.
IE (2)
NC
E (Total for question = 13 marks)

SO (Q02 WST01/01, Oct 2022)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 82 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 83 of 151
Q2.

Students on a psychology course were given a pre-test at the start of the course and a final exam at the
end of the course. The teacher recorded the number of marks achieved on the pre-test, p, and the number
of marks achieved on the final exam, f, for 34 students and displayed them on the scatter diagram.

NA
TU
RA
L
SC
The equation of the least squares regression line for these data is found to be

IE
f = 10.8 + 0.748 p
For these students, the mean number of marks on the pre-test is 62.4

NC
(a) Use the regression model to find the mean number of marks on the final exam.
(2)
E
(b) Give an interpretation of the gradient of the regression line.

SO
Considering the equation of the regression line, Priya says that she would expect someone who scored 0
(1)

marks on the pre-test to score 10.8 marks on the final exam.


(c) Comment on the reliability of Priya's statement.
LU
TI (1)

ON
(d) Write down the number of marks achieved on the final exam for the student who exceeded the
expectation of the regression model by the largest number of marks.
(1)
(e) Find the range of values of p for which this regression model, f = 10.8 + 0.748 p, predicts a greater
number of marks on the final exam than on the pre-test.
(3)

Naturalsciencesolution – Accelerating academic skills


Page 84 of 151
Later the teacher discovers an error in the recorded data. The student who achieved a score of 98 on the
pre-test, scored 92 not 29 on the final exam.
The summary statistics used for the model f = 10.8 + 0.748 p are corrected to include this information and a
new least squares regression line is found.
Given the original summary statistics were,

(f) calculate the gradient of the new regression line. Show your working clearly.
(5)

(Total for question = 13 marks)

NA (Q06 WST01/01, Jan 2022)

TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 85 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 86 of 151
Q3.

Two economics students, Andi and Behrouz, are studying some data relating to unemployment, x %, and
increase in wages, y %, for a European country. The least squares regression line of y on x has equation
y = 3.684 − 0.3242x
and
(a) Show that Syy = 7.524 375
(1)
(b) Find Sxx
(4)
(c) Find the product moment correlation coefficient between x and y.

NA
Behrouz claims that, assuming the model is valid, the data show that when unemployment is 2% wages
(3)

increase at over 3%
TU
(d) Explain how Behrouz could have come to this conclusion.
RA (1)
Andi uses the formula
L range = mean ± 3 × standard deviation
to estimate the range of values for x. SC
(e) Find estimates of the minimum value and the maximum value of x in these data using Andi's formula.
IE (3)

NC
(f) Comment, giving a reason, on the reliability of Behrouz's claim.
(2)
E
Andi suggests using the regression line with equation y = 3.684 – 0.3242x to estimate unemployment when
wages are increasing at 2%
(g) Comment, giving a reason, on Andi's suggestion. SO
LU (2)

TI
(Total for question = 16 marks)

ON
(Q06 WST01/01, June 2021)

Naturalsciencesolution – Accelerating academic skills


Page 87 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 88 of 151
Q4.

A large company is analysing how much money it spends on paper in its offices each year. The number of
employees in the office, x, and the amount spent on paper in a year, p ($ hundreds), in each of 12
randomly selected offices were recorded.
The results are summarised in the following statistics.

(a) Show that Sxp = 231.25


(1)
(b) Find the product moment correlation coefficient for these data.
(3)
(c) Find the equation of the regression line of p on x in the form p = a + bx
NA (4)

TU
(d) Give an interpretation of the gradient of your regression line.
(1)
RA
The director of the company wants to reduce the amount spent on paper each year.

part (c).
L
He wants each office to aim for a model of the form where a and b are the values found in

SC
Using the data for the 93 employees from the 12 offices,

IE
(e) estimate the percentage saving in the amount spent on paper each year by the company using the
director's model.

NC (3)

E (Total for question = 12 marks)

SO (Q02 WST01/01, Oct 2021)


LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 89 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 90 of 151
Q5.

Stuart is investigating the relationship between Gross Domestic Product (GDP) and the size of the
population for a particular country. He takes a random sample of 9 years and records the size of the
population, t millions, and the GDP, g billion dollars for each of these years.
The data are summarised as
n = 9 ∑ t = 7.87 ∑ g = 144.84 ∑ g2 = 3624.41 Stt = 1.29 Stg = 40.25
(a) Calculate the product moment correlation coefficient between t and g
(3)
(b) Give an interpretation of your product moment correlation coefficient.
(1)
(c) Find the equation of the least squares regression line of g on t in the form g = a + bt
NA (4)

TU
(d) Give an interpretation of the value of b in your regression line.
(1)

RA
(e) (i) Use the regression line from part (c) to estimate the GDP, in billions of dollars, for a population of 7
000 000
L
(ii) Comment on the reliability of your answer in part (i). Give a reason, in context, for your answer.
(2)

SC (1)
Using the regression line from part (c), Stuart estimates that for a population increase of x million there will
be an increase of 0.1 billion dollars in GDP.
(f) Find the value of x
IE
NC (2)

E (Total for question = 14 marks)


SO
LU (Q02 WST01/01, June 2022)

TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 91 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 92 of 151
Q6.

A company director wants to introduce a performance-related pay structure for her managers. A random
sample of 15 managers is taken and the annual salary, y in £1000, was recorded for each manager. The
director then calculated a performance score, x, for each of these managers.
The results are shown on the scatter diagram in Figure 1 on the next page.
(a) Describe the correlation between performance score and annual salary.
(1)
The results are also summarised in the following statistics.

(b) (i) Show that Sxy = 2006


(1)

NA
(ii) Find Syy
(2)

TU
(c) Find the product moment correlation coefficient between performance score and annual salary.
(2)
RA
The director believes that there is a linear relationship between performance score and annual salary.

L
(d) State, giving a reason, whether or not these data are consistent with the director's belief.
(1)

SC
(e) Calculate the equation of the regression line of y on x, in the form y = a + bx
Give the value of a and the value of b to 3 significant figures.

(f) Give an interpretation of the value of b.


IE (4)

NC (1)

E
(g) Plot your regression line on the scatter diagram in Figure 1
(2)
SO
The director hears that one of the managers in the sample seems to be underperforming.

LU
(h) On the scatter diagram, circle the point that best identifies this manager.
(1)

TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 93 of 151
The director decides to use this regression line for the new performance related pay structure.
(i) Estimate, to 3 significant figures, the new salary of a manager with a performance score of 30
(2)

NA
TU
RA
L
SC
IE
NC
E (Total for question = 17 marks)

SO (Q05 WST01/01, Jan 2021)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 94 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 95 of 151
Q7.

A research student is investigating the maximum weight, y grams, of sugar that will dissolve in 100 grams
of water at various temperatures, x °C, where 10 ≤ x ≤ 80
The research student calculated the regression line of y on x and found it to be
y = 151.2 + 2.72x
(a) Give an interpretation of the gradient of the regression line.
(1)
(b) Use the regression line to estimate the maximum weight of sugar that will dissolve in 100 grams of
water when the temperature is 90 °C.
(2)
(c) Comment on the reliability of your estimate, giving a reason for your answer.
NA (2)

TU
Using the regression line of y on x and the following summary statistics

RA
(d) show that the product moment correlation coefficient for these data is 0.988 to 3 decimal places.

L
The research student's supervisor plotted the original data on a scatter diagram, below.
(7)

SC
With reference to both the scatter diagram and the correlation coefficient,
(e) discuss the suitability of a linear regression model to describe the relationship between x and y.
IE (2)

NC
E
SO
LU
TI
ON

(Total for question = 14 marks)

(Q06 WST01/01, Jan 2023)

Naturalsciencesolution – Accelerating academic skills


Page 96 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 97 of 151
Q8.

The variables x and y have the following regression equations based on the same 12 observations.

(a) (i) Find the point of intersection of these lines.


(ii) Hence show that
(4)
Given that

NA
(b) Find Sxy
TU (4)

RA
(c) Find the product moment correlation coefficient between x and y
(4)
L
SC (Total for question = 12 marks)

IE (Q06 WST01/01, Oct 2023)


NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 98 of 151
Q9.

Two students, Olive and Shan, collect data on the weight, w grams, and the tail length, t cm, of 15 mice.
Olive summarised the data as follows

(a) Calculate the value of Stw and the value of Sww


(3)
(b) Calculate the value of the product moment correlation coefficient between w and t
(2)
(c) Show that the equation of the regression line of w on t can be written as
w = –16.7 + 4.77t

NA
(d) Give an interpretation of the gradient of the regression line.
(3)

TU (1)

RA
(e) Explain why it would not be appropriate to use the regression line in part (c) to estimate the weight of a
mouse with a tail length of 2 cm.

L (2)

SC
Shan decided to code the data using x = t – 6 and

IE
(f) Write down the value of the product moment correlation coefficient between x and y
(1)
NC
(g) Write down an equation of the regression line of y on x
You do not need to simplify your equation.
E (1)

SO (Total for question = 13 marks)


LU
TI (Q02 WST01/01, June 2023)

ON

Naturalsciencesolution – Accelerating academic skills


Page 99 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 100 of 151
Q10.

Tom's car holds 50 litres of petrol when the fuel tank is full.
For each of 10 journeys, each starting with 50 litres of petrol in the fuel tank, Tom records the distance
travelled, d kilometres, and the amount of petrol used, p litres.
The summary statistics for the 10 journeys are given below.

(a) Calculate the product moment correlation coefficient between d and p


(3)
The amount of petrol remaining in the fuel tank for each journey, w litres, is recorded.
(b) (i) Write down an equation for w in terms of p

NA
(ii) Hence, write down the value of the product moment correlation coefficient between w and p
(2)

TU
(c) Write down the value of the product moment correlation coefficient between d and w
(1)
RA
L (Total for question = 6 marks)

SC (Q02 WST01/01, Jan 2022)

IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 101 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 102 of 151
Q11.

The percentage oil content, p, and the weight, w milligrams, of each of 10 randomly selected sunflower
seeds were recorded. These data are summarised below.

(a) Find the value of Sww and the value of Swp


(3)
(b) Calculate the product moment correlation coefficient between p and w
(2)
(c) Give an interpretation of your product moment correlation coefficient.
(1)

NA
The equation of the regression line of p on w is given in the form p = a + bw
(d) Find the equation of the regression line of p on w

TU
(e) Hence estimate the percentage oil content of a sunflower seed which weighs 60 milligrams.
(4)

RA (2)

L (Total for question = 12 marks)


SC
IE (Q01 WST11/01, Specimen papers )

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 103 of 151
1.4 Discrete Random Variable:

Reference Notes:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 104 of 151
Q1.

The discrete random variable X has probability distribution

(a) Write down the name given to this distribution.


(1)
Find
(b) P(X = 4)
(1)
(c) F(3)
(1)
NA
(d) P(3X − 3 > X + 4)

(e) Write down E(X)


TU (2)

RA (1)
(f) Find E(X2)
L (2)
(g) Hence find Var (X)
SC (2)
Given that E(aX − 3) = 11.4 IE
(h) find Var (aX − 3)
NC (4)
E
SO (Total for question = 14 marks)

LU
(Q03 WST11/01, Specimen papers )

TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 105 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 106 of 151
Q2.

The discrete random variable X has the following probability distribution

Given that E(X ) = 0.5


(a) find the value of a.
(2)
Given also that Var(X ) = 5.01
(b) find the value of b and the value of c.
(5)

NA
The random variable Y = 5 − 8X
(c) Find (i) E(Y )
(ii) Var(Y ) TU (3)
2
(d) Find P(4X > Y ) RA
L (5)

SC (Total for question = 15 marks)

IE
NC (Q05 WST01/01, June 2021)

E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 107 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 108 of 151
Q3.

A red spinner is designed so that the score R is given by the following probability distribution.

(a) Show that E(R2) = 15.8


(1)
Given also that E(R) = 3.7
(b) find the standard deviation of R, giving your answer to 2 decimal places.
(2)

NA
A yellow spinner is designed so that the score Y is given by the probability distribution in the table below.
The cumulative distribution function F(y) is also given.

TU
RA
(c) Write down the value of d L
Given that E(Y) = 4.55
SC (1)

(d) find the value of c IE (5)


Pabel and Jessie play a game with these two spinners. NC
Pabel uses the red spinner.
Jessie uses the yellow spinner.
E
They take turns to spin their spinner. SO
LU
The winner is the first person whose spinner lands on the number 2 and the game ends.
Jessie spins her spinner first.
(e) Find the probability that Jessie wins on her second spin. TI (2)
ON
(f) Calculate the probability that, in a game, the score on Pabel's first spin is the same as the score on
Jessie's first spin.
(3)

(Total for question = 14 marks)

(Q05 WST01/01, June 2022)

Naturalsciencesolution – Accelerating academic skills


Page 109 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 110 of 151
Q4.

The random variable W has a discrete uniform distribution where

P(W = w) = for w = 1, 2, 3, 4, 5
(a) Find P(2 ≤ W < 3.5)
(1)
The discrete random variable X = 5 – 2W
(b) Find E(X)
(3)
(c) Find P(X < W)

NA
The discrete random variable Y =
(2)

(d) Find TU
RA
(i) the probability distribution of Y
(ii) Var(Y), showing your working.

(e) Find Var(2 – 3Y) L (5)

SC (2)

IE (Total for question = 13 marks)

NC
E (Q04 WST01/01, Jan 2022)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 111 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 112 of 151
Q5.

A spinner can land on the numbers 10, 12, 14 and 16 only and the probability of the spinner landing on
each number is the same.
The random variable X represents the number that the spinner lands on when it is spun once.
(a) State the name of the probability distribution of X.
(1)
(b) (i) Write down the value of E(X)
(1)
(ii) Find Var(X)
(2)
A second spinner can land on the numbers 1, 2, 3, 4 and 5 only.

NA
The random variable Y represents the number that this spinner lands on when it is spun once. The
probability distribution of Y is given in the table below

TU
(c) Find (i) E(Y)
RA
(ii) Var(Y)
L (2)

SC
The random variable W = aX + b, where a and b are constants and a > 0
(3)

Given that E(W) = E(Y) and Var(W) = Var(Y)


(d) find the value of a and the value of b.
IE
NC (5)
Each of the two spinners is spun once.
(e) Find P(W = Y)
E
SO (2)

LU (Total for question = 16 marks)

TI
ON
(Q04 WST01/01, Jan 2021)

Naturalsciencesolution – Accelerating academic skills


Page 113 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 114 of 151
Q6.

The discrete random variable Y has the following probability distribution

where q, r and u are probabilities.


(a) Write down the value of E(Y)
(1)
The cumulative distribution function of Y is F(y)

Given that F(0) =


NA
(b) show that the value of u is
TU (3)

RA
Given also that Var (Y) = 37
(c) find the value of q and the value of r

L
The coordinates of a point P are (12, Y)
(4)

SC
The random variable D represents the length of OP
(d) Find the probability distribution of D
IE (6)
NC
E (Total for question = 14 marks)

SO (Q05 WST01/01, Oct 2021)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 115 of 151
Q7.

The cumulative distribution function of the discrete random variable W, which takes only the values 6, 7 and
8, is given by

Find E(W)
(4)

(Total for question = 4 marks)

NA (Q04 WST01/01, Oct 2022)

TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 116 of 151
Q8.

The discrete random variable X has the following probability distribution.

(a) Show that


(1)

(b) Find

NA (3)

The random variable


(c) Find
TU
(i) E(Y )
(ii) Var(Y )
RA
(d) Find P(X < 3|Y < 20)
L (3)

SC (5)

IE (Total for question = 12 marks)

NC
E (Q04 WST01/01, Oct 2023)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 117 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 118 of 151
Q9.

A discrete random variable Y has probability function

where k is a constant.

(a) Show that


(2)
Find the exact value of

NA
(b) P(1 < Y ≤ 4)
(2)
(c) E(Y ) TU
RA
The random variable X = 15 – 2Y
(2)

(d) Calculate P(Y ≥ X)


L (3)
(e) Calculate Var (X) SC
IE (4)

NC (Total for question = 13 marks)

E
SO (Q05 WST01/01, June 2023)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 119 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 120 of 151
Q10.

A biased tetrahedral die has faces numbered 0, 1, 2 and 3. The die is rolled and the number face down on
the die, X, is recorded. The probability distribution of X is

If X = 3 then the final score is 3


If X ≠ 3 then the die is rolled again and the final score is the sum of the two numbers.
The random variable T is the final score.
(a) Find P(T = 2)
(2)
NA
(b) Find P(T = 3)

TU
(c) Given that the die is rolled twice, find the probability that the final score is 3
(3)

RA (3)

L (Total for question = 8 marks)

SC
IE (Q05 WST11/01, Specimen papers )

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 121 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 122 of 151
Q11.

Adana selects one number at random from the distribution of X which has the following probability
distribution.

(a) Given that the number selected by Adana is not 5, write down the probability it is 0
(1)
(b) Show that E(X2) = 75
(1)

NA
(c) Find Var(X )
(3)
TU
(d) Find Var(4 – 3X )

RA
Bruno and Charlie each independently select one number at random from the distribution of X
(2)

L
(e) Find the probability that the number Bruno selects is greater than the number Charlie selects.
(3)
SC
Devika multiplies Bruno's number by Charlie's number to obtain a product, D
(f) Determine the probability distribution of D
IE (4)
NC
E (Total for question = 14 marks)

SO (Q07 WST01/01, Oct 2022)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 123 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 124 of 151
Q12.

The probability distribution of the discrete random variable X is given by

where a is a constant.
(a) Find, in terms of a, E(X)
(2)
(b) Find the range of the possible values of E(X)
(3)
NA
Given that Var(X) = 0.56

TU
(c) find the possible values of a
(6)

RA (Total for question = 11 marks)


L
SC (Q03 WST01/01, Jan 2023)

IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 125 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 126 of 151
Q13.

A factory produces shoes.


A quality control inspector at the factory checks a sample of 120 shoes for each of three types of defect.
The Venn diagram represents the inspector's results.
A represents the event that a shoe has defective stitching
B represents the event that a shoe has defective colouring
C represents the event that a shoe has defective soles

NA
TU
RA
One of the shoes in the sample is selected at random.
L
(a) Find the probability that it does not have defective soles.

(b) Find P(A ∩ B ∩ C′)


SC (1)

IE (1)
(c) Find P(A ∪ B ∪ C′)
NC (2)

E
(d) Find the probability that the shoe has at most one type of defect.
(2)
SO
(e) Given the selected shoe has at most one type of defect, find the probability it has defective stitching.

LU
The random variable X is the number of the events A, B, C that occur for a randomly selected shoe.
(2)

(f) Find E(X) TI


ON (3)

(Total for question = 11 marks)

(Q01 WST01/01, Jan 2022)

Naturalsciencesolution – Accelerating academic skills


Page 127 of 151
1.5 The Normal Distribution:

Reference Notes:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 128 of 151
Q1.

A machine fills bottles with water. The volume of water delivered by the machine to a bottle is X ml where
X ~ N( μ, σ2)
One of these bottles of water is selected at random.
Given that μ = 503 and σ = 1.6
(a) find
(i) P(X > 505)
(ii) P(501 < X < 505)
(5)
(b) Find w such that P(1006 − w < X < w) = 0.9426
(3)

NA
Following adjustments to the machine, the volume of water delivered by the machine to a bottle is such that
μ = 503 and σ = q

TU
Given that P(X < r) = 0.01 and P(X > r + 6) = 0.05
(c) find the value of r and the value of q
RA (7)

L (Total for question = 15 marks)


SC
IE (Q07 WST11/01, Specimen papers )

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 129 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 130 of 151
Q2.

The weights of packages that arrive at a factory are normally distributed with a mean of 18 kg and a
standard deviation of 5.4 kg
(a) Find the probability that a randomly selected package weighs less than 10 kg
(3)
The heaviest 15% of packages are moved around the factory by Jemima using a forklift truck.
(b) Find the weight, in kg, of the lightest of these packages that Jemima will move.
(3)
One of the packages not moved by Jemima is selected at random.
(c) Find the probability that it weighs more than 18 kg

NA
A delivery of 4 packages is made to the factory.
(4)

TU
The weights of the packages are independent.
(d) Find the probability that exactly 2 of them will be moved by Jemima.

RA (3)

L (Total for question = 13 marks)

SC
IE (Q03 WST01/01, Jan 2021)

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 131 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 132 of 151
Q3.

A manufacturer fills bottles with oil.


The volume of oil in a bottle, Vml, is normally distributed with V ~ N(100,2.52)
(a) Find P(V > 104.9)
(3)
(b) In a pack of 150 bottles, find the expected number of bottles containing more than 104.9 ml
(2)
(c) Find the value of v, to 2 decimal places, such that P(V > v | V < 104.9) = 0.2801
(6)

NA (Total for question = 11 marks)

TU (Q06 WST01/01, June 2022)


RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 133 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 134 of 151
Q4.

The weights, W grams, of kiwi fruit grown on a farm are normally distributed with mean 80 grams and
standard deviation 8 grams.
The table shows the classifications of the kiwi fruit by their weight, where k is a positive constant.

One kiwi fruit is selected at random from those grown on the farm.
(a) Find the probability that this kiwi fruit is Large.
(3)
NA
35% of the kiwi fruit are Jumbo.

TU
(b) Find the value of k to one decimal place.
(4)

RA
75% of Tiny kiwi fruit weigh more than y grams.
(c) Find the value of y giving your answer to one decimal place.
L (5)

SC (Total for question = 12 marks)


IE
NC (Q05 WST01/01, Oct 2022)

E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 135 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 136 of 151
Q5.

The weights, X grams, of a particular variety of fruit are normally distributed with

A fruit of this variety is selected at random.


(a) Show that the probability that the weight of this fruit is less than 240 grams is 0.8849
(2)
(b) Find the probability that the weight of this fruit is between 190 grams and 240 grams.
(2)
(c) Find the value of k such that P(210 − k < X < 210 + k) = 0.95

NA
A wholesaler buys large numbers of this variety of fruit and classifies the lightest 15% as small.
(3)

TU
(d) Find the maximum weight of a fruit that is classified as small.
You must show your working clearly.
RA
The weights, Y grams, of a second variety of this fruit are normally distributed with
(3)

L
SC
Given that 5% of these fruit weigh less than 152 grams and 40% weigh more than 180 grams,
(e) calculate the mean and standard deviation of the weights of this variety of fruit.
IE (5)

NC (Total for question = 15 marks)

E
SO (Q05 WST01/01, Oct 2023)

LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 137 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 138 of 151
Q6.

Kris works in the mailroom of a large company and is responsible for all the letters sent by the company.
The weights of letters sent by the company, W grams, have a normal distribution with mean 165 g and
standard deviation 35 g.
(a) Estimate the proportion of letters sent by the company that weigh less than 120 g.
(3)

Kris splits the letters to be sent into 3 categories: heavy, medium and light, with of the letters in each
category.
(b) Find the weight limits that determine medium letters.
(4)
NA
A heavy letter is chosen at random.

TU
(c) Find the probability that this letter weighs less than 200 g.
(3)

RA
Kris chooses a random sample of 3 letters from those in the mailroom one day.
(d) Find the probability that there is one letter in each of the 3 categories.
L (3)

SC (Total for question = 13 marks)


IE
NC (Q04 WST01/01, June 2021)

E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 139 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 140 of 151
Q7.

Xiang is designing shelves for a bookshop. The height, H cm, of books is modelled by the normal
distribution with mean 25.1 cm and standard deviation 5.5 cm
(a) Show that P(H > 30.8) = 0.15
(3)
Xiang decided that the smallest 5% of books and books taller than 30.8 cm would not be placed on the
shelves. All the other books will be placed on the shelves.
(b) Find the range of heights of books that will be placed on the shelves.
(3)
The books that will be placed on the shelves have heights classified as small, medium or large.
The numbers of small, medium and large books are in the ratios 2 : 3 : 3

NA
(c) The medium books have heights x cm where m < x < d
(i) Show that d = 25.8 to 1 decimal place.
TU
(ii) Find the value of m
(3)

RA
Xiang wants 2 shelves for small books, 3 shelves for medium books and 3 shelves for large books.
(4)

L
These shelves will be placed one above another and made of wood that is 1 cm thick.
(d) Work out the minimum total height needed.
SC (2)

IE (Total for question = 15 marks)

NC
E (Q06 WST01/01, Oct 2021)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 141 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 142 of 151
Q8.

A machine squeezes apples to extract their juice. The volume of juice, J ml, extracted from 1 kg of apples is
modelled by a normal distribution with mean μ and standard deviation σ
Given that μ = 500 and σ = 25 use standardisation to
(a) (i) show that P(J > 510) = 0.3446
(2)
(ii) calculate the value of d such that P(J > d ) = 0.9192
(3)
Zen randomly selects 5 bags each containing 1 kg of apples and records the volume of juice extracted from
each bag of apples.
(b) Calculate the probability that each of the 5 bags of apples produce less than 510 ml of juice.

NA
Following adjustments to the machine, the volume of juice, R ml, extracted from 1 kg of apples is such that
(2)

μ = 520 and σ = k
TU
Given that P(R < r) = 0.15 and P(R > 3r – 800) = 0.005
RA
(c) find the value of r and the value of k

L (7)

SC (Total for question = 14 marks)

IE (Q07 WST01/01, June 2023)


NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 143 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 144 of 151
Q9.

Jia writes a computer program that randomly generates values from a normal distribution.
He sets the mean as 40 and the standard deviation as 2.4
(a) Find the probability that a particular value generated by the computer program is less than 37
(3)
Jia changes the mean to m but leaves the standard deviation as 2.4
The computer program then randomly generates 2 independent values from this normal distribution.
The probability that both of these values are greater than 32 is 0.16
(b) Find the value of m, giving your answer to 2 decimal places.
(4)

NA
Jia now changes the mean to 4 and the standard deviation to 8
The computer program then randomly generates 5 independent values from this normal distribution.
TU
(c) Find the probability that at least one of these values is negative.

RA (4)

L (Total for question = 11 marks)

SC
IE (Q05 WST01/01, Jan 2022)

NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 145 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 146 of 151
Q10.

The lengths, L mm, of housefly wings are normally distributed with L ~ N(4.5, 0.42)
(a) Find the probability that a randomly selected housefly has a wing length of less than 3.86 mm.
(3)
(b) Find
(i) the upper quartile (Q3) of L
(ii) the lower quartile (Q1) of L
(4)
A value that is greater than Q3 + 1.5 × (Q3 – Q1) or smaller than Q1 – 1.5 × (Q3 – Q1) is defined as an
outlier.
(c) Find these two outlier limits.

NA
A housefly is selected at random.
(3)

TU
(d) Using standardisation, show that the probability that this housefly is not an outlier is 0.993 to 3 decimal
places.
RA (3)

L
Given that this housefly is not an outlier,
(e) showing your working, find the probability that the wing length of this housefly is greater than 5 mm.
SC (4)

IE (Total for question = 17 marks)

NC
E (Q05 WST01/01, Jan 2023)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 147 of 151
Extra space for working:

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 148 of 151
Q11.

The random variable W has a discrete uniform distribution where

P(W = w) = for w = 1, 2, 3, 4, 5
(a) Find P(2 ≤ W < 3.5)
(1)
The discrete random variable X = 5 – 2W
(b) Find E(X)
(3)
(c) Find P(X < W)

NA
The discrete random variable Y =
(2)

(d) Find TU
RA
(i) the probability distribution of Y
(ii) Var(Y), showing your working.

(e) Find Var(2 – 3Y) L (5)

SC (2)

IE (Total for question = 13 marks)

NC
E (Q04 WST01/01, Jan 2022)

SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 149 of 151
1.6

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 150 of 151
1.7

NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON

Naturalsciencesolution – Accelerating academic skills


Page 151 of 151

You might also like