Edexcel AS Math S1 Practice Book
Edexcel AS Math S1 Practice Book
NA
TU
RAAS Edexcel
L
Statistics
SC (WST01)
IE
ClassifiedN Questions
CE
SO
2019 – 2023T LU
IO
N
L
SC
IE
NC
E
SO
LU
TI
ON
Reference Notes:
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Jim records the length, l mm, of 81 salmon. The data are coded using x = l – 600 and the following
summary statistics are obtained.
NA
TU
RA
(c) Find the maximum number of salmon that have weights in the interval
IE
For these data an outlier is defined as a value that is more than
1.5 × IQR above the upper quartile or 1.5 × IQR below the lower quartile
(d) Show that there are no outliers. NC
E (3)
(Total for question = 9 marks)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
L (2)
(Total for question = 10 marks)
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The histogram shows the times taken, t minutes, by each of 100 people to swim 500 metres.
NA
TU
RA
(a) Use the histogram to complete the frequency table for the times taken by the 100 people to swim 500
metres.
L
SC
IE (1)
NC
(b) Estimate the number of people who took less than 16 minutes to swim 500 metres.
(2)
E
(c) Find an estimate for the mean time taken to swim 500 metres.
(2)
Given that ∑ ft2 = 41 033 SO
LU
(d) find an estimate for the standard deviation of the times taken to swim 500 metres.
(2)
Given that Q3 = 23 TI
ON
(e) use linear interpolation to estimate the interquartile range of the times taken to swim 500 metres.
(3)
(Total for question = 10 marks)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The stem lengths of a sample of 120 tulips are recorded in the grouped frequency table below.
NA
A histogram is drawn to represent these data.
The area of the bar representing the class is 16.5 cm2
TU
(a) Calculate the exact area of the bar representing the class.
(2)
RA
The height of the tallest bar in the histogram is 10 cm.
L
(b) Find the exact height of the second tallest bar.
(3)
Q1 for these data is 45 cm. SC
IE
(c) Use linear interpolation to find an estimate for
(i) Q2
(ii) the interquartile range.
NC (4)
One measure of skewness is given by
E
(d) By calculating this measure, describe the skewness of these data.
SO
LU (2)
(Total for question = 11 marks)
TI (Q01 WST01/01, Oct 2022)
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The histogram shows the distances, in km, that 274 people travel to work.
NA
TU
RA
L
SC
Given that 60 of these people travel between 10 km and 20 km to work, estimate
IE
(a) the number of people who travel between 22 km and 45 km to work,
(3)
NC
(b) the median distance travelled to work by these 274 people,
SO (3)
(Total for question = 8 marks)
LU (Q01 WST01/01, June 2023)
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A disc of radius 1 cm is rolled onto a horizontal grid of rectangles so that the disc is equally likely to land
anywhere on the grid. Each rectangle is 5 cm long and 3 cm wide. There are no gaps between the
rectangles and the grid is sufficiently large so that no discs roll off the grid.
If the disc lands inside a rectangle without covering any part of the edges of the rectangle then a prize is
won.
By considering the possible positions for the centre of the disc,
(a) show that the probability of winning a prize on any particular roll is
(3)
A group of 15 students each roll the disc onto the grid twenty times and record the number of times, x, that
each student wins a prize. Their results are summarised as follows
NA
TU
(b) Find the standard deviation of the number of prizes won per student.
(2)
RA
A second group of 12 students each roll the disc onto the grid twenty times and the mean number of prizes
won per student is 3.5 with a standard deviation of 2
L
(c) Find the mean and standard deviation of the number of prizes won per student for the whole group of
27 students.
SC (7)
IE
The 27 students also recorded the number of times that the disc covered a corner of a rectangle and
estimated the probability to be 0.2216 (to 4 decimal places).
NC
(d) Explain how this probability could be used to find an estimate for the value of π and state the value of
your estimate.
E (3)
(Total for question = 15 marks)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The time taken to complete a puzzle, in minutes, is recorded for each person in a club. The times are
summarised in a grouped frequency distribution and represented by a histogram.
One of the class intervals has a frequency of 20 and is shown by a bar of width 1.5 cm and height 12 cm on
the histogram. The total area under the histogram is 94.5 cm2
Find the number of people in the club.
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A researcher recorded the time, t minutes, spent using a mobile phone during a particular afternoon, for
each child in a club.
The researcher coded the data using and the results are summarised in the table below.
NA
TU
(a) Write down the value of a and the value of b.
(1)
RA
(b) Calculate an estimate of the mean of v.
(1)
L
(c) Calculate an estimate of the standard deviation of v.
SC
(d) Use linear interpolation to estimate the median of v.
(2)
IE (2)
NC
(e) Hence describe the skewness of the distribution. Give a reason for your answer.
(2)
E
(f) Calculate estimates of the mean and the standard deviation of the time spent using a mobile phone
during the afternoon by the children in this club.
SO (4)
(Total for question = 12 marks)
LU
(Q04 WST11/01, Specimen papers )
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A random sample of 100 carrots is taken from a farm and their lengths, L cm, recorded.
The data are summarised in the following table.
NA
A histogram is drawn to represent these data.
The bar representing the class 5 ≤ L < 8 is 1.5 cm wide and 1 cm high.
TU
(a) Find the width and height of the bar representing the class 15 ≤ L < 20
(3)
RA
(b) Use linear interpolation to estimate the median length of these carrots.
(c) Estimate
L (2)
NC
A supermarket will only buy carrots with length between 9 cm and 22 cm.
(d) Estimate the proportion of carrots from the farm that the supermarket will buy.
E (2)
SO
Any carrots that the supermarket does not buy are sold as animal feed.
The farm makes a profit of 2.2 pence on each carrot sold to the supermarket, a profit of 0.8 pence on each
LU
carrot longer than 22 cm and a loss of 1.2 pence on each carrot shorter than 9 cm.
(e) Find an estimate of the mean profit per carrot made by the farm.
TI (2)
(Total for question = 14 marks)
ON
(Q03 WST01/01, June 2021)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The company Seafield requires contractors to record the number of hours they work each week. A random
sample of 38 weeks is taken and the number of hours worked per week by contractor Kiana is summarised
in the stem and leaf diagram below.
The quartiles for this distribution are summarised in the table below.
NA
TU
(a) Find the values of w, x and y
RA (3)
L
Kiana is looking for outliers in the data. She decides to classify as outliers any observations greater than
Q3 + 1.0 × (Q3 - Q1)
SC
(b) Showing your working clearly, identify any outliers that Kiana finds.
IE
(c) Draw a box plot for these data in the space provided on the grid opposite.
(2)
E
SO
to find the skewness of these data. Give your answer to 2 significant figures.
LU
Kiana's new employer, Landacre, wishes to know the average number of hours per week she worked
(2)
during her employment at Seafield to help calculate the cost of employing her. TI
(e) Explain why Landacre might prefer to know Kiana's mean, rather than median, number of hours worked
per week. ON (1)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Gill buys a bag of logs to use in her stove. The lengths, l cm, of the 88 logs in the bag are summarised in
the table below.
TU
(a) Calculate the width and height of the bar representing log lengths of
(3)
RA
(b) Use linear interpolation to estimate the median of l
(2)
L
The maximum length of log Gill can use in her stove is 26 cm.
SC
Gill estimates, using linear interpolation, that x logs from the bag will fit into her stove.
(c) Show that x = 62
IE (1)
Gill randomly selects 4 logs from the bag.
NC
(d) Using x = 62 , find the probability that all 4 logs will fit into her stove.
E
The weights, W grams, of the logs in the bag are coded using y = 0.5w - 255 and summarised by
(2)
SO
n = 88 ∑ y = 924 ∑ y2 = 12 862
(e) Calculate
(i) the mean of W
LU
(ii) the variance of W TI (3)
ON (3)
The production cost, £c million, of a film and the total ticket sales, £t million, earned by the film are recorded
for a sample of 40 films.
Some summary statistics are given below.
(a) Find the exact value of Stt and the exact value of Sct
(3)
(b) Calculate the value of the product moment correlation coefficient for these data.
(2)
(c) Give an interpretation of your answer to part (b)
NA
(d) Show that the equation of the linear regression line of t on c can be written as
(1)
TU t = –5.84 + 0.976c
where the values of the intercept and gradient are given to 3 significant figures.
RA
(e) Find the expected total ticket sales for a film with a production cost of £90 million.
(3)
L (2)
Using the regression line in part (d)
SC
(f) find the range of values of the production cost of a film for which the total ticket sales are less than 80%
of its production cost.
IE (2)
NC
E (Total for question = 13 marks)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The stem and leaf diagram below shows the ages (in years) of the residents in a care home.
NA
(b) Find the interquartile range (IQR) of the ages of the residents.
(2)
TU
An outlier is defined as a value that is either
RA
more than 1.5 × (IQR) below the lower quartile or
more than 1.5 × (IQR) above the upper quartile.
L
(c) Determine any outliers in these data. Show clearly any calculations that you use.
(3)
SC
(d) On the grid below, draw a box plot to summarise these data.
IE (3)
NC
E
SO
LU (Total for question = 9 marks)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The weights, to the nearest kilogram, of a sample of 33 red kangaroos taken in December are summarised
in the stem and leaf diagram below.
(a) Find
NA
TU
(i) the value of the median
RA
(ii) the value of Q1 and the value of Q3
for the weights of these red kangaroos.
(3)
L
For these data an outlier is defined as a value that is
greater than Q3 + 1.5 × (Q3 − Q1)
SC
or smaller than Q1 − 1.5 × (Q3 − Q1)
IE
(b) Show that there are 2 outliers for these data.
(3)
NC
Figure 1 below shows a box plot for the weights of the same 33 red kangaroos taken in February, earlier in
the year.
E
(c) In the space on Figure 1, draw a box plot to represent the weights of these red kangaroos in December.
SO
(d) Compare the distribution of the weights of red kangaroos taken in February with the distribution of the
(4)
LU
weights of red kangaroos taken in December of the same year. You should interpret your comparisons in
the context of the question.
TI (3)
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The stem and leaf diagram shows the ages of the 35 male passengers on a cruise.
TU
An outlier is defined as a value that is more than
(2)
RA
1.5 × IQR above the upper quartile
or
L
1.5 × IQR below the lower quartile
SC
(c) Show that there are 3 outliers amongst these ages.
(3)
IE
(d) On the grid in Figure 1 below, draw a box plot for the ages of the male passengers on the cruise.
NC
Figure 1 below also shows a box plot for the ages of the female passengers on the cruise.
(4)
E
(e) Comment on any difference in the distributions of ages of male and female passengers on the cruise.
SO
State the values of any statistics you have used to support your comment.
(1)
Anja, along with her 2 daughters and a granddaughter, now join the cruise.
Anja's granddaughter is younger than both of Anja's daughters.
Anja had her 23rd birthday on the day her eldest daughter was born.
LU
TI
When their 4 ages are included with the other female passengers on the cruise, the box plot does not
change.
(f) State, giving reasons, what you can say about
(i) the granddaughter's age
ON
(ii) Anja's age.
(3)
L
SC (Q03 WST01/01, Oct 2021)
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The stem and leaf diagram shows the number of deliveries made by Pat each day for 24 days
NA
Given that there is only one outlier for these data,
(a) show that c = 9
TU (3)
RA
The number of deliveries made by Pat each day is represented by d
The data in the stem and leaf diagram are coded using
L x = d – 125
SC
and the following summary statistics are obtained
E
One of these 24 days is selected at random. The random variable D represents the number of deliveries
(2)
ON
(Total for question = 10 marks)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Reference Notes:
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
• The probability that Sally wins the first game in a turn is 0.7
• If Sally wins a game the probability that she wins the next game is 0.6
• If Sally loses a game the probability that she wins the next game is 0.2
(a) Use this information to complete the tree diagram below.
(3)
NA
(b) Find the probability that Sally wins the first 2 games in a turn.
TU
(c) Find the probability that Sally wins exactly 2 games in a turn.
(2)
RA
Given that Sally wins 2 games in a turn,
(2)
L
(d) find the probability that she won the first 2 games.
SC
Given that Sally won the first game in a turn,
(2)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The Venn diagram shows the events A, B, C and D, where p, q, r and s are probabilities.
TU
RA
(b) find the exact value of q and the exact value of r
(6)
L
(c) find the exact value of s
SC (2)
IE
NC (Total for question = 11 marks)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
where w, x, y and z are probabilities.
L
(a) Find the value of w, the value of x, the value of y and the value of z
(4)
For one visit to the market, SC
IE
(b) find the probability that Bob buys either carrots or onions but not both.
(1)
(c) Show that the events C and O are not independent.
NC (2)
E
(ii) F, G and H are 3 events. F and H are mutually exclusive. F and G are independent.
Given that
SO
(a) find P(F ∪ H) LU
(b) find P(G)
TI (1)
ON (3)
(c) find P(F ∩ G)
(1)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
(c) Find P(A | B)
(1)
TU
Given that P(C′ ∩ B′) = 0.3
RA
(d) draw a Venn diagram to represent the events A, B and C
(5)
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
There are 7 red counters, 3 blue counters and 2 yellow counters in a bag. Gina selects a counter at random
from the bag and keeps it. If the counter is yellow she does not select any more counters. If the counter is
not yellow she randomly selects a second counter from the bag.
(a) Complete the tree diagram.
NA
TU
RA
L
SC
IE (2)
Given that Gina has selected a yellow counter, NC
(b) find the probability that she has 2 counters.
E (3)
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
In the Venn diagram below, A, B and C are events and p, q, r and s are probabilities.
The events A and C are independent and P(A) = 0.65
(a) State which two of the events A, B and C are mutually exclusive.
(1)
NA
(b) Find the value of r and the value of s.
(5)
TU
The events (A ∩ C ' ) and (B ∪ C ) are also independent.
(c) Find the exact value of p and the exact value of q. Give your answers as fractions.
RA (6)
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
(b) Find P
NA (2)
TU
Given that W and N are mutually exclusive,
RA
(c) draw a Venn diagram to represent the events H, W and N giving the exact probabilities of each region
in the Venn diagram.
L (5)
IE
NC (Q04 WST01/01, June 2022)
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The Venn diagram shows the events A, B and C and their associated probabilities.
Find
(a) P(B')
NA
(b) P(A ∪ C)
TU (1)
RA (2)
(c) P(A | B')
L (2)
SC
IE (Total for question = 5 marks)
E
SO
LU
TI
ON
The Venn diagram shows the events A, B and C and their associated probabilities, where p and q are
probabilities.
RA (2)
Given that P(C | B) = P(C)
L
(c) find the value of p and the value of q
ON
(i) In the Venn diagram below, A and B represent events and p, q, r and s are probabilities.
NA
(a) Use algebra to show that 2p + 2q + 2r =
(4)
TU
(b) Find the value of p, the value of q, the value of r and the value of s
(5)
RA
(ii) Two events, C and D, are such that
L
where x is a positive constant.
SC
By considering P(C) + P(D) show that C and Dcannot be mutually exclusive.
(4)
SO
LU
TI
ON
Q12.
Naturalsciencesolution – Accelerating academic skills
Page 57 of 151
A bag contains a large number of coloured counters. Each counter is labelled A, B or C
30% of the counters are labelled A
45% of the counters are labelled B
The rest of the counters are labelled C
It is known that
2% of the counters labelled A are red
4% of the counters labelled B are red
6% of the counters labelled C are red
One counter is selected at random from the bag.
(a) Complete the tree diagram below to illustrate this information.
(2)
NA
(b) Calculate the probability that the counter is labelled A and is not red.
(2)
TU
(c) Calculate the probability that the counter is red.
RA
(d) Given that the counter is red, find the probability that it is labelled C
(2)
L (3)
SC
IE
NC
E
SO
LU
TI
ON
(Total for question = 9 marks)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
(b) Show that the probability that Mike records a yellow ball exactly twice is
TU
Given that Mike records exactly 2 yellow balls,
(3)
RA
(c) find the probability that the ball drawn from bag A is red.
L
Mike plays this game a large number of times, each time starting with the bags containing balls as
(2)
SC
described above. The random variable X represents the number of yellow balls recorded in a single game.
(d) Find the probability distribution of X
IE (3)
(e) Find E(X )
NC (2)
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Two bags, X and Y, each contain green marbles (G) and blue marbles (B) only.
NA
(c) Find the probability that all 3 marbles selected are the same colour.
(2)
TU (2)
RA
Given that all three marbles selected are the same colour,
(d) find the probability that they are all green.
L (3)
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The lengths, L mm, of housefly wings are normally distributed with L ~ N(4.5, 0.42)
(a) Find the probability that a randomly selected housefly has a wing length of less than 3.86 mm.
(3)
(b) Find
(i) the upper quartile (Q3) of L
(ii) the lower quartile (Q1) of L
(4)
A value that is greater than Q3 + 1.5 × (Q3 – Q1) or smaller than Q1 – 1.5 × (Q3 – Q1) is defined as an
outlier.
(c) Find these two outlier limits.
NA
A housefly is selected at random.
(3)
TU
(d) Using standardisation, show that the probability that this housefly is not an outlier is 0.993 to 3 decimal
places.
RA (3)
L
Given that this housefly is not an outlier,
(e) showing your working, find the probability that the wing length of this housefly is greater than 5 mm.
SC (4)
NC
E (Q05 WST01/01, Jan 2023)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A disc of radius 1 cm is rolled onto a horizontal grid of rectangles so that the disc is equally likely to land
anywhere on the grid. Each rectangle is 5 cm long and 3 cm wide. There are no gaps between the
rectangles and the grid is sufficiently large so that no discs roll off the grid.
If the disc lands inside a rectangle without covering any part of the edges of the rectangle then a prize is
won.
By considering the possible positions for the centre of the disc,
(a) show that the probability of winning a prize on any particular roll is
(3)
A group of 15 students each roll the disc onto the grid twenty times and record the number of times, x, that
each student wins a prize. Their results are summarised as follows
NA
TU
(b) Find the standard deviation of the number of prizes won per student.
(2)
RA
A second group of 12 students each roll the disc onto the grid twenty times and the mean number of prizes
won per student is 3.5 with a standard deviation of 2
L
(c) Find the mean and standard deviation of the number of prizes won per student for the whole group of
27 students.
SC (7)
IE
The 27 students also recorded the number of times that the disc covered a corner of a rectangle and
estimated the probability to be 0.2216 (to 4 decimal places).
NC
(d) Explain how this probability could be used to find an estimate for the value of π and state the value of
your estimate.
E (3)
LU
TI (Q06 WST01/01, Jan 2021)
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Xiang is designing shelves for a bookshop. The height, H cm, of books is modelled by the normal
distribution with mean 25.1 cm and standard deviation 5.5 cm
(a) Show that P(H > 30.8) = 0.15
(3)
Xiang decided that the smallest 5% of books and books taller than 30.8 cm would not be placed on the
shelves. All the other books will be placed on the shelves.
(b) Find the range of heights of books that will be placed on the shelves.
(3)
The books that will be placed on the shelves have heights classified as small, medium or large.
The numbers of small, medium and large books are in the ratios 2 : 3 : 3
NA
(c) The medium books have heights x cm where m < x < d
(i) Show that d = 25.8 to 1 decimal place.
TU
(ii) Find the value of m
(3)
RA
Xiang wants 2 shelves for small books, 3 shelves for medium books and 3 shelves for large books.
(4)
L
These shelves will be placed one above another and made of wood that is 1 cm thick.
(d) Work out the minimum total height needed.
SC (2)
NC
E (Q06 WST01/01, Oct 2021)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A biased tetrahedral die has faces numbered 0, 1, 2 and 3. The die is rolled and the number face down on
the die, X, is recorded. The probability distribution of X is
TU
(c) Given that the die is rolled twice, find the probability that the final score is 3
(3)
RA (3)
(Total for question = 8 marks)
L (Q05 WST11/01, Specimen papers )
SC
IE
NC
E
SO
LU
TI
ON
Kris works in the mailroom of a large company and is responsible for all the letters sent by the company.
The weights of letters sent by the company, W grams, have a normal distribution with mean 165 g and
standard deviation 35 g.
(a) Estimate the proportion of letters sent by the company that weigh less than 120 g.
(3)
Kris splits the letters to be sent into 3 categories: heavy, medium and light, with of the letters in each
category.
(b) Find the weight limits that determine medium letters.
(4)
NA
A heavy letter is chosen at random.
(c) Find the probability that this letter weighs less than 200 g.
TU
Kris chooses a random sample of 3 letters from those in the mailroom one day.
(3)
RA
(d) Find the probability that there is one letter in each of the 3 categories.
L (3)
IE
NC (Q04 WST01/01, June 2021)
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
One of the shoes in the sample is selected at random.
L
(a) Find the probability that it does not have defective soles.
IE (1)
(c) Find P(A ∪ B ∪ C′)
NC (2)
E
(d) Find the probability that the shoe has at most one type of defect.
(2)
SO
(e) Given the selected shoe has at most one type of defect, find the probability it has defective stitching.
LU
The random variable X is the number of the events A, B, C that occur for a randomly selected shoe.
(2)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Gill buys a bag of logs to use in her stove. The lengths, l cm, of the 88 logs in the bag are summarised in
the table below.
TU
(a) Calculate the width and height of the bar representing log lengths of
(3)
RA
(b) Use linear interpolation to estimate the median of l
(2)
L
The maximum length of log Gill can use in her stove is 26 cm.
SC
Gill estimates, using linear interpolation, that x logs from the bag will fit into her stove.
(c) Show that x = 62
IE (1)
Gill randomly selects 4 logs from the bag.
NC
(d) Using x = 62 , find the probability that all 4 logs will fit into her stove.
E
The weights, W grams, of the logs in the bag are coded using y = 0.5w - 255 and summarised by
(2)
SO
n = 88 ∑ y = 924 ∑ y2 = 12 862
(e) Calculate
(i) the mean of W
LU
(ii) the variance of W TI (3)
ON (3)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The stem and leaf diagram shows the number of deliveries made by Pat each day for 24 days
NA
Given that there is only one outlier for these data,
(a) show that c = 9
TU (3)
RA
The number of deliveries made by Pat each day is represented by d
The data in the stem and leaf diagram are coded using
L x = d – 125
SC
and the following summary statistics are obtained
E
One of these 24 days is selected at random. The random variable D represents the number of deliveries
(2)
ON
(Total for question = 10 marks)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A red spinner is designed so that the score R is given by the following probability distribution.
NA
A yellow spinner is designed so that the score Y is given by the probability distribution in the table below.
The cumulative distribution function F(y) is also given.
TU
RA
(c) Write down the value of d L
Given that E(Y) = 4.55
SC (1)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Reference Notes:
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The production cost, £c million, of a film and the total ticket sales, £t million, earned by the film are recorded
for a sample of 40 films.
Some summary statistics are given below.
(a) Find the exact value of Stt and the exact value of Sct
(3)
(b) Calculate the value of the product moment correlation coefficient for these data.
(2)
(c) Give an interpretation of your answer to part (b)
NA
(d) Show that the equation of the linear regression line of t on c can be written as
(1)
TU t = –5.84 + 0.976c
where the values of the intercept and gradient are given to 3 significant figures.
RA
(e) Find the expected total ticket sales for a film with a production cost of £90 million.
(3)
L (2)
Using the regression line in part (d)
SC
(f) find the range of values of the production cost of a film for which the total ticket sales are less than 80%
of its production cost.
IE (2)
NC
E (Total for question = 13 marks)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Students on a psychology course were given a pre-test at the start of the course and a final exam at the
end of the course. The teacher recorded the number of marks achieved on the pre-test, p, and the number
of marks achieved on the final exam, f, for 34 students and displayed them on the scatter diagram.
NA
TU
RA
L
SC
The equation of the least squares regression line for these data is found to be
IE
f = 10.8 + 0.748 p
For these students, the mean number of marks on the pre-test is 62.4
NC
(a) Use the regression model to find the mean number of marks on the final exam.
(2)
E
(b) Give an interpretation of the gradient of the regression line.
SO
Considering the equation of the regression line, Priya says that she would expect someone who scored 0
(1)
ON
(d) Write down the number of marks achieved on the final exam for the student who exceeded the
expectation of the regression model by the largest number of marks.
(1)
(e) Find the range of values of p for which this regression model, f = 10.8 + 0.748 p, predicts a greater
number of marks on the final exam than on the pre-test.
(3)
(f) calculate the gradient of the new regression line. Show your working clearly.
(5)
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Two economics students, Andi and Behrouz, are studying some data relating to unemployment, x %, and
increase in wages, y %, for a European country. The least squares regression line of y on x has equation
y = 3.684 − 0.3242x
and
(a) Show that Syy = 7.524 375
(1)
(b) Find Sxx
(4)
(c) Find the product moment correlation coefficient between x and y.
NA
Behrouz claims that, assuming the model is valid, the data show that when unemployment is 2% wages
(3)
increase at over 3%
TU
(d) Explain how Behrouz could have come to this conclusion.
RA (1)
Andi uses the formula
L range = mean ± 3 × standard deviation
to estimate the range of values for x. SC
(e) Find estimates of the minimum value and the maximum value of x in these data using Andi's formula.
IE (3)
NC
(f) Comment, giving a reason, on the reliability of Behrouz's claim.
(2)
E
Andi suggests using the regression line with equation y = 3.684 – 0.3242x to estimate unemployment when
wages are increasing at 2%
(g) Comment, giving a reason, on Andi's suggestion. SO
LU (2)
TI
(Total for question = 16 marks)
ON
(Q06 WST01/01, June 2021)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A large company is analysing how much money it spends on paper in its offices each year. The number of
employees in the office, x, and the amount spent on paper in a year, p ($ hundreds), in each of 12
randomly selected offices were recorded.
The results are summarised in the following statistics.
TU
(d) Give an interpretation of the gradient of your regression line.
(1)
RA
The director of the company wants to reduce the amount spent on paper each year.
part (c).
L
He wants each office to aim for a model of the form where a and b are the values found in
SC
Using the data for the 93 employees from the 12 offices,
IE
(e) estimate the percentage saving in the amount spent on paper each year by the company using the
director's model.
NC (3)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Stuart is investigating the relationship between Gross Domestic Product (GDP) and the size of the
population for a particular country. He takes a random sample of 9 years and records the size of the
population, t millions, and the GDP, g billion dollars for each of these years.
The data are summarised as
n = 9 ∑ t = 7.87 ∑ g = 144.84 ∑ g2 = 3624.41 Stt = 1.29 Stg = 40.25
(a) Calculate the product moment correlation coefficient between t and g
(3)
(b) Give an interpretation of your product moment correlation coefficient.
(1)
(c) Find the equation of the least squares regression line of g on t in the form g = a + bt
NA (4)
TU
(d) Give an interpretation of the value of b in your regression line.
(1)
RA
(e) (i) Use the regression line from part (c) to estimate the GDP, in billions of dollars, for a population of 7
000 000
L
(ii) Comment on the reliability of your answer in part (i). Give a reason, in context, for your answer.
(2)
SC (1)
Using the regression line from part (c), Stuart estimates that for a population increase of x million there will
be an increase of 0.1 billion dollars in GDP.
(f) Find the value of x
IE
NC (2)
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A company director wants to introduce a performance-related pay structure for her managers. A random
sample of 15 managers is taken and the annual salary, y in £1000, was recorded for each manager. The
director then calculated a performance score, x, for each of these managers.
The results are shown on the scatter diagram in Figure 1 on the next page.
(a) Describe the correlation between performance score and annual salary.
(1)
The results are also summarised in the following statistics.
NA
(ii) Find Syy
(2)
TU
(c) Find the product moment correlation coefficient between performance score and annual salary.
(2)
RA
The director believes that there is a linear relationship between performance score and annual salary.
L
(d) State, giving a reason, whether or not these data are consistent with the director's belief.
(1)
SC
(e) Calculate the equation of the regression line of y on x, in the form y = a + bx
Give the value of a and the value of b to 3 significant figures.
NC (1)
E
(g) Plot your regression line on the scatter diagram in Figure 1
(2)
SO
The director hears that one of the managers in the sample seems to be underperforming.
LU
(h) On the scatter diagram, circle the point that best identifies this manager.
(1)
TI
ON
NA
TU
RA
L
SC
IE
NC
E (Total for question = 17 marks)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A research student is investigating the maximum weight, y grams, of sugar that will dissolve in 100 grams
of water at various temperatures, x °C, where 10 ≤ x ≤ 80
The research student calculated the regression line of y on x and found it to be
y = 151.2 + 2.72x
(a) Give an interpretation of the gradient of the regression line.
(1)
(b) Use the regression line to estimate the maximum weight of sugar that will dissolve in 100 grams of
water when the temperature is 90 °C.
(2)
(c) Comment on the reliability of your estimate, giving a reason for your answer.
NA (2)
TU
Using the regression line of y on x and the following summary statistics
RA
(d) show that the product moment correlation coefficient for these data is 0.988 to 3 decimal places.
L
The research student's supervisor plotted the original data on a scatter diagram, below.
(7)
SC
With reference to both the scatter diagram and the correlation coefficient,
(e) discuss the suitability of a linear regression model to describe the relationship between x and y.
IE (2)
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The variables x and y have the following regression equations based on the same 12 observations.
NA
(b) Find Sxy
TU (4)
RA
(c) Find the product moment correlation coefficient between x and y
(4)
L
SC (Total for question = 12 marks)
Two students, Olive and Shan, collect data on the weight, w grams, and the tail length, t cm, of 15 mice.
Olive summarised the data as follows
NA
(d) Give an interpretation of the gradient of the regression line.
(3)
TU (1)
RA
(e) Explain why it would not be appropriate to use the regression line in part (c) to estimate the weight of a
mouse with a tail length of 2 cm.
L (2)
SC
Shan decided to code the data using x = t – 6 and
IE
(f) Write down the value of the product moment correlation coefficient between x and y
(1)
NC
(g) Write down an equation of the regression line of y on x
You do not need to simplify your equation.
E (1)
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Tom's car holds 50 litres of petrol when the fuel tank is full.
For each of 10 journeys, each starting with 50 litres of petrol in the fuel tank, Tom records the distance
travelled, d kilometres, and the amount of petrol used, p litres.
The summary statistics for the 10 journeys are given below.
NA
(ii) Hence, write down the value of the product moment correlation coefficient between w and p
(2)
TU
(c) Write down the value of the product moment correlation coefficient between d and w
(1)
RA
L (Total for question = 6 marks)
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The percentage oil content, p, and the weight, w milligrams, of each of 10 randomly selected sunflower
seeds were recorded. These data are summarised below.
NA
The equation of the regression line of p on w is given in the form p = a + bw
(d) Find the equation of the regression line of p on w
TU
(e) Hence estimate the percentage oil content of a sunflower seed which weighs 60 milligrams.
(4)
RA (2)
NC
E
SO
LU
TI
ON
Reference Notes:
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
RA (1)
(f) Find E(X2)
L (2)
(g) Hence find Var (X)
SC (2)
Given that E(aX − 3) = 11.4 IE
(h) find Var (aX − 3)
NC (4)
E
SO (Total for question = 14 marks)
LU
(Q03 WST11/01, Specimen papers )
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
The random variable Y = 5 − 8X
(c) Find (i) E(Y )
(ii) Var(Y ) TU (3)
2
(d) Find P(4X > Y ) RA
L (5)
IE
NC (Q05 WST01/01, June 2021)
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A red spinner is designed so that the score R is given by the following probability distribution.
NA
A yellow spinner is designed so that the score Y is given by the probability distribution in the table below.
The cumulative distribution function F(y) is also given.
TU
RA
(c) Write down the value of d L
Given that E(Y) = 4.55
SC (1)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
P(W = w) = for w = 1, 2, 3, 4, 5
(a) Find P(2 ≤ W < 3.5)
(1)
The discrete random variable X = 5 – 2W
(b) Find E(X)
(3)
(c) Find P(X < W)
NA
The discrete random variable Y =
(2)
(d) Find TU
RA
(i) the probability distribution of Y
(ii) Var(Y), showing your working.
SC (2)
NC
E (Q04 WST01/01, Jan 2022)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A spinner can land on the numbers 10, 12, 14 and 16 only and the probability of the spinner landing on
each number is the same.
The random variable X represents the number that the spinner lands on when it is spun once.
(a) State the name of the probability distribution of X.
(1)
(b) (i) Write down the value of E(X)
(1)
(ii) Find Var(X)
(2)
A second spinner can land on the numbers 1, 2, 3, 4 and 5 only.
NA
The random variable Y represents the number that this spinner lands on when it is spun once. The
probability distribution of Y is given in the table below
TU
(c) Find (i) E(Y)
RA
(ii) Var(Y)
L (2)
SC
The random variable W = aX + b, where a and b are constants and a > 0
(3)
TI
ON
(Q04 WST01/01, Jan 2021)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
RA
Given also that Var (Y) = 37
(c) find the value of q and the value of r
L
The coordinates of a point P are (12, Y)
(4)
SC
The random variable D represents the length of OP
(d) Find the probability distribution of D
IE (6)
NC
E (Total for question = 14 marks)
LU
TI
ON
The cumulative distribution function of the discrete random variable W, which takes only the values 6, 7 and
8, is given by
Find E(W)
(4)
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
(b) Find
NA (3)
SC (5)
NC
E (Q04 WST01/01, Oct 2023)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
where k is a constant.
NA
(b) P(1 < Y ≤ 4)
(2)
(c) E(Y ) TU
RA
The random variable X = 15 – 2Y
(2)
E
SO (Q05 WST01/01, June 2023)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A biased tetrahedral die has faces numbered 0, 1, 2 and 3. The die is rolled and the number face down on
the die, X, is recorded. The probability distribution of X is
TU
(c) Given that the die is rolled twice, find the probability that the final score is 3
(3)
RA (3)
SC
IE (Q05 WST11/01, Specimen papers )
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Adana selects one number at random from the distribution of X which has the following probability
distribution.
(a) Given that the number selected by Adana is not 5, write down the probability it is 0
(1)
(b) Show that E(X2) = 75
(1)
NA
(c) Find Var(X )
(3)
TU
(d) Find Var(4 – 3X )
RA
Bruno and Charlie each independently select one number at random from the distribution of X
(2)
L
(e) Find the probability that the number Bruno selects is greater than the number Charlie selects.
(3)
SC
Devika multiplies Bruno's number by Charlie's number to obtain a product, D
(f) Determine the probability distribution of D
IE (4)
NC
E (Total for question = 14 marks)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
where a is a constant.
(a) Find, in terms of a, E(X)
(2)
(b) Find the range of the possible values of E(X)
(3)
NA
Given that Var(X) = 0.56
TU
(c) find the possible values of a
(6)
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
One of the shoes in the sample is selected at random.
L
(a) Find the probability that it does not have defective soles.
IE (1)
(c) Find P(A ∪ B ∪ C′)
NC (2)
E
(d) Find the probability that the shoe has at most one type of defect.
(2)
SO
(e) Given the selected shoe has at most one type of defect, find the probability it has defective stitching.
LU
The random variable X is the number of the events A, B, C that occur for a randomly selected shoe.
(2)
Reference Notes:
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A machine fills bottles with water. The volume of water delivered by the machine to a bottle is X ml where
X ~ N( μ, σ2)
One of these bottles of water is selected at random.
Given that μ = 503 and σ = 1.6
(a) find
(i) P(X > 505)
(ii) P(501 < X < 505)
(5)
(b) Find w such that P(1006 − w < X < w) = 0.9426
(3)
NA
Following adjustments to the machine, the volume of water delivered by the machine to a bottle is such that
μ = 503 and σ = q
TU
Given that P(X < r) = 0.01 and P(X > r + 6) = 0.05
(c) find the value of r and the value of q
RA (7)
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The weights of packages that arrive at a factory are normally distributed with a mean of 18 kg and a
standard deviation of 5.4 kg
(a) Find the probability that a randomly selected package weighs less than 10 kg
(3)
The heaviest 15% of packages are moved around the factory by Jemima using a forklift truck.
(b) Find the weight, in kg, of the lightest of these packages that Jemima will move.
(3)
One of the packages not moved by Jemima is selected at random.
(c) Find the probability that it weighs more than 18 kg
NA
A delivery of 4 packages is made to the factory.
(4)
TU
The weights of the packages are independent.
(d) Find the probability that exactly 2 of them will be moved by Jemima.
RA (3)
SC
IE (Q03 WST01/01, Jan 2021)
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The weights, W grams, of kiwi fruit grown on a farm are normally distributed with mean 80 grams and
standard deviation 8 grams.
The table shows the classifications of the kiwi fruit by their weight, where k is a positive constant.
One kiwi fruit is selected at random from those grown on the farm.
(a) Find the probability that this kiwi fruit is Large.
(3)
NA
35% of the kiwi fruit are Jumbo.
TU
(b) Find the value of k to one decimal place.
(4)
RA
75% of Tiny kiwi fruit weigh more than y grams.
(c) Find the value of y giving your answer to one decimal place.
L (5)
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The weights, X grams, of a particular variety of fruit are normally distributed with
NA
A wholesaler buys large numbers of this variety of fruit and classifies the lightest 15% as small.
(3)
TU
(d) Find the maximum weight of a fruit that is classified as small.
You must show your working clearly.
RA
The weights, Y grams, of a second variety of this fruit are normally distributed with
(3)
L
SC
Given that 5% of these fruit weigh less than 152 grams and 40% weigh more than 180 grams,
(e) calculate the mean and standard deviation of the weights of this variety of fruit.
IE (5)
E
SO (Q05 WST01/01, Oct 2023)
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Kris works in the mailroom of a large company and is responsible for all the letters sent by the company.
The weights of letters sent by the company, W grams, have a normal distribution with mean 165 g and
standard deviation 35 g.
(a) Estimate the proportion of letters sent by the company that weigh less than 120 g.
(3)
Kris splits the letters to be sent into 3 categories: heavy, medium and light, with of the letters in each
category.
(b) Find the weight limits that determine medium letters.
(4)
NA
A heavy letter is chosen at random.
TU
(c) Find the probability that this letter weighs less than 200 g.
(3)
RA
Kris chooses a random sample of 3 letters from those in the mailroom one day.
(d) Find the probability that there is one letter in each of the 3 categories.
L (3)
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Xiang is designing shelves for a bookshop. The height, H cm, of books is modelled by the normal
distribution with mean 25.1 cm and standard deviation 5.5 cm
(a) Show that P(H > 30.8) = 0.15
(3)
Xiang decided that the smallest 5% of books and books taller than 30.8 cm would not be placed on the
shelves. All the other books will be placed on the shelves.
(b) Find the range of heights of books that will be placed on the shelves.
(3)
The books that will be placed on the shelves have heights classified as small, medium or large.
The numbers of small, medium and large books are in the ratios 2 : 3 : 3
NA
(c) The medium books have heights x cm where m < x < d
(i) Show that d = 25.8 to 1 decimal place.
TU
(ii) Find the value of m
(3)
RA
Xiang wants 2 shelves for small books, 3 shelves for medium books and 3 shelves for large books.
(4)
L
These shelves will be placed one above another and made of wood that is 1 cm thick.
(d) Work out the minimum total height needed.
SC (2)
NC
E (Q06 WST01/01, Oct 2021)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
A machine squeezes apples to extract their juice. The volume of juice, J ml, extracted from 1 kg of apples is
modelled by a normal distribution with mean μ and standard deviation σ
Given that μ = 500 and σ = 25 use standardisation to
(a) (i) show that P(J > 510) = 0.3446
(2)
(ii) calculate the value of d such that P(J > d ) = 0.9192
(3)
Zen randomly selects 5 bags each containing 1 kg of apples and records the volume of juice extracted from
each bag of apples.
(b) Calculate the probability that each of the 5 bags of apples produce less than 510 ml of juice.
NA
Following adjustments to the machine, the volume of juice, R ml, extracted from 1 kg of apples is such that
(2)
μ = 520 and σ = k
TU
Given that P(R < r) = 0.15 and P(R > 3r – 800) = 0.005
RA
(c) find the value of r and the value of k
L (7)
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
Jia writes a computer program that randomly generates values from a normal distribution.
He sets the mean as 40 and the standard deviation as 2.4
(a) Find the probability that a particular value generated by the computer program is less than 37
(3)
Jia changes the mean to m but leaves the standard deviation as 2.4
The computer program then randomly generates 2 independent values from this normal distribution.
The probability that both of these values are greater than 32 is 0.16
(b) Find the value of m, giving your answer to 2 decimal places.
(4)
NA
Jia now changes the mean to 4 and the standard deviation to 8
The computer program then randomly generates 5 independent values from this normal distribution.
TU
(c) Find the probability that at least one of these values is negative.
RA (4)
SC
IE (Q05 WST01/01, Jan 2022)
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
The lengths, L mm, of housefly wings are normally distributed with L ~ N(4.5, 0.42)
(a) Find the probability that a randomly selected housefly has a wing length of less than 3.86 mm.
(3)
(b) Find
(i) the upper quartile (Q3) of L
(ii) the lower quartile (Q1) of L
(4)
A value that is greater than Q3 + 1.5 × (Q3 – Q1) or smaller than Q1 – 1.5 × (Q3 – Q1) is defined as an
outlier.
(c) Find these two outlier limits.
NA
A housefly is selected at random.
(3)
TU
(d) Using standardisation, show that the probability that this housefly is not an outlier is 0.993 to 3 decimal
places.
RA (3)
L
Given that this housefly is not an outlier,
(e) showing your working, find the probability that the wing length of this housefly is greater than 5 mm.
SC (4)
NC
E (Q05 WST01/01, Jan 2023)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
P(W = w) = for w = 1, 2, 3, 4, 5
(a) Find P(2 ≤ W < 3.5)
(1)
The discrete random variable X = 5 – 2W
(b) Find E(X)
(3)
(c) Find P(X < W)
NA
The discrete random variable Y =
(2)
(d) Find TU
RA
(i) the probability distribution of Y
(ii) Var(Y), showing your working.
SC (2)
NC
E (Q04 WST01/01, Jan 2022)
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON
NA
TU
RA
L
SC
IE
NC
E
SO
LU
TI
ON