Statistics II
Statistics II
CHAPTER 2
STATISTICS II
THE MEAN
Example I
1|Page
Solution
Step 2
Step 3
−20+22
= 5
2
=5
= 0.4
Step 4
Mass(kg) 57 58 59 60 61 62 63 64 65 66 67 68 69 70
Frequency 3 1 2 3 2 1 4 7 6 4 3 3 2 1
Solution
Step 1
Step 2
- Multiply the deviation (t) and frequency (f) and add to get
∑ ft where ∑sum of.
3|Page
Step 3
Step 4
Total Ft = -18-5-8-9-4-1+0+7+12+12+12+15+12+7= 32
32
Mean of t (t ) = 42
4|Page
= 0.7619 (4 d.p)
= 63.7619
- For a grouped data, we use the midpoint of the class as our x and
follow the other steps.
Example 1
Solution
- Let 120.5 be assumed mean. This is the midpoint of the class 119-
122.
119+122
= 2
241
= 2
= 120.5
- Draw another table including the midpoint (x) column and the
deviation (t) and ft column,
5|Page
Assumed mean = 120.5
quantity Mid-point T= x – A f ft
(x)
111-114 112.5 -8 2 -16
115-118 116.5 -4 16 -64
119-122 120.5 0 41 0
123-126 124.5 4 30 120
127-130 128.5 8 8 64
131-134 132.5 12 3 36
∑f=100 ∑ft=140
∑ft
t=
∑f
140
= 100
= 1.4
Actual mean x = A + t
= 120.5 + 1.4
= 121.9
x = 121.9
6|Page
Method I of scaling down
= 0.35
t= 0.35 x 4
7|Page
¿ 1.4
actual mean=A +t
= 120.5 + 1.4
= 121.9
Quantity Mid- x T= f ft
4 x
point (x) −30.125
4
111-114 112.5 28.125 -2 2 -4
115-118 116.5 29.125 -1 16 -16
119-122 120.5 30.125 0 41 0
123-126 124.5 31.125 1 30 30
127-130 128.5 32.125 2 8 16
131-134 132.5 33.125 3 3 9
∑f= 100 ∑ft=35
35
t=
100
= 0.35
8|Page
= (30.125 + 0.35) x 4
= 30.475 x 4
= 121.9
Example 2
Marks 1- 11- 21- 31- 41- 51- 61- 71- 81- 91-
10 20 30 40 50 60 70 80 90 100
frequency 6 8 4 10 16 22 4 2 6 2
Method I
9|Page
∑F= 80 ∑ft=4
∑ ft 4
t= =
∑f 80
= 0.05
= 45.5 + 0.5
= 46.0
Method 2
Divide the Assumed mean 45.5 by class width by 10, also divide the
mid-point by the class width (10) then get mean of t (t ¿ ¿
10 | P a g e
class Midpoint x x f ft
10 T= 10 -4.55
x
1-10 5.5 0.55 -4 6 -24
11-20 15.5 1.55 -3 8 -24
21-30 25.5 2.55 -2 4 -8
31-40 35.5 3.55 -1 10 -10
41-50 45.5 4.55 0 16 0
51-60 55.5 5.55 1 22 22
61-70 65.5 6.55 2 4 8
71-80 75.5 7.55 3 2 6
81-90 85.5 8.55 4 6 24
91-100 95.5 9.55 5 2 10
∑F= 80 ∑ft = 4
4
t=
80
= 0.05
x=¿ + assumed mean) 10
= (0.05 + 4.55)10
= 4.60 x 10
x = 46.0
Exercise
11 | P a g e
Marks 0-9 10- 20- 30- 40- 50- 60- 70- 80- 90-
19 29 39 49 59 69 79 89 99
frequency 3 14 2 8 4 23 34 15 10 2
More exercises on
THE MEDIAN
12 | P a g e
- To identify the value we add a column of cumulative frequency
(c.f)
- For a grouped data, we use the formula.
1
( −ci)
M= L+ 2
f
M – Median
L- Lower class boundary if median class.
n – Total frequency.
c- Cumulative Frequency above median class.
i. - Class interval / size.
F-frequency of the median class.
Example
13 | P a g e
Solution
100
- Get the median class. 2 = 50th
The lower class boundary will be the lower class limit (119) less 0.5
which will be 118.5
L = 118.5
C= 18
I = (122-119) + 1
14 | P a g e
=3+1
=4
F= 41
( 50−18 ) 4
Median = 118.5 + 41
32
= 118.5 + ×4
41
= 121.62
- Quartile (Q) is the value that divides a distribution into four equal
parts.
- Lower Quartile (QI) is the value below which lies a quarter of the
distribution (25%)
- Middle Quartile (Q2)or median is the value below which lies half
of the distribution (50%)
- Upper Quartile (Q3) is the value below which lies ¾ of the
distribution (75%)
- Deciles (D) are values which divide a distribution into ten equal
parts.
15 | P a g e
- Percentiles (p) divide a distribution into 100 equal parts.
Example
a) Estimate, by calculation
i) Lower and upper Quartile.
ii) The pass marks if 40% of the students were to fail.
iii) The minimum number of marks required to obtain grade A if
only 8 students were to get A.
iv) How many students were to pass if the pass mark was set at
35 marks.
Solution
16 | P a g e
marks frequency cumulative
frequency
21-30 4 4
31-40 16 20
41-50 27 47
51-60 32 79
61-70 15 94
71-80 4 98
81-90 2 100
100
QI = 4 = 25th.
L – 41-0.5 = 40.5
C-20
i-(40-31) + 1 = 10
f- 27
100
(
Q = 40.5 + 4
I
−20 ) 10
27
5
= 40.5 + 27 × 10
= 42.35 marks.
17 | P a g e
3
Q3 = l + ( 4 )
n−c i
3/ 4 n = ¾ X 100
= 75th
Class = 51-60
L = 51 – 0.5 = 50.5
C= 47
i= 10
f=32
( 75−47 ) 10
Q3 = 50.5 + 32
28× 10
= 50.5 + 32
=50.5 + 8.75
= 59.25 marks.
40 40
N= X 100
100 100
= 40 Students.
L = 40.5
C= 20
i= 10
18 | P a g e
F= 27
P40
= 40.5 + 7.4
= 47.9
≈ 48 marks
92
100
X100= 92rd student is in the class 60.5 – 70.5.
L = 60.5
C= 79
i= 10
f= 15
13
= 60.5 + ×10
15
19 | P a g e
= 60.5 + 8.7
= 69.2
≈69 marks
X
(
30.5 + 100
×100−4 ) 10
16
( X−4 ) 10=35
30.5 + 16
¿) 10 = 35- 30.5
( X16−4 )10=4.5
10
(x- 4) 16 =4.5
16
Multiplying both sides by 10
10 16 4.5 × 16
(x – 4) × =
16 10 10
x- 4 = 7.2
x= 7.2 + 4
20 | P a g e
x = 11.2
≈11 students
= 89 students.
Example 2
Determine the lower Quartile and upper Quartile for the following sets
of numbers.
6,11,7,6,9,8,4,3,8,9,8
Solution
21 | P a g e
- The median is estimated by locating the mid-point of a
distribution on its cumulative Frequency curve.
Example
Solution
On graph paper
Note:
22 | P a g e
- The lowest class boundary (in the case 20.5) has a cumulative
Frequency zero.
- The Ogive is characterized elongated ‘s’ shape.
23 | P a g e
24 | P a g e
Exercise
10 to 12 3
13 to 15 12
16 to 18 40
19 to 21 37
22 to 24 8
25 | P a g e
More exercise on
Measure of dispersion
Consider the table below which shows end – of- term marks for the
pupils in eight subjects.
Pupil 52 50 57 47 49 40 52 61
A
Pupil 80 27 50 82 19 73 44 33
B
Pupil B largest = 82
26 | P a g e
Smallest = 19
Range = 82- 19
= 63
NOTE
- The greater the variation of the values in a distribution the greater
the range.
- Range disadvantages are that it depends on two extremes.
Interquartile range
Example
QI =
40,47,49,50,52,57,61
47+ 49
Q1 = 2
= 48
52+57
Q3 = 2
= 54.5
= 6.5
27 | P a g e
Semi interquartile range/Quartile deviation
This is half the interquartile range.
Example
Q3−Q1
Quartile deviation of pupil A = 2
6.5
= 2
= 3.25
Marks 52 50 57 47 49 40 52 61
Deviation 1 -1 6 -4 -2 -11 1 10
Absolute 1 1 6 4 2 11 1 10
deviation
28 | P a g e
1+ 1+ 6+4 +2+11+1+10
= 8
36
=8
= 4.5
Exercise
- It is denoted as S2
Example
= x – 51
Marks (x) 52 50 57 47 49 40 52 61
Deviation 1 -1 6 -4 -2 -11 1 10
(d)
D2 1 1 36 16 4 121 1 100 ∑dz
∑ of d z ∑ d2
Variance = = ∑f
∑ of frequency
29 | P a g e
2
∑ d =1+1+36+ 16+4 +121+1+100
= 280
∑f= 1+1+1+1+1+1+1+1
=8
280
Variance = 8
S2 = 35
S =√ Variance
S =√ S 2
Example
30 | P a g e
The table below shows marks scored in a mathematic test.
Solution
- When using a grouped data we use the midpoint to work for the
mean, then use the difference of mean and the marks to get the
deviation.
- Multiply the squared deviation by the frequency.
- Get the sum of the deviation squared multiplied by frequency,.
- Get mean of deviation
∑fd 2
= ∑f
∑ fx
Mean ( x ¿ ¿= ∑ f
4675 4675
= 90
=
90
=¿ 51.9 (1 d.p)
31 | P a g e
2 2
∑ fd ∑ fd
Variance (S) = ∑f
= ∑f
15462.4
= 90
=171.80 (2 d.p)
=√ 171.80
= 13.11 (2 d.p)
2
2 ∑ fd 2 ∑( x −x) f
S = ∑f
=
∑f
32 | P a g e
2
∑ fx
Mean = ∑f
2
∑ fx
Then we subtract (x ¿ ¿ from ∑f
To get the variance and get the square root of the variance to get the
standard deviation.
Example
X 0 1 2 3 4 5 6 7 8 9
f 2 3 2 0 1 5 3 2 6 1
Solution
∑ fx
- Create a column for f x to work for mean of x = ∑f
2
- Create another column for x and fx2
2
∑ fx
- Work for ∑f
x F Fx X2 Fx2
33 | P a g e
0 2 0 0 0
1 3 3 1 3
2 2 4 4 8
3 0 0 9 0
4 1 4 16 16
5 5 25 25 125
6 3 18 36 108
7 2 14 49 98
8 6 48 64 384
9 1 9 81 81
∑ f =25 ∑ fx=125 2
∑ fx =823
∑f x
x=
∑f
125
= 25
x=5
823
S= √ 25
−(5)
2
=√ 32.92−(5)2
=√ 32.92−25
=√ 7.92
34 | P a g e
= 2.81
t= x – A (Assumed mean)
∑ fx
t=
∑f
Example
X 0 1 2 3 4 5 6 7 8 9
f 2 3 2 0 1 5 3 2 6 1
35 | P a g e
x f T=x-6 ft T2 Ft2
0 2 -6 -12 36 72
1 3 -5 -15 25 75
2 2 -4 -8 16 32
3 0 -3 0 9 0
4 1 -2 -2 4 4
5 5 -1 -5 1 5
6 3 0 0 0 0
7 2 1 2 1 2
8 6 2 12 4 24
9 1 3 3 9 9
∑ f =25 ∑ ft=−25 ∑ fx
2
= 223
∑ fx
t=
∑f
−25
= 25
t= -1
223
S= √ 25
−(−1)2
=√ 8.92−1
= √ 7.92
S = 2.81
36 | P a g e
Using scaled down deviation to calculate standard deviation.
We use the Assumed mean (A) and (C) a scaling down factor i.e. class
d
width Deviation t where it is c
d
i.e. t = c and d = x – A and the formula used is variance
2 2
∑ ft ∑ft
= C2 ∑ f - ∑f
= C2 × variance of t
2 2
∑ ft ∑ ft
Standard deviation s=
√
c
∑f
−
∑f
= CX standard deviation of t.
EXAMPLE
Solution
A= 144.5
37 | P a g e
C= class width
= 10
Class Mid- F d= x- A d ft Ft 2
t= 10
(cm) point(x)
120- 129 124.5 8 -20 -2 -16 32
130- 139 134.5 18 -10 -1 -18 18
140-149 144.5 33 0 0 0 0
150-159 154.5 20 10 1 20 20
160-169 164.5 16 20 2 32 64
170-179 174.5 5 30 3 15 45
∑ f =100 ∑ ft=33 2
∑ ft =179
Standard deviation
∑ ft 2 ∑ ft 2
s=
√
c
∑f
−
∑f
2
= 10 179 − 33
√ 100 100
= 10√ 1.79−(0.33)2
= 10√ 1.79−0.1089
= 10 √ 1.6811
= 10 x 1.297
38 | P a g e
= 12.97
Exercise
8,4,6,5,26,12,18,6,13,15,14,10,8,8,9,18,13,10,17,19,24,21,20,22,2
3,12,16and 24.
More exercise on
39 | P a g e