Statistics Theory
Statistics Theory
Mathematics
NURTURE COURSE
STUDY MATERIAL
Statistics
English Medium
All rights including trademark and copyrights and rights of translation etc. reserved and vested
exclusively with ALLEN Career Institute Private Limited. (ALLEN)
No part of this work may be copied, reproduced, adapted, abridged or translated, transcribed,
transmitted, stored or distributed in any form retrieval system, computer system, photographic or
other system or transmitted in any form or by any means whether electronic, magnetic, chemical or
manual, mechanical, digital, optical, photocopying, recording or otherwise, or stood in any retrieval
system of any nature without the written permission of the Allen Career Institute Private Limited.
Any breach will entail legal action and prosecution without further notice.
This work is sold/distributed by Allen Career Institute Private Limited subject to the condition and
undertaking given by the student that all proprietary rights (under the Trademark Act, 1999 and
Copyright Act, 1957) of the work shall be exclusively belong to ALLEN Career Institute Private
Limited. Neither the Study Materials and/or Test Series and/or the contents nor any part thereof i.e.
work shall be reproduced, modify, re-publish, sub-license, upload on website, broadcast, post,
transmit, disseminate, distribute, sell in market, stored in a retrieval system or transmitted in any
form or by any means for reproducing or making multiple copies of it.
Any person who does any unauthorised act in relation to this work may be liable to criminal
prosecution and civil claims for damages. Any violation or infringement of the propriety rights of
Allen shall be punishable under Section- 29 & 52 of the Trademark Act, 1999 and under Section- 51,
58 & 63 of the Copyright Act, 1957 and any other Act applicable in India. All disputes are subjected to
the exclusive jurisdiction of courts, tribunals and forums at Kota, Rajasthan only.
Note:- This publication is meant for educational and learning purposes. All
reasonable care and diligence have been taken while editing and printing this
publication. ALLEN Career Institute Private Limited shall not hold any
responsibility for any error that may have inadvertently crept in.
ALLEN Career Institute Private Limited is not responsible for the consequences
of any action taken on the basis of this publication.
Statistics
STATISTICS
An average value or a central value of a distribution is the value of variable which is representative of the
entire distribution, this representative value are called the measures of central tendency.
Generally the following five measures of central tendency.
(a) Mathematical average
(i) Arithmetic mean (ii) Geometric mean (iii) Harmonic mean
(b) Positional average
(i) Median (ii) Mode
x1 + x 2 + ..... + x n
∑x
i =1
i
x = =
n n
⇒ Σxi = n x
(ii) For ungrouped and grouped freq. dist. : If x1, x2, .... xn are values of variate with corresponding
frequencies f1, f2, ... fn then their A.M. is given by
n
f x + f x + .... + fn x n
∑f x i i n
x = 1 1 2 2
f1 + f2 + .... + fn
= i =1
N
, where N = ∑f
i =1
i
(iii) By short method : If the value of xi are large, then the calculation of A.M. by using previous
formula is quite tedious and time consuming. In such case we take deviation of variate from an
arbitrary point a.
Let di = xi – a
Σfi d i
∴ x = a+ , where a is assumed mean
N
(iv) By step deviation method : Sometime during the application of short method of finding the A.M.
If each deviation di are divisible by a common number h(let)
di x −a
Let ui = = i
h h
Σf u
∴ x = a + i i h
N
E 3
JEE-Mathematics
(v) Weighted mean : If w1, w2, ...... wn are the weights assigned to the values x1, x2, ..... xn
respectively then their weighted mean is defined as
n
w x + w 2 x 2 + ..... + w n x n
∑w x
i =1
i i
Weighted mean = 1 1 = n
w1 + ..... + w n
∑w
i =1
i
Illustration 3. Find the weighted mean of first n natural numbers when their weights are equal to their
squares respectively
1.12 + 2.22 + .... + n.n 2 13 + 23 + ..... + n 3
Solution Weighted Mean = =
12 + 22 + .... + n 2 12 + 22 + .... + n 2
[n(n + 1) / 2]2 3n(n + 1)
= =
[n(n + 1)(2n + 1) / 6] 2(2n + 1)
(vi) Combined mean : If x1 and x 2 be the means of two groups having n1 and n2 terms respectively
then the mean (combined mean) of their composite group is given by
n x + n2 x 2
combined mean = 1 1
n1 + n 2
n1 x1 + n1 x 2 + n 3 x 3 + ....
If there are more than two groups then, combined mean =
n1 + n 2 + n 3 + ....
Illustration 4. The mean income of a group of persons is Rs. 400 and another group of persons is Rs. 480.
If the mean income of all the persons of these two groups is Rs. 430 then find the ratio of the
number of persons in the groups.
Solution Here x1 = 400, x 2 = 480, x = 430
n1 x1 + n 2 x 2 400n1 + 480n 2
∵ x = ⇒ 430 =
n1 + n 2 n1 + n 2
n1 5
⇒ =
n2 3
4 E
Statistics
1. Mean of the first n terms of the A.P. a, (a + d), (a + 2d), ........ is-
nd (n − 1)d
(A) a + (B) a + (C) a + (n – 1) d (D) a + nd
2 2
2. The A.M. of first n even natural number is -
n +1 n
(A) n(n + 1) (B) (C) (D) n + 1
2 2
3. If the mean of n observations x1, x2, .......xn is x , then the sum of deviations of observations from mean is -
x
(A) 0 (B) nx (C) (D) None of these
n
n+7
4. If the mean of first n natural numbers is equal to , then n is equal to-
3
(A) 10 (B) 11 (C) 12 (D) none of these
5. If the mean of five observations x, x + 2 , x + 4, x+ 6 and x + 8 is 11, then the mean of last three
observations is-
(A) 11 (B) 13 (C) 15 (D) 17
6. The mean of a set of numbers is x . If each number is decreased by λ, the mean of the new set is-
(A) x (B) x + λ (C) λ − x (D) x − λ
1 1 1
7. The mean of values 1, , ,...... which have frequencies 1, 2, 3, ........ n resp., is :
2 3 n
2n + 1 2 n +1 2
(A) (B) (C) (D)
3 n 2 n +1
9. The weighted mean of first n natural numbers when their weights are equal to corresponding natural
number, is :-
n +1 2n + 1 (n + 1)(2n + 1)
(A) (B) (C) (D) None of these
2 3 6
10. In a group of students, the mean weight of boys is 65 kg. and mean weight of girls is 55 kg. If the mean
weight of all students of group is 61 kg, then the ratio of the number of boys and girls in the group is :-
(A) 2 : 3 (B) 3 : 1 (C) 3 : 2 (D) 4 : 3
E 5
JEE-Mathematics
(i) For ungrouped dist. : If x1, x2, ..... xn are n positive values of variate then their geometric mean G
is given by
G = (x1 × x2 × ..... × xn)1/n
1 n
⇒ G = antilog
n
∑ log x
i =1
i
(ii) For freq. dist. : If x1, x2, .... xn are n positive values with corresponding frequencies f1, f2, .... fn
1 n
⇒ G = antilog
N
∑ f log x
i =1
i i
Note :- If G1 and G2 are geometric means of two series which containing n1 and n2 positive values
n log G1 + n 2 log G2
⇒ G = antilog 1
n1 + n 2
(i) For ungrouped dist. : If x1, x2, ..... xn are n non-zero values of variate then their harmonic mean
H is defined as
n n
H= =
1 1 1 n
1
+
x1 x 2
+ .... +
xn ∑x
i =1 i
(ii) For freq. dist. : If x1, x2, .... xn are n non-zero values of variate with corresponding frequencies
f1, f2, ..... fn respectively the their H.M.
N N
H= =
f1 f2 f n
fi
+
x1 x 2
+ .... + n
xn ∑x
i =1 i
6 E
Statistics
1 1 1 1
Illustration 6. Find the H.M. of , , ,....., .
2 3 4 17
n 16 2
Solution H.M. = = =
1 1 1 2 + 3 + .... + 17 19
+ + ..... +
x1 x 2 xn
Note :- If A, G, H are A.M. G.M. H.M. of a series respectively then
A≥G≥H
3. If G1 and G2 are geometric mean of two series of sizes n1 and n2 resp. and G is geometric mean of their
combined series, then log G is equal to :-
log11 + log G2 n1log11 + n 2 log G2
(A) log G1 + log G2 (B) n1 log G1 + n2log G2 (C) (D)
n1 + n 2 n1 + n 2
7. A boy goes to school from his home at a speed of x km/hr. and comes back at a speed of y km/hr. then the
average speed of the boy is :-
x+y 2xy x+y
(A) km / hr (B) xy km / hr (C) km / hr (D) km / hr
2 x+y 2xy
E 7
JEE-Mathematics
1.4 MEDIAN
The median of a series is the value of middle term of the series when the values are written in ascending
order. Therefore median, divided an arranged series into two equal parts.
Formulae of median :
(i) For ungrouped distribution : Let n be the number of variate in a series then
n + 1 th
term, (when n is odd)
2
Median =
th th
Mean of n and n + 1 terms, (when n is even)
2 2
(ii) For ungrouped freq. dist. : First we prepare the cumulative frequency (c.f.) column and Find
value of N then
N + 1 th
term, (when N is odd)
2
Median =
th th
Mean of N and N + 1 terms, (when N is even)
2 2
(iii) For grouped freq. dist : Prepare c.f. column and find value of then find the class which contain
value of c.f. is equal or just greater to N/2, this is median class
N
2 −F
∴ Median = + × h
f
where — lower limit of median class
f — freq. of median class
F — c.f. of the class preceeding median class
h — Class interval of median class
N 100
Here = = 50 which lies in the value 78 of c.f. hence corresponding class of this c.f.
2 2
is 20-30 is the median class, so
= 20, f = 40, F = 38, h = 10
N
2 −F
∴ Median = + × h = 20 + (50 − 38) × 10 = 23
f 40
8 E
Statistics
1.5 MODE :
In a frequency distribution the mode is the value of that variate which have the maximum frequency
Method for determining mode :
(i) For ungrouped dist. : The value of that variate which is repeated maximum number of times
(ii) For ungrouped freq. dist. : The value of that variate which have maximum frequency.
(iii) For grouped freq. dist. : First we find the class which have maximum frequency, this is model calss
f0 − f1
∴ Mode = + × h
2f0 − f1 − f2
where — lower limit of model class
f0 — freq. of the model class
f1 — freq. of the class preceeding model class
f2 — freq. of the class succeeding model class
h — class interval of model class
In a moderately asymmetric distribution following relation between mean, median and mode of a
distribution. It is known as empirical formula.
Mode = 3 Median – 2 Mean
Note (i) Median always lies between mean and mode
(ii) For a symmetric distribution the mean, median and mode are coincide.
E 9
JEE-Mathematics
Illustration 9. Find the range of following numbers 10, 8, 12, 11, 14, 9, 6
Solution Here greatest value and least value of the distribution are 14 and 6 resp. therefore
Range = 14 – 6 = 8
(ii) Mean deviation (M.D.) : The mean deviation of a distribution is, the mean of absolute value of
deviations of variate from their statistical average (Mean, Median, Mode).
If A is any statistical average of a distribution then mean deviation about A is defined as
n
∑| x
i =1
i − A|
Mean deviation = (for ungrouped dist.)
n
n
∑f |x
i =1
i i − A|
Mean deviation = (for freq. dist.)
N
Note : Mean deviation is minimum when it taken about the median
Illustration 10. Find the mean deviation of number 3, 4, 5, 6, 7 about the mean.
Solution Here n = 5, x = 5
Σ | xi − x |
∴ Mean deviation =
n
1
= [|3 – 5| + |4 – 5| + |5 – 5| + |6 – 5| + |7 – 5|]
5
1 6
= [2 + 1 + 0 + 1 + 2] = = 1.2
5 5
Illustration 11. Find the mean deviation about mean from the following data
Xi 3 9 17 23 27
fi 8 10 12 9 5
Solution
xi fi fi xi |xi – x | Xi | xi – x |
3 8 24 12 96
9 10 90 6 60
17 12 204 2 24
23 9 207 8 72
27 5 135 12 60
N = 44 Σfixi = 660 Σfi |xi – x | = 312
Σfi x i 660
Mean ( x ) = = = 15
N 44
Σfi | x i − x | 312
Mean deviation = = = 7.09
N 44
10 E
Statistics
5. The relationship between mean, median and mode for a moderately skewed distribution is-
(A) mode = median – 2 mean (B) mode = 2 median – mean
(C) mode = 2 median – 3 mean (D) mode = 3 median – 2 mean
8. Mean deviation of the observations 70, 42, 63, 34, 44, 54, 55, 46, 38, 48 from median is :-
(A) 7.8 (B) 8.6 (C) 7.6 (D) 8.8
9. Mean deviation of 5 observations from their mean 3 is 1.2, then coefficient of mean deviation is :-
(A) 0.24 (B) 0.4 (C) 2.5 (D) None of these
E 11
JEE-Mathematics
(iii) Variance and standard deviation : The variance of a distribution is, the mean of squares of
deviation of variate from their mean. It is denoted by σ2 or var(x).
The positive square root of the variance are called the standard deviation. It is denoted by σ or S.D.
Hence standard deviation = + variance
σ
(iii) Coefficient of S.D. =
x
σ
Coefficient of variation = × 100 (in percentage)
x
Note :- σ2 = σx2 = σd2 = h2 σu2
{ }
2
n(n + 1)(2n + 1) n(n + 1) n2 − 1
= − =
6n 2n 12
18 18
Illustration 13. If ∑
i =1
(x i − 8) =
9 and ∑ (x
i =1
i − 8)2 =
45 , then find the standard deviation of x1, x2, .... x18
12 E
Statistics
The mean square deviation of a distribution is the mean of the square of deviations of variate from
assumed mean. It is denoted by S2
Σ(x i − a)2 Σd i2
Hence S2 = = (for ungrouped dist.)
n n
Σfi (x i − a)2 Σf d 2
S2 = = i i (for freq. dist.), where di = (xi – a)
N N
Solution Let a = 7, h = 2
xi – a
Class xi fi ui = fiui fi u2i
h
0-2 1 2 –3 –6 18
2-4 3 7 –2 –14 28
4-6 5 12 –1 –12 12
6-8 7 19 0 0 0
8-10 9 9 1 9 9
10-12 11 1 2 2 4
N = 50 Σfi ui = –21 Σfi u2i = 71
Σf u 2 Σf u
2 71 −21 2
∴ σ2 = h 2 i i − i i = 4 − = 4[1.42 – 0.1764] = 4.97
N N 50 50
E 13
JEE-Mathematics
n2 − 1 n2 − 1 n2 − 1 n2 − 1
(A) (B) (C) (D)
2 3 4 12
10 10
4. If ∑
i =1
(x i − 15) =
12 and ∑ (x
i =1
i − 15)2 =
18 then the S.D. of observations x1, x2 ............ x10 is :-
2 3 4
(A) (B) (C) (D) None of these
5 5 5
7. If each observation of a dist. whose S.D. is σ, is increased by λ, then the variance of the new observations
is -
(A) σ (B) σ + λ (C) σ2 (D) σ2 + λ
8. If each observation of a dist., whose variance is σ2, is multiplied by λ, then the S.D. of the new new
observations is-
(A) σ (B) λσ (C) |λ|σ (D) λ2 σ
14 E
Statistics
∑w x
i =1
i i
Solution Weighted mean = n
∑w
i =1
i
2 × 60 + 1 × 70 + 1 × 70 + 2 × 80
= = 70
6
Illustration 2. The mean of two groups of sizes 200 and 300 are 25 and 10 respectively. Their standard
deviation are 3 and 4 respectively. The variance of combined sample of size 500 is :
(A) 64 (B) 65.2 (C) 67.2 (D) 64.2
n1 x1 + n 2 x 2 200 × 25 + 300 × 10
Solution =
Combined mean x = = 16
n1 + n 2 500
n (σ 2 + d12 ) + n 2 (σ 22 + d 22 )
We know that σ 2 = 1 1
n1 + n 2
Illustration 3. If the mean of the series x1, x2, ......, xn is x , then the mean of the series xi + 2i, i = 1, 2,
........, n will be :
x1 + x 2 + ..... + x n
Solution As given x = ....(1)
n
If the mean of the series xi + 2i, i = 1, 2, ....., n be x , then
2n(n + 1)
= x+ from (1)
2n
= x + n +1
E 15
JEE-Mathematics
Solution Σf = 120 = 17 + f1 + 32 + f2 + 19
⇒ f1 + f2 = 52 ...... (1)
and Σfx = (10 × 17) + (30 × f1) + (50 × 32) + (70 × f2) + (90 × 19)
= 30f1 + 70f2 + 3480
Σ fx 30f1 + 70f2 + 3480
∴ x = ⇒ 50 =
Σf 120
⇒ 30f1 + 70f2 = 2520 ⇒ 3f1 + 7f2 = 252 ..... (2)
by (1) and (2) we get f1 = 28, f2 = 24
Illustration 6. A student obtained 75%, 80%, 85% marks in three subjects. If the marks of another subject
are added then his average marks can not be less than :
(A) 60% (B) 65% (C) 80% (D) 90%
Solution Total marks obtained from three subjects out of 300 = 75 + 80 + 85 = 240
if the marks of another subject is added then total marks obtained out of 400 is greater than 240
if marks obtained in fourth subject is 0 then
240
minimum average marks = × 100 = 60%
400
Illustration 7. The mean and variance of a series containing 5 terms are 8 and 24 respectively. The mean
and variance of another series containing 3 terms are also 8 and 24 respectively. The
variance of their combined series will be :
(A) 20 (B) 24 (C) 25 (D) 42
n1σ12 + n 2 σ 22 n1n 2
Solution =
Using σ2 + (x1 − x 2 )2
n1 + n 2 (n1 + n 2 )2
5(24) + 3(24) 5(3)
⇒
= σ2 + (8 − 8)2 = 24
5+3 (5 + 3)2
16 E
Statistics
Illustration 8. The mean deviation about median from the following data 340, 150, 210, 240, 300, 310,
320, is-
(A) 52.4 (B) 52.5 (C) 52.8 (D) None of these
Solution Arranging the observations in ascending order of magnitude, we have 150, 210, 240, 300,
310, 320, 340. Clearly, the middle observation is 300. So, median = 300
Calculation of Mean deviation
xi |xi – 300|
340 40
150 150
210 90
240 60
300 0
310 10
320 20
1 370
Mean deviation from median = ∑ | x i − 300|= = 52.8
7 7
3.5 3 –3 –9 27
4.5 7 –2 –14 28
5.5 22 –1 –22 22
6.5 60 0 0 0
7.5 85 1 85 85
8.5 32 2 64 128
9.5 8 3 24 72
N = ∑fi = 217 ∑fidi = 128 ∑fidi2 = 362
E 17
JEE-Mathematics
Illustration 10. If a variable takes the value 0, 1, 2......n with frequencies proportional to the binomial
coefficients nC0, nC1,.....,nCn then the mean of the distribution is-
1 n
=
Thus x = n
(n 2n −1 ) .
2 2
Illustration 11. The mean and variance of 5 observations of an experiment are 4 and 5.2 respectively. If from
these observations three are 1, 2 and 6, then the remaining will be-
(A) 2, 9 (B) 5, 6 (C) 4, 7 (D) 3, 8
2
Solution As given x = 4 , n = 5 and σ = 5.2. If the remaining observations are x1, x2 then
∑(x i − x)2
σ2 = = 5.2
n
x1 + x 2 + 1 + 2 + 6
Also x=4 ⇒ = 4 ⇒ x1 + x2 = 11 ....(2)
5
from eq.(1), (2) x1, x2 = 4, 7
Illustration 12. The mean deviation of the series a, a + d, a + 2d, ......, a + 2nd from its mean is-
n +1 n(n + 1) n(n − 1)
(A) |d | (B) |d | (C) |d | (D) none of these
2n + 1 2n + 1 2n + 1
Solution Number of terms in the series = 2n + 1
a + (a + d) + (a + 2d) + ..... + (a + 2nd)
∴ mean x =
2n + 1
1 2n + 1
= (a + a + 2nd) = a + nd
2n + 1 2
Also ∑|xi – x | =|–nd| + |(1–n)d|+.......+|–d|+0 + |d| + ........+ |nd|
n(n + 1)
= 2|d|[n+(n–1) + .......+ 1] = 2|d| = n(n + 1)|d |
2
∑ | x i − x | n(n + 1)
∴ mean deviation from mean = = |d |
N 2n + 1
18 E
Statistics
Illustration 13. Let x1, x2 ....., xn be values taken by a variable X and y1, y2, ..., yn be the values taken by a
[∵ yi = axi + b; i = 1, 2, ....., n ⇒ Y = a X + b]
n
1
⇒ Var(Y) = ∑
n i =1
a 2 (x i − X)2
1 n
2
⇒ Var(Y) = a2
n
∑ (x − X) = a
i =1
i
2
Var(X)
Illustration 14. The mean square deviation of a set of n observations x1, x2, ....., xn about a point c is defined
n
1
as
n ∑ (x − c)
i =1
i
2
. The mean square deviation about –2 and 2 are 18 and 10 respectively,
⇒ 2Σ x i2 + 8n = 28 n and 8Σxi = 8n
⇒ Σ x i2 = 10 n and Σxi = n
Σx i2 Σx i
⇒ = 10 and =1
n n
2
Σx i2 Σx
∴ σ= − i = 10 − (1)2 = 3
n n
E 19
JEE-Mathematics
ANSWER KEY
BEGINNER'S BOX-1
1. (B) 2. (D) 3. (A) 4. (B) 5. (B) 6. (D)
BEGINNER'S BOX-2
1. (C) 2. (B) 3. (D) 4. (D) 5. (C) 6. (C)
7. (C)
BEGINNER'S BOX-3
1. (D) 2. (C) 3. (B) 4. (D) 5. (D) 6. (B)
BEGINNER'S BOX-4
1. (A) 2. (D) 3. (B) 4. (B) 5. (B) 6. (A)
7. (C) 8. (C)
20 E