Central Tendency
Central Tendency
Department of Statistics
ii. They may differ in the extent to which the observations are dispersed about the central
value.
Central Tendency: Most of the frequency distributions have a tendency to cluster around
a central value. This property of a frequency distribution is known as central tendency.
Usually bell shaped distributions exhibit this property.
Any measure which can account for such tendency is called a measure of central tendency.
Fundamentally there are two broad types of measures:
i. Moments: based on the distances of the values from a fixed point
ii. Fractiles: based on the position of the values.
Apart, we may use some special kind of measures depending on the nature of the data
reported.
[Note:
i. 𝑤𝑖 may be interpreted as the relative importance of 𝑥𝑖 in the data set. So this
formula is often called the formula for weighted mean.
1
ii. 𝐼𝑓 𝑤𝑖 = 𝑛 ∀ 𝑖 we get formula for simple AM (given by (1)), where equal weights
have been given to each value.
iii. In case of grouped frequency distribution, in the formula given by (2), we
take 𝑥𝑖 as the class mark of the ith class to get an approximate AM of the data set.
Result-2: Let there be two sets of observations with respective sizes 𝑛1 𝑎𝑛𝑑 𝑛2 with
respective arithmetic means 𝑥̅1 𝑎𝑛𝑑 𝑥̅ 2. Then the arithmetic mean for the combined set of
𝑛1 + 𝑛2 observations is given by
𝑛1 𝑥̅1 + 𝑛2𝑥̅2
𝑥̅ 𝑐 = ------------------- (4)
𝑛1 + 𝑛2
Proof: Let the two sets of observations be
𝑥11 , 𝑥12 , ⋯ , 𝑥1𝑖 , ⋯ , 𝑥1𝑛1 𝑎𝑛𝑑 𝑥21 , 𝑥22 , ⋯ , 𝑥2𝑗 , ⋯ , 𝑥2𝑛2
Then
1
𝑥̅1 = ∑𝑛𝑖=1
1
𝑥1𝑖 => ∑𝑖=1
𝑛1
𝑥1𝑖 = 𝑛1 𝑥̅1 𝑎𝑛𝑑
𝑛1
1
𝑥̅ 2 = ∑𝑛𝑗=1
2 𝑛2
𝑥2𝑗 => ∑𝑗=1 𝑥2𝑗 = 𝑛2 𝑥̅ 2
𝑛2
Now, the arithmetic mean for the combined set of 𝑛1 + 𝑛2 observations will be
𝑛
1 𝑥 +∑ 2 𝑥 𝑛
∑𝑖=1 1𝑖 𝑗=1 2𝑗 𝑛1 𝑥̅1 + 𝑛2𝑥̅2
𝑥̅ 𝑐 = =
𝑛1 + 𝑛2 𝑛1 + 𝑛2
Result-3: If 𝑥̅1 𝑎𝑛𝑑 𝑥̅ 2 be respectively the arithmetic means of two sets of observations
with respective sizes 𝑛1 𝑎𝑛𝑑 𝑛2 and 𝑥̅ 𝑐 be the arithmetic mean for the combined set of 𝑛1 +
𝑛2 observations, then 𝑥̅ 𝑐 lies between 𝑥̅1 𝑎𝑛𝑑 𝑥̅ 2.
Proof: Without any loss of generality, let us assume that 𝑥̅1 ≤ 𝑥̅ 2 .
Then we need to prove that 𝑥̅1 ≤ 𝑥̅ 𝑐 ≤ 𝑥̅ 2 .
𝑛1 𝑥̅1+ 𝑛2 𝑥̅2
We have 𝑥̅ 𝑐 = [we need to prove this as result-2]
𝑛1 + 𝑛2
Since 𝑥̅1 ≤ 𝑥̅ 2 (by assumption)
𝑛1 𝑥̅1 + 𝑛2𝑥̅2 𝑛1 𝑥̅2 + 𝑛2 𝑥̅2
𝑥̅ 𝑐 = ≤ = 𝑥̅ 2 ----------- (5)
𝑛1 + 𝑛2 𝑛1 + 𝑛2
𝑛1 𝑥̅1 + 𝑛2 𝑥̅2 𝑛1 𝑥̅1 + 𝑛2 𝑥̅1
And 𝑥̅ 𝑐 = ≥ = 𝑥̅1 ------------ (6)
𝑛1 + 𝑛2 𝑛1 + 𝑛2
Combining (5) and (6), we have
𝑥̅1 ≤ 𝑥̅ 𝑐 ≤ 𝑥̅ 2 . ------------------ (7)
In that case, median is not rigidly defined as any value between 𝑥(𝑛) 𝑎𝑛𝑑 𝑥(𝑛+1) may be
2 2
𝑥 𝑛 +𝑥 𝑛
( ) ( +1)
2 2
taken as median. For uniqueness we take, 𝑥̃ = .
2
𝑛 (𝑛+1) 𝑛
We find m such that 𝐹𝑚−1 < or AND 𝐹𝑚 ≥ 2, then 𝑥𝑚 is the median.
2 2
3. Grouped Frequency Distribution:
In case of grouped frequency distribution, using cumulative frequencies we can locate the
class that includes the median. It is referred to as the median-class; let us denote it by [𝑥𝑚𝐿 −
𝑥𝑚𝑈 ].
𝑥 +𝑥
A crude approximation to the median may be taken as 𝑥̃ = 𝑚𝐿 2 𝑚𝑈
To get an improved formula, let us look at the segment of less than equal to type ogive with
in median class.
If we can assume that the frequencies are uniformly distributed over the median class, then
we can use linear interpolation.
[Please recall that ≤ type cumulative frequency of a class corresponds to its upper
boundary so that, we have
Where, 𝑥𝑚𝐿 and 𝑥𝑚𝑈 are the lower and the upper boundaries respectively of the median
class,
n is the total frequency,
𝑓𝑚 = 𝐹𝑚 − 𝐹𝑚−1 𝑖𝑠 the frequency of the median class,
𝑤𝑚 = 𝑥𝑚𝑈 − 𝑥𝑚𝐿 is the width of the median class,
𝐹𝑚−1 is the cumulative frequency of the class preceding the median class.
Note that: Neither we require classes of equal width, nor uniformity outside the median class.
i.e, the relative positions of y values will be same as the corresponding x values. Thus, the
median of y will be the value of y corresponding to the median of x,
i.e, 𝑦𝑚 = 𝑎𝑥𝑚 + 𝑏.
Case 2: a<0
Relative position of the y values will be the reverse of that of the corresponding x values,
i.e, a𝑥(1) + 𝑏 ≥ 𝑎𝑥(2) + 𝑏 ≥ ⋯ ≥ 𝑎𝑥(𝑛) + 𝑏.
Hence, in this case too, the median of y will be the value of y corresponding to the median
of x, i.e, 𝑦𝑚 = 𝑎𝑥𝑚 + 𝑏. -------------------- (9)
Result: The abscissa of the point of intersection of the two types of ogives gives the
median.
Proof: The less-than equal to type ogive and the greater-than equal to type ogive
respectively are, by nature, monotonically increasing and monotonically decreasing over
the same range of variation of the variable.. Hence, they are expected to intersect.
I(𝑥0 𝑦0)
≥ 𝑡𝑦𝑝𝑒 𝑜𝑔𝑖𝑣𝑒
≤ 𝑡𝑦𝑝𝑒 𝑜𝑔𝑖𝑣𝑒
Let I(𝑥0 , 𝑦0 ) be the point of intersection of the two ogives.
Then, I is a point on the “≤” Type ogive.
⇒ 𝑦𝑜 number of observations have values ≤ 𝑥0 .
Again, I is a point on the “≥” Type ogive.
⇒ 𝑦0 observations have values ≥ 𝑥0 .
∴ No.of observations with values ≤ 𝑥0 = No. of observations with values ≥ 𝑥0
Thus, by definition, 𝑥0 is the median. But 𝑥0 is the point of intersection of the two ogives.
Hence the result.
Result: Let there be two sets of observations with respective sizes 𝑛1 𝑎𝑛𝑑 𝑛2 , respective
medians 𝑀1 & 𝑀2 . Let M be the median of the combined set. Then, M lies between 𝑀1 𝑎𝑛𝑑 𝑀2.
Proof: Without any loss of generality, we assume that 𝑀1 ≤ 𝑀2.
[And for simplicity we take both 𝑛1 𝑎𝑛𝑑 𝑛2 to be even.]
Since 𝑀1 is the median of the first dataset with 𝑛1 observations,
𝑛1
number of observations of the first set are 𝑏𝑒𝑙𝑜𝑤 𝑀1 .
2
Let 𝑛0 be the number of observations of the second set which are 𝑏𝑒𝑙𝑜𝑤 𝑀1 . ----- (10)
𝑛
∴ ( 21 + 𝑛0 ) number of observations in total (combined set) are 𝑏𝑒𝑙𝑜𝑤𝑀1 ----- (11)
By assumption, since 𝑀1 ≤ 𝑀2 ,
So from (10) and (12),
𝑛2
𝑛0 ≤
2
𝑛1 𝑛1+𝑛2
⇒ 𝑛0 + ≤ = number of observations of the combined set that are ≤ 𝑀.
2 2
However, by assumption, 𝑀1 ≤ 𝑀2 .
𝑛1
⇒ k≤ 2
𝑛1
[Since k observations of the first set lie above 𝑀2 and number of observations of the first
2
set are above 𝑀1 ]
𝑛2 𝑛1 + 𝑛2
⇒ +𝑘 ≤
2 2
⇒ Number of observations above 𝑀2 ≤ Number of observations above M.
⇒ M ≤ 𝑀2 ---- (14)
Combining (13) & (14) we get, 𝑀1 ≤ 𝑀 ≤ 𝑀2 ---- (15)
Geometric Mean:
The Geometric Mean of n observations 𝑥1 , 𝑥2 , … , 𝑥𝑛 on some variable x is defined as,
1⁄
𝑥𝑔 = (∏𝑛𝑖=1 𝑥𝑖 ) 𝑛
----------- (16)
1
⇒ log 𝑥𝑔 = ∑𝑛𝑖=1 log 𝑥𝑖 --------- (17)
𝑛
Note:
i. The logarithm of geometric mean of a variable is the arithmetic mean of its logarithm.
ii. If the variable relates to the rate and ratios (i.e., the observations are in multiplicative
mode, rather than additive mode), then geometric mean is an appropriate measure of
average.
iii. The above statement may be justified by the following result
𝑥
Result: If 𝑧 = 𝑦 , and the geometric means of x, y and z respectively are
𝑥𝑔
𝑥𝑔 , 𝑦𝑔 𝑎𝑛𝑑 𝑧𝑔 , then 𝑧𝑔 = , i.e., the geometric mean of the ratio of the two
𝑦𝑔
variables is the ratio of their geometric means.
𝑥
Proof: For ih unit, If 𝑧𝑖 = 𝑦𝑖 𝑓𝑜𝑟 𝑖 = 1,2, ⋯ , 𝑛
𝑖
1⁄ 𝑥 1⁄𝑛
By definition, 𝑧𝑔 = (∏𝑛𝑖=1 𝑧𝑖 ) 𝑛 = (∏𝑛𝑖=1 𝑦𝑖 )
𝑖
1⁄
(∏𝑛
𝑖=1 𝑥𝑖 )
𝑛 𝑥𝑔
= 1⁄ = ----------------------- (18)
(∏𝑛 𝑛 𝑦𝑔
𝑖=1 𝑦𝑖 )
iv. Some statisticians consider the geometric mean to be the natural form of average for
averaging the price relative (relative change in price of a commodity). The reason
may be described as follows:
1
Suppose the price relative of two commodities, say A and B, be respectively 5 𝑎𝑛𝑑 5
1
[i.e., new price of A is 5 time its old price, while for B it is 5 times.] If equal
importance may be assigned to each price relative then on an average there will be no
change in the overall price level meaning that the average price relative is 1, which is
1
nothing but the geometric mean of 5 𝑎𝑛𝑑 5, the price relatives of A and B
respectively
v. Let us consider a data set like
𝑥1 ≈ 𝑎𝑐, 𝑥2 ≈ 𝑎𝑐 2 , ⋯ , 𝑥𝑖 ≈ 𝑎𝑐 𝑖 , ⋯ , 𝑥𝑛 ≈ 𝑎𝑐 𝑛 -------- (19)
Note that the powers of ‘c’ are first n natural numbers. Then we can get a central value
𝑛+1
of the data set when ‘c’ has a power = a central value of 1, 2,---, n = 2
This can be achieved approximately by computing geometric mean given by (16)
[That is why geometric mean comes in if one wants to compute the value at the mid-
point of a time interval when the variable changes over time exponentially.]
vi. How can you justify (17) and from that (16)?
[For the data like (19), log of the observations will be approximately in AP and
naturally for the transformed data set arithmetic mean will be an appropriate
measure of central tendency. Also log function is monotonic. Thus the formula.]
vii. Geometric mean makes sense when all the values of the variable are strictly positive.
viii. Because of its abstract nature and computational labour, geometric mean is not of
common use in statistical work.
ix. For a frequency distribution,
1⁄
𝑓 ∑ 𝑓𝑖
𝑥𝑔 = (∏𝑘𝑖=1 𝑥𝑖 𝑖 ) -------- (20)
1
⇒ log 𝑥𝑔 = ∑𝑘𝑖=1 𝑓𝑖 log 𝑥𝑖 --------- (21)
𝑛
Question: Let there be two sets of observations with respective sizes 𝑛1 & 𝑛2 and respective
geometric means 𝐺1 & 𝐺2. What will be the G.M of the combined set?
Hint: set 1: 𝑥11 , 𝑥12 , … , 𝑥1𝑛1
𝑛 1⁄
𝑛1
G.M of the 1st set: 𝐺1 = (∏𝑖=1
1
𝑥1𝑖 )
𝑛 1⁄
𝑛2
G.M of the 2nd set: 𝐺2 = (∏𝑗=1
2
𝑥2𝑗 )
Generalisation: If there are k sets with respective sizes 𝑛1 , 𝑛2 , … , 𝑛𝑘 and respective G.Ms Let
𝐺1 , 𝐺2 , … , 𝐺. Then, G.M for the combined set will be,
1
⁄∑𝑘 𝑛
𝐺𝑐 = (∏𝑘𝑖=1 𝐺 𝑛𝑖 𝑖 ) 𝑖=1 𝑖 -------------------- (23)
Question: 𝐺1 is a representative of the 𝑛1 observations of the 1st set.
𝐺2 is a representative of the 𝑛2 observations of the 2nd set.
𝑛1 𝐺1 +𝑛2 𝐺2
If one takes to be a representative of the entire dataset, what would be your opinion
𝑛1 +𝑛2
about that measure?
Some observations:
i. G.M is rigidly defined but requires 𝑥𝑖 > 0 ∀ 𝑖.
ii. With respect to sampling fluctuations G.M is preferred to A.M.
iii. Further algebraic treatment is easy.
iv. Interpretation of G.M is not as simple as that of A.M.
v. Computational difficulty of G.M is more than that of A.M.
Problem:
For a frequency distribution the upper class boundary bears a constant ratio r to the lower class
boundary. If 𝑥𝑖 𝑎𝑛𝑑 𝑓𝑖 , 𝑓𝑜𝑟 𝑖 = 1,2, ⋯ , 𝑘, be respectively the class mark and the frequency of
the ith class and G be the geometric mean of the distribution, show that
log 𝑟
log 𝐺 = log 𝑥1 + ∑𝑘𝑖=1(𝑖 − 1)𝑓𝑖 , 𝑤ℎ𝑒𝑟𝑒 𝑛 = ∑𝑘𝑖=1 𝑓𝑖
𝑛
Proof:
Let 𝑥1𝑙 be the lower boundary of the first class, then we have a frequency distribution as
Lower Boundary Upper Boundary Class Mark frequency
𝑥1𝑙 𝑟. 𝑥1𝑙 (1 + 𝑟)
𝑥1 = 𝑥1𝑙 𝑓1
2
𝑟. 𝑥1𝑙 𝑟 2 . 𝑥1𝑙 (1 + 𝑟)
𝑥2 = 𝑟. 𝑥1𝑙 = 𝑟. 𝑥1 𝑓2
2
𝑟 2 . 𝑥1𝑙 𝑟 3 . 𝑥1𝑙 (1 + 𝑟)
𝑥3 = 𝑟 2 . 𝑥1𝑙 = 𝑟 2 . 𝑥1 𝑓3
2
⋮ ⋮ ⋮ ⋮
𝑟 𝑖−1
. 𝑥1𝑙 𝑖
𝑟 . 𝑥1𝑙 (1 + 𝑟)
𝑥𝑖 = 𝑟 𝑖−1 . 𝑥1𝑙 = 𝑟 𝑖−1 . 𝑥1 𝑓𝑖
2
⋮ ⋮ ⋮ ⋮
𝑘−1 𝑘
𝑟 . 𝑥1𝑙 𝑟 . 𝑥1𝑙 𝑥𝑘 = 𝑟 𝑘−1 . 𝑥1 𝑓𝑘
1
Now, log 𝐺 = ∑𝑘𝑖−1 𝑓𝑖 log 𝑥𝑖
𝑛
1 1
= ∑𝑘𝑖−1 𝑓𝑖 log(𝑟 𝑖−1 . 𝑥1 ) = ∑𝑘𝑖−1 𝑓𝑖 [log 𝑟 𝑖−1 + log 𝑥1 ]
𝑛 𝑛
1 1
= log 𝑥1 ∑𝑘𝑖=1 𝑓𝑖 + ∑𝑘𝑖−1 𝑓𝑖 [log 𝑟 𝑖−1 ]
𝑛 𝑛
log 𝑟
= log 𝑥1 + ∑𝑘𝑖−1(𝑖 − 1)𝑓𝑖
𝑛
Harmonic Mean:
Let us consider the following problems of finding the average speed:
1. A car travels certain distance in ‘n’ hours. During ith hour it travels at a constant speed
𝑣𝑖 km per hour, i= 1(1) n.
which is nothing but the arithmetic mean of the values assumed by the variable v which
denotes speed here.
2. Now if the car travels 1st s km at a speed 𝑣1 km per hour, 2nd s km at a speed 𝑣2 km per
hour, ......and so on the last (i.e. nth) s km at a speed 𝑣𝑛 km per hour,
𝑑
It may be noted that for both the cases the variable under study is ‘speed’ v = 𝑡 , where d denotes
distance and t denotes time and while calculating average, we land up with two different
formulae. The reason is that in both the cases the variable under study is of the form ‘a per
unit b’, but in the 1st case the numerator a (here d) varies but in the 2nd case the denominator
b (here t) varies and accordingly we get two different formulae. 2nd one is known as harmonic
mean.
Thus, if the variable under study is of the form ‘a per unit b’ and the denominator b varies
Then an appropriate measure of central tendency is harmonic mean.
Basic data:
𝑥1 , 𝑥2 , . . . . . . . , 𝑥𝑛 , 𝑖. 𝑒, 𝑥𝑖 , 𝑖 = 1(1)𝑛 𝑤𝑖𝑡ℎ 𝑥𝑖 ≠ 0 ∀ 𝑖
It may be noted that this formula can be expressed as ‘the reciprocal of the arithmetic mean of
the reciprocals (of the variable x)’.
For an ungrouped frequency distribution like
x 𝑥1 𝑥2 ⋯⋯ 𝑥𝑘 Total
Frequency.(f) 𝑓1 𝑓2 ⋯⋯ 𝑓𝑘 n
The formula for (weighted) harmonic mean is given by
∑𝑘
𝑖=1 𝑓𝑖
𝑥̅ ℎ = 𝑓𝑖 ----------------------------------- (27)
∑𝑘
𝑖=1 𝑥𝑖
𝑛 1 1
= 𝑓𝑖 = 𝑓𝑖 = 𝑤𝑖
∑𝑘 ∑𝑘
𝑖=1
𝑖=1𝑥 ∑𝑘 𝑛 𝑥𝑖
𝑖 𝑖=1 𝑥
𝑖
𝑓
where 𝑤𝑖 = 𝑛𝑖 , the relative frequency of 𝑥𝑖 , is the weight attached to the value 𝑥𝑖 of the variable
x.
[In case of grouped frequency distribution, in formula (27), 𝑥𝑖 can be taken as the class mark of
the ith class.]
Result: Let there be two sets of observations with respective sizes 𝑛1 and 𝑛2 and respective
harmonic means ℎ1 and ℎ2 . Then the harmonic mean for the combined set of 𝑛1 +
𝑛2 observations will be
𝑛 +𝑛2
ℎ𝑐 = 𝑛11 𝑛2 --------------------------------------- (28)
+
ℎ1 ℎ2
Then by (26)
𝑛1 𝑛 1 𝑛
ℎ1 = 𝑛1 1 ⇒ ∑𝑖=1
1
= ℎ1 ------------------------------------------ (29)
∑𝑖=1 𝑥1𝑖 1
𝑥1𝑖
1 𝑛
∑𝑛𝑗=1
2
= ℎ2 ---------------------------------------- (30)
𝑥2𝑗 2
Again ℎ𝑐 , being harmonic mean of the combined set of 𝑛1 + 𝑛2 , will be
𝑛1 +𝑛2
ℎ𝑐 = 𝑛1 1 𝑛2 1
∑𝑖=1 + ∑𝑗=1
𝑥1𝑖 𝑥2𝑗
𝑛 +𝑛2
=𝑛11 𝑛2 [using (29) and (30)]
+
ℎ1 ℎ2
[The result can easily be generalised for k sets of observations].
Result:
For a set of n positive observations AM ≥ GM ≥ HM
Proof:
Let 𝑥1 , 𝑥2 , . . . . . . . , 𝑥𝑛 , 𝑤𝑖𝑡ℎ 𝑥𝑖 > 0∀𝑖 be a set of n positive observations on some variable x.
Then
1
AM = 𝑛 ∑𝑛𝑖=1 𝑥𝑖 = A (say)
1
GM = (∏𝑛𝑖=1 𝑥𝑖 ) = G (say)
𝑛
𝑛
HM = 1 = H (say)
∑𝑛
𝑖=1 𝑥
𝑖
So AM ≥ GM for n=2
√𝑥1 𝑥2 +√𝑥1𝑥2
≥ [since we have proved that AM ≥ GM for n=2]
2
= 4√𝑥1 𝑥2 𝑥3 𝑥4
Thus AM ≥ GM for n=4
[We have taken the choice of n to be of the form 2𝑚 because while proving the result for n=4
we need to divide the number of observations into two halves so that we can use AM ≥ GM for
n=2. Then if each half can be divided into two halves, the fact that AM ≥ GM for n=2 can
repeatedly be used. Thus at each step, each half will have to be divided into two halves so that
we need n=2𝑚 ]
It may be noted that the AM and GM of the given n observations are A and G respectively.
Let us augment (2𝑚 − 𝑛) additional observations along with 𝑥1 , 𝑥2 , . . . . . . . , 𝑥𝑛 as
𝑚 1 𝑚 −𝑛 1
And GM=(∏𝑛𝑖=1 𝑥𝑖 ∏2𝑖=𝑛+1 𝑥𝑖 )2𝑚 = (𝐺 𝑛 . 𝐴2 )2𝑚
1 1
⇒𝐻 ≥ ⇒𝐺 ≥𝐻
𝐺
i.e. GM ≥ HM ---------------------------------(34)
AM ≥ GM ≥ HM
Note that: Equality holds when all the observations are equal.
We know, from Cauchy- Schwartz inequality, that for two sets of real numbers
(𝑎1 , 𝑎2 , . . . . . . , 𝑎𝑛 )and (𝑏1 , 𝑏2 , . . . . . . , 𝑏𝑛 )
Mode:
Mode is the value of a variable corresponding to the maximum of the ideal curve which gives
the closest possible fit to the actual distribution. It represents the value which is most frequent
or typical. Mode refers to the value which is, in fact the fashion (La Mode). Empirically, we
can find a crude approximation to the mode as follows:
1. For ungrouped frequency distribution, mode = 𝑥𝑚 , where 𝑥𝑚 is the observation
corresponding to the maximum frequency 𝑓𝑚 .
2. For a grouped frequency distribution, it is a bit difficult to obtain the mode. Under the
assumption of uniformity of distributions of frequencies over all classes, we can find
what is called the modal class that corresponds to the maximum frequency. Let mth
class be the modal class.
𝑥 +𝑥
Then, a very crude approximation to the mode is, 𝑚𝐿 2 𝑚𝑈
Where 𝑥𝑚𝐿 𝑎𝑛𝑑 𝑥𝑚𝑈 are respectively the lower class and upper class boundaries of
the modal class.
A modified formula involving the frequencies of the modal class and its two adjacent
classes is given by
0 −1𝑓 −𝑓
Mode, 𝑀0 = 𝑥𝑚𝐿 + 2𝑓 −𝑓 𝑤𝑚 ------------- (36)
−𝑓
0 −1 1
Where 𝑓0 is the frequency of the modal class
𝑓−1 : frequency of the class preceding the modal class.
𝑓1 :frequency of the class next to the modal class.
𝑤𝑚 :width of the modal class.
There is an empirical formula for bell-shaped distribution which are not much
deviated from symmetry and is given by
For some distributions we may have more than one mode. These are called multi
modal distributions.
Result: Effect of change of origin and scale on Mode:
If y = ax + b and mode of x be 𝑥0 , then mode of y will be (a𝑥0 + b).