MEASURES OF
VARIATION, SKEWNESS
AND KURTOSIS
GROUP D
MEASURES OF VARIATION, SKEWNESS AND KURTOSIS
TOPICS REPORTERS TOPIC REPORTERS
1. VARIABILITY Mizzie Carvajal 7. STANDARD DEVIATION Julius Gilbert
Mallen
2. RANGE Eyra Jane Velasco
8. COEFFICAIENT OF Ronel Alde
3. INTERQUARTILE RANGE Ericah Joy Lesigues VARIATION
4. SEMI- INTERQUARTILE
5. MEAN ABSOLUTE Mybel Candido
9. SKEWNESS Gina Mae
DEVIATION Gabilan
6. VARIANCE Edson Louis Para 10. KURTOSIS Jessa Arago
Let's Learn
VARIABILITY
What is variability?
• is a summary statistic that
represents the amount of
dispersion in a dataset.
• it measures how much your scores
differ from each other.
0, 2, 6, 10, 12 mean = 6
8, 7, 6, 5, 4 mean = 6
6, 6, 6, 6, 6 mean = 6
Using the mean to predict a highly
variable data set is much less useful
because the score is less
representative, but with low
variability, we have high predictability
LOW DISPERSION
easier to make predictions about the
population
HIGH DISPERSION
harder to make predictions about the
population
Example
You are investigating the amounts of time spent on
phones daily by different groups of people.
Sample A: HIGH SCHOOL STUDENTS
Sample B: COLLEGE STUDENTS
Sample C: ADULT FULL-TIME EMPLOYEES
In statistics, the range of
observations is the value that is
obtained from the difference
between the highest and lowest
values.
Formula for finding the range:
R = Highest observation – Lowest observation
or
R = Highest value – Lowest value
Find the range of given observations:
32, 41, 28, 54, 35, 26, 23, 33, 38, 40
23, 26, 28, 32, 33, 35, 38, 40, 41, 54
Range = 54 – 23
= 31
Find the range of given observations:
48, 50, 50, 51, 53, 90, 90, 91, 92, 93
Range = 93 – 48
=45
Find the range of the given grouped data.
Scores of Grade 7 in Mathematics class.
Scores f Range = Highest boundary -Lowest boundary
of the higher and lower class interval
36-40 3
R = Hb - Lb
31-35 11
26-30 14
21-25 10
Find the range of the given grouped data.
Scores of Grade 8 in English class.
Scores f Range = Highest boundary -Lowest boundary
of the higher and lower class interval
31-35 6
R = Hb - Lb
26-30 18
21-25 14
16-20 10
LET’S HAVE A
SHORT REVIEW!
SHORT
REVIEW?!?
Quartiles are the values that divide a list of numerical data
into three quarters. There are three quartiles, first,
second and third, denoted by Q1, Q2 and Q3. Here, Q2 is
nothing but the median of the given data.
Ex: 12 13 14 14 15 16 17 18 19 19 20
In Statistics, the range is the smallest of all the measures of dispersion.
It is the difference between the two extreme conclusions of the
distribution. In other words, the range is the difference between
the maximum and the minimum observation of the distribution.
It is defined by
Range = Xmax – Xmin
20-12= 8
Where Xmax is the largest observation and Xmin is the smallest
observation of the variable values.
The interquartile range defines the difference between the
third and the first quartile. Quartiles are the partitioned
values that divide the whole series into 4 equal parts. So,
there are 3 quartiles. First Quartile is denoted by Q1 known
as the lower quartile, the second Quartile is denoted by
Q2 and the third Quartile is denoted by Q3 known as the
upper quartile. Therefore, the interquartile range is equal
to the upper quartile minus lower quartile.
The difference between the upper and lower quartile is
known as the interquartile range. The formula for the
interquartile range is
Interquartile range = Upper Quartile – Lower Quartile
IQR = Q3 – Q1
How to Calculate the Interquartile Range?
The procedure to calculate the interquartile range is given as follows:
● Arrange the given set of numbers into increasing or decreasing order.
● Then count the given values. If it is odd, then the center value is median
otherwise obtain the mean value for two center values. This is known as
Q2 value. If there are even number of values, the median will be the average
of the middle two values.
● Median equally cuts the given values into two equal parts. They are
described as Q1 and Q3 parts.
● The median of data values below the median represents Q1.
● The median of data values above the median value represents Q3.
● Finally, we can subtract the median values of Q1 and Q3.
● The resulting value is the interquartile range.
Let’s try.
In the given data set, solve for the
Interquartile Range.
12 13 14 14 15 16 17 18
19 19 20
Let’s try.
The following are the exam scores of fifteen
grade 7 students,solve for the Interquartile
Range.
44 42 50 47 49
44 43 42 45 48
41 40 41 46 48
The semi-interquartile range is defined as the measures
of dispersion. Semi interquartile range also is defined as
half of the interquartile range. It is computed as one half
the difference between the 75th percentile (Q3) and the
25th percentile (Q1). The semi-interquartile range is one-
half of the difference between the first and third
quartiles. The Formula for Semi Interquartile Range is
● Semi Interquartile Range = (Q3– Q1) / 2
Let’s try.
In the given data set, solve for the
Interquartile Range.
12 13 14 14 15 16 17 18
19 19 20
Let’s try.
The following are the exam scores of fifteen
grade 7 students,solve for the Interquartile
Range.
44 42 50 47 49
44 43 42 45 48
41 40 41 46 48
INTERQUARTILE AND SEMI- INTERQUARTILE IN
RESEARCH
•Descriptive Statistics: IQR helps summarize data by
indicating the range within which the central 50% of values lie,
giving a clear picture of variability without being affected by
outliers.
•Comparative Studies: Researchers often use IQR to
compare the spread of different groups or conditions, helping to
identify differences in data distributions.
THANK YOU
THANK YOU?!?
Reference:
https://byjus.com/maths/interquartile-range/
MEAN ABSOLUTE
DEVIATION
Mybel B. Candido
Definition
• is a statistical measure that quantifies the
average distance between each data point in a
dataset and the mean of that dataset.
• is a valuable tool for assessing variability and
dispersion, providing insights into the consistency
of data.
FORMULA
Example:
dataset:
Calculate the Mean:
Calculate Absolute
Deviations:
Sum of Absolute Deviations:
1.2+2.8+0.8+0.2+2.2=7.2
Calculate MAD:
VARIANCE
GROUP D
VARIANCE
In statistics, variance measures variability from
the average or mean. It is calculated by taking
the differences between each number in the
data set and the mean, squaring the differences
to make them positive, and then dividing the
sum of the squares by the number of values in
the data set.
POPULATION VARIANCE
If we have a population data set, the formula is written as
σ2 = ∑ (xi – x̄)2
n
where,
•x̄ is the mean of population data set
•n is the total number of observations
SAMPLE VARIANCE
If the data set is a sample the formula of variance is given by
s2 = ∑ (xi – x̄)2
(n – 1)
● where,
• x̄ is the mean of sample data set
• n is the total number of observations
STANDARD DEVIATION
Standard Deviation
It is a measure of how dispersed the
data is in relation to the mean. It tells
us how far each score lies from the
mean.
Standard Deviation
In normal distribution, a high standard
deviation means the values are generally
far from the mean, while a low standard
deviation indicates that values are
clustered close to the mean.
Population Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
Sample Standard Deviation
σ 𝒙𝒊 − 𝒙 𝟐
𝒔=
𝒏−𝟏
Sample Standard Deviation
82, 88, 89, 93, 98
𝒔𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒂𝒍𝒍 𝒔𝒄𝒐𝒓𝒆𝒔
𝒙=
𝑵
𝟖𝟐 + 𝟖𝟖 + 𝟖𝟗 + 𝟗𝟑 + 𝟗𝟖
𝒙=
𝟓
𝟒𝟓𝟎
𝒙=
𝟓
𝒙 = 𝟗𝟎
Sample Standard Deviation
82, 88, 89, 93, 98
Score Deviation Squared Deviation
𝒙𝒊 𝒙𝒊 − 𝒙 𝒙𝒊 − 𝒙 𝟐
98 8 64
93 3 9
89 -1 1
88 -2 4
82 -8 64
Sample Standard Deviation
σ 𝒙𝒊 − 𝒙 𝟐
𝒔=
𝒏− 𝟏
𝟏𝟒𝟐
𝒔=
𝟓−𝟏
𝟏𝟒𝟐
𝒔=
𝟒
Sample Standard Deviation
𝒔 = 𝟑𝟓. 𝟓
𝒔 = 𝟓. 𝟗𝟓𝟖 𝒐𝒓 𝟔
Population Standard Deviation
58, 69, 73, 76, 77, 84, 85, 89, 92,
97
𝝁=
𝒔𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒂𝒍𝒍 𝒔𝒄𝒐𝒓𝒆𝒔
𝑵
58 + 69 + 73 + 76 + 77 + 84 + 85 + 89 + 92 + 97
𝝁=
𝟏𝟎
𝟏𝟔𝟎
𝝁=
𝟓
𝝁= 𝟑𝟐
Population Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
Population Standard Deviation
Score Deviation Squared Deviation
𝒙𝒊 𝒙𝒊 − 𝝁 𝒙𝒊 − 𝝁 𝟐 𝝁 = 𝟑𝟐
97 65 4,225
92 60 3,600
89 57 3,249
85 53 2,809 σ 𝒙𝒊 − 𝝁 𝟐 = 24,274
84 52 2,704
77 45 2,025
76 44 1,936
73 41 1,681
69 37 1,369
58 26 676
Population Standard Deviation
σ 𝒙𝒊 − 𝝁 𝟐
𝝈=
𝑵
24,274
𝝈=
𝟏𝟎
𝝈= 𝟐, 𝟒𝟐𝟕. 𝟒
𝝈 = 𝟒𝟗. 𝟐𝟔
WHY DO WE USE SKEWNESS IN
REASEARCH?
-Skewness is used to detect outliers in a data set. Outliers
are to be removed when a good analysis is required in a
data set. Outliers are also indicators of bad data and may
be worth investigating.
An outlier is a single data point that goes far outside the
average value of a group of statistics. Outliers may be
exceptions that stand outside individual samples of
populations as well.
KURTOSIS
jessa p. arago
Group d reporter
Understanding Kurtosis
▪ Kurtosis is a descriptive statistic used to help measure how data
disperse between a distribution's center and tails, with larger values
indicating a data distribution may have “heavy” tails that are thickly
concentrated with observations or that are long with extreme
observations.
▪ Kurtosis is a measure of the combined weight of a distribution’s tails
relative to the center of the distribution curve (the mean).
▪ Kurtosis is sometimes confused with a measure of the peakedness of
a distribution.
▪ kurtosis measures “tailedness,” not “peakedness.”
FORMULA AND CALCULATIONS: