MEASURES OF LOCATION
MEASURES OF CENTRAL TENDENCY
FRACTILES
Bienvenido D. Vendiola III, MATHEd Dalubhasaan ng Lungsod ng Lucena
MEASURES OF CENTRAL TENDENCY
are numerical descriptive measures which
indicate or locate the center of a distribution
or data set.
• Mean
• Median
• Mode
1
STATISTICAL MEAN
• The statistical mean is commonly called the average
• To find the mean of a group of numbers:
• Add the numbers together
• Divide by how many numbers were added together
• where:
2
• x̄ = mean
• = sum of the measurements or values
• n = number of measurements
THE WEIGHTED MEAN
• where:
• x̄ = mean
3 • X = measurement or value
• W = weight
THE WEIGHTED MEAN
• where:
• WM = weighted mean
4 • f = frequency
• n = total number of respondents
THE WEIGHTED MEAN
3.26 -4.00 Strongly Agree (SA) 3.51 -4.00 Strongly Agree (SA)
2.51 -3.25 Agree (A) 2.51 -3.50 Agree (A)
1.76 – 2.50 Disagree (D) 1.51 – 2.50 Disagree (D)
4 1.00 – 1.75 Strongly Disagree (SD) 1.00 – 1.50 Strongly Disagree (SD)
The Mean for Grouped Data
• The Classmark Formula
• where:
• x̄ = mean
5 • f = frequency
• = classmark (midpoint)
• n = total frequency
The Mean for Grouped Data
The Classmark Formula
CLASSES Frequency (f) Class Mark/Midpoint (X) FXm
5-12 2 8.5
13-20 2 16.5
21-28 8 24.5
29-36 3 32.5
5 37-44 1 40.5
45-52 4 48.5
THE CODED FORMULA
• where:
• x̄ = mean
• = assumed mean
• f = frequency
6 • d = coded deviation
• n = total frequency
• i = class size
THE CODED FORMULA
CLASSES Frequency Class Mark/Midpoint (X) d fd
5-12 2 8.5
13-20 2 16.5
21-28 8 24.5
29-36 3 32.5
6 37-44 1 40.5
45-52 4 48.5
CHARACTERISTICS OF THE MEAN
• the most appropriate measure of central
tendency when the data are in the interval or
ratio scale - absolute
• lies between the largest and smallest value
• there is only one value for the mean for a given
set of values or measurements
• easily influenced by the extreme values because
7
all values contribute to the average
THE MEDIAN
• The statistical median is middle number of a group of
numbers that have been arranged in order by size. If
there is an even number of terms, the median is the
mean of the two middle numbers.
• To find the median of the group of numbers:
• Arrange the numbers in order by size
• If there is an odd number of terms, the median is the
8 center term.
• If there is an even number of terms, add the two
middle terms and divide by 2.
THE MEDIAN
𝒙
• where:
• = median
𝒙
• l = lower class boundary of the median class
• n = total frequency
• <cf = less than cumulative frequency above (below)
9 the median class
• i = size of the class interval
• = frequency of the median class
THE MEDIAN
CLASSES Frequency Class Boundary <cf
5-12 2 4.5-12.5 2
13-20 2 12.5-20.5 4
21-28 8 20.5-28.5 12
29-36 3 28.5-36.5 15
6
37-44 1 36.5-44.5 16
45-52 4 44.5-52.5 20
CHARACTERISTICS OF THE MEDIAN
• the most appropriate measure of central tendency
for interval data
• lies between the highest and the lowest
measurements
• there is only one value for the median in each set of
measurements
• not influenced by extreme values
10 • used when the middle value is desired. It is the value
where 50% or half of the distribution lies above it
and 50% lies below it
STATISTICAL MODE
• The statistical mode is the number that occurs most
frequently in a set of numbers.
• To find the mode of the group of numbers:
• Arrange the numbers in order by size
• Determine the number of instances of each numerical value.
• The numerical value that has the most instances is the mode.
• There may be more than one mode when two or more
numbers have an equal number of instances, and this is also
11 the maximum instances (unimodal, bimodal, multimodal)
• A mode does not exist if no number has more than one
instance.
• Example: The mode of 2, 4, 5, 5, 5, 7, 8, 8, 9, 12 is 5.
MODE
• where:
• = Mode
• = lower class boundary of the modal class
• = frequency of the modal class
12 • = frequency above (below) the modal class
• = frequency below (above) the modal class
• i = size of the class interval
CHARACTERISTICS OF THE MODE
• the most appropriate measure of central tendency
when the data are nominal in scale
• the least reliable among the three measures of
central tendency because its value is undefined in
some distributions
• used when we want to find out the value which
13 occurs most often
• quick approximation of the average; sometimes
referred to as an inspection average
MIDRANGE
• lowest value plus highest value, divided by 2
14
OTHER MEASURES OF LOCATION
Quartiles
• are values which divide the distribution into four equal
parts
Deciles
• are values which divide the distribution into ten equal
parts
Percentiles
15 • are values which divide the distribution into 100 equal
parts
• Percentile
• A teacher gives a 20-point test to 10 students.
• The scores are 18, 15, 12, 6, 8, 2, 3, 5, 20, 10
• Find the percentile rank of a score of 12
• Solution:
• Arrange the data from lowest to highest
16 • 2, 3, 5, 6, 8, 10, 12, 15, 18, 20
• Then substitute into the formula. P=65th percentile
• Thus, a student whose score was 12 did better than 65% of the
class
FINDING A DATA VALUE CORRESPONDING TO A GIVEN PERCENTILE
• Steps:
• 1. Arrange the data in order from lowest to highest
• 2. Substitute into the formula
• where n = total number of values below
• p = percentile
• 3. If c is not a whole number, round up to the next whole
number. Starting at the lowest value, count over to the
17 number that corresponds to the rounded-up value.
• If c is a whole number, use the value halfway between
the cth and (c+1)th values when counting up from the
lowest value
• Find the percentile rank for a score of 6.
• Thus, a student who scored 6 did better than 35% of the class
• Find the value corresponding to the 25th percentile
• Round up to 3, then count over from the lowest to the third value,
18 which is 5.
• Hence, the value 5 corresponds to the 25th percentile.
OUTLIERS
• An outlier is an extremely high or extremely low
data value when compared with the rest of the
data values.
19
MEASURES OF VARIABILITY OR DISPERSION
• are measures of the average distance of each
observation from the center of the distribution. They
measure the homogeneity or heterogeneity of a
particular group.
• A small variability would indicate that:
• clustered closely around the mean
• more homogenous
20 • less variable
• more consistent
• more uniformly distribute
Ex.
• Consider the following sets of grades: (mean and
location in a number line)
• A: 70, 95, 60, 80, 100
• B: 82, 80, 83, 81, 79
21
RANGE
• R = HV - L V
• Disadvantages:
• unstable descriptive measure of dispersion
• unreliable
• the range of two sets of data composed of different
numbers of samples are not directly comparable
22
MEAN ABSOLUTE DEVIATION
• is the average of the summation of the absolute
deviation of each observation from the mean
• Limitations
• greatly affected by extreme values since the mean is
23 used in its computation
• its use in further statistical computation is very limited
• it is not amenable to algebraic manipulation because of
the use of absolute value
VARIANCE
• is the average of the squared deviation from the mean
• Population variance:
• Sample variance:
• where:
• X = value from the raw data
24 • x̄ = mean
• N = total population
• n = sample
STANDARD DEVIATION
• is the squared root of the average deviation from the
mean, or simply the square root of the variance
• Population standard deviation:
25 • Sample standard deviation:
S
SHORTCUT OR COMPUTATIONAL FORMULA FOR VARIANCE AND SD
2
𝑛(σ 𝑥 ) − (σ 𝑥) 2
26 𝑠=
𝑛(𝑛 − 1)
RANGE RULE OF THUMB
• A rough estimate of the standard deviation is
• It can be used to estimate the largest and smallest
data values of a data set
• The smallest data value will be approximately 2
27 standard deviations below the mean
• The largest data value will be approximately 2
standard deviations above the mean
MEASURES OF VARIABILITY FOR GROUPED DATA
• Range:
• R = UL – LL
where UL = upper limit and
LL = lower limit
STANDARD DEVIATION
• Population standard deviation:
• Sample standard deviation:
S
• where:
• X = classmark
• x̄ = mean
• f = frequency
MEASURES OF SKEWNESS
• Normal Distribution
• is a distribution with a bell-shaped appearance
• mean = median = mode
A. DISTRIBUTION OF CORRECT ANSWERS OF 19 STUDENTS
WHO PARTICIPATED IN A MATH CONTEST
No. of Correct Answer f
1 1
2 2
3 4
4 5
5 4
6 2
7 1
N = 19
B. DISTRIBUTION OF CORRECT ANSWERS OF 19 STUDENTS
WHO PARTICIPATED IN A MATH CONTEST
No. of Correct Answer f
1 0
2 0
3 1
4 2
5 4
6 9
7 3
N = 19
C. DISTRIBUTION OF CORRECT ANSWERS OF 19 STUDENTS
WHO PARTICIPATED IN A MATH CONTEST
No. of Correct Answer f
1 3
2 9
3 4
4 2
5 1
6 0
7 0
N = 19
-
SKEWNESS
• refers to the degree of symmetry and asymmetry of a distribution
• Skewed to the left / negatively skewed
• if the mean is less than the median; the bulk of the distribution is
on the right
• Skewed to the right / positively skewed
• if the mean is greater than the median; the bulk of the distribution
is on the left
• Coefficient of Skewness (SK)
Interpretation:
• Normal Distribution if Ku=3,
• leptokurtic if Ku>3, and
• platykurtic if Ku<3.
KURTOSIS
• refers to the peakedness or flatness of a distribution
• Mesokurtic
• is a normal distribution
• Leptokurtic
• is more peaked than the normal distribution
• Platykurtic
• is flatter than the normal distribution
THANK YOU!
Bienvenido D. Vendiola III, MAed Dalubhasaan ng Lungsod ng Lucena