Chapter 3 - Central Tendency & Variability

There are three main measures of central tendency: the mode, median, and mean. The mode is the most commonly occurring value, the median is the middle value when values are arranged in order, and the mean is the average value found by summing all values and dividing by the total count. These measures can each be influenced differently based on whether a distribution is symmetrical or skewed. Additional measures like quantiles further describe a distribution by dividing it into equal portions.

Uploaded by

Wai Kiki

Psychological Statistics

Chapter 3 – MEASURES OF CENTRAL TENDENCY & VARIABILITY

Introduction

There are many ways of describing a given set of data. A good number of descriptive measures exist in
statistics whose use depends largely on the nature of data and the intended purpose of the description.
These measures are the measures of position or central tendency, and the measures of variability.

Learning Outcomes
At the end of the chapter, you are expected to:
1. compute the mean, median, and mode of a given set of data;
2. decide which measure of central tendency should be used for certain types of data;
3. compute the standard deviation and variance of a given set of data; and
4. interpret the computed measures.

Learning Content & Learning Activities

Measures of Central Tendency

A measure of central tendency is a summary statistic that represents the center point or
typical value of a data set. These measures indicate where most values in a distribution fall and are
also referred to as the central location of a distribution. You can think of it as the tendency of data to
cluster around a middle value. The three most common measures of central tendency are
the mean, median, and mode. Each of these measures calculates the location of the central point
using a different method. Colloquially, measures of central tendency are often called averages.
Choosing the best measure of central tendency depends on the type of data you have.

Summary of when to use the mean, median and mode

Use the following summary table to know what the best measure of central tendency is with respect
to the different types of variable.

Type of Variable               Best measure of central tendency
Nominal                        Mode
Ordinal                        Median
Interval/Ratio (not skewed)    Mean
Interval/Ratio (skewed)        Median

There are three main measures of central tendency: the mode, the median and the mean. Each of these
measures describes a different indication of the typical or central value in the distribution.

What is the mode?

The mode is the most commonly occurring value in a distribution. A distribution may have more
than one mode or none at all. For grouped data, the class with the greatest frequency is called the
modal class. A distribution with only one mode is said to be unimodal. When two values share the
highest frequency, the distribution is said to be bimodal. If it has more than two modes, it is
multimodal. It is also possible for a distribution to have no mode: the set 3, 4, 5, 7, 9, 12, 15 has
no mode.

nds* 2020-2021

Consider this data set showing the retirement age of 11 people, in whole years:
54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60
This table shows a simple frequency distribution of the retirement age data.

Age Frequency
54 3
55 1
56 1
57 2
58 2
60 2
The most commonly occurring value is 54, therefore the mode of this distribution is 54 years.
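
The counting behind the frequency table can be sketched in Python. This is only an illustration: the function name `modes` and the choice to return an empty list when no value repeats are assumptions, not part of the chapter.

```python
from collections import Counter

def modes(values):
    """Return every value that shares the highest frequency.

    Returns an empty list when no value occurs more than once,
    which is treated here as "no mode".
    """
    if not values:
        return []
    counts = Counter(values)          # value -> frequency
    top = max(counts.values())        # highest frequency found
    if top == 1:
        return []
    return sorted(v for v, c in counts.items() if c == top)

ages = [54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]
print(modes(ages))                     # [54]
print(modes([3, 4, 5, 7, 9, 12, 15]))  # []
```

A list with two entries would indicate a bimodal set, and a longer list a multimodal one.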

Advantage of the mode:

The mode has an advantage over the median and the mean as it can be found for
both numerical and categorical (non-numerical) data. It is the simplest but least reliable measure of
central tendency. It is not affected by extreme values in a distribution, and the items do not need to
be arranged in order before the mode can be found.

Limitations of the mode:

There are some limitations to using the mode. In some distributions, the mode may not reflect the
center of the distribution very well.
Consider the distribution of retirement ages shown below, which is ordered from lowest to highest
value. It is easy to see that the center of the distribution is 57 years, but the mode is lower, at 54
years.

54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60

It is also possible to have more than one mode for the same distribution of data (bimodal or
multimodal). The presence of more than one mode can limit the ability of the mode to describe the
center or typical value of the distribution, because a single value to describe the center cannot be
identified.

In some cases, particularly where the data are continuous, the distribution may have no mode at
all. This happens when all values are different. In such cases, it may be better to use
the median or mean, or to group the data into appropriate intervals and find the modal class.

What is the median?

The median is the middle value in a distribution when the values are arranged in ascending or
descending order. It is the number that divides the upper 50% of the data from the lower 50%, that is,
half the data items fall below the median and half are above that value. In an odd number of items,
the median is simply the middle value.

Looking at the retirement age distribution (which has 11 observations), the median is the middle
value, which is 57 years:

54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60

When the distribution has an even number of observations, the median value is the mean of the two
middle values. In the following distribution, the two middle values are 56 and 57, therefore the
median equals 56.5 years:

52, 54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60
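
The odd and even cases can be sketched in a few lines of Python. The function name `median` is illustrative; the two lists are the retirement-age examples above.

```python
def median(values):
    """Middle value of the sorted data; mean of the two middle values when n is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # odd n: single middle value
    return (ordered[mid - 1] + ordered[mid]) / 2  # even n: average the two middle values

odd_set = [54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]       # 11 observations
even_set = [52, 54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]  # 12 observations
print(median(odd_set))   # 57
print(median(even_set))  # 56.5
```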

Advantage of the median:

The median is less affected by outliers and skewed data than the mean, and is usually the preferred
measure of central tendency when the distribution is not symmetrical.

Limitation of the median:

The median cannot be identified for categorical nominal data, as the categories cannot be logically ordered.

What is the mean?

The mean is the sum of the values of all observations in a data set divided by the number of
observations. This is also known as the arithmetic average. When each item is first multiplied by a
weight and the sum is divided by the total of the weights, the result is referred to as the weighted
average or weighted mean.

Looking at the retirement age distribution again:

54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60
The mean is calculated by adding together all the values (54+54+54+55+56+57+57+58+58+60+60 =
623) and dividing by the number of observations (11) which equals 56.6 years.
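
As a small check of this arithmetic, here is a Python sketch. The function names are illustrative, and treating the frequencies from the earlier frequency table as weights is an assumption made here to show that the weighted mean gives the same answer.

```python
def mean(values):
    """Arithmetic mean: sum of the values divided by how many there are."""
    return sum(values) / len(values)

def weighted_mean(values, weights):
    """Sum of value * weight divided by the sum of the weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

ages = [54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]
print(sum(ages))             # 623
print(round(mean(ages), 1))  # 56.6

# Same data as a frequency table: each distinct age weighted by its frequency.
distinct_ages = [54, 55, 56, 57, 58, 60]
frequencies = [3, 1, 1, 2, 2, 2]
print(round(weighted_mean(distinct_ages, frequencies), 1))  # 56.6
```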

Advantage of the mean:

The mean can be used for both continuous and discrete numeric data.

Limitations of the mean:

The mean cannot be calculated for categorical data, as the values cannot be summed.

As the mean includes every value in the distribution the mean is influenced by outliers and skewed
distributions.

The population mean is indicated by the Greek symbol µ (pronounced ‘mu’). When the mean is
calculated on a distribution from a sample, it is indicated by the symbol x̅ (pronounced X-bar).

How does the shape of a distribution influence the Measures of Central Tendency?

Symmetrical distributions:
When a distribution is symmetrical, the mode, median and mean are all in the middle of the
distribution. The following graph shows a larger retirement age data set with a distribution which is
symmetrical. The mode, median and mean all equal 58 years.

Skewed distributions:
When a distribution is skewed the mode remains the most commonly occurring value, the median
remains the middle value in the distribution, but the mean is generally ‘pulled’ in the direction of the
tails. In a skewed distribution, the median is often a preferred measure of central tendency, as the
mean is not usually in the middle of the distribution.

A distribution is said to be positively or right skewed when the tail on the right side of the
distribution is longer than the left side. In a positively skewed distribution, it is common for the mean
to be ‘pulled’ toward the right tail of the distribution. Although there are exceptions to this rule,
generally, most of the values, including the median value, tend to be less than the mean value.
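
A quick way to see this pull is to add one extreme value to the retirement-age data and compare the three measures. In the sketch below the value 95 is hypothetical, chosen only to create a long right tail; the functions come from Python's standard `statistics` module.

```python
import statistics

ages = [54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]
skewed = ages + [95]  # 95 is a made-up extreme value, added only for illustration

for label, data in (("original", ages), ("with 95 added", skewed)):
    print(label,
          "| mode:", statistics.mode(data),
          "| median:", statistics.median(data),
          "| mean:", round(statistics.mean(data), 2))
```

The mode and median stay at 54 and 57, while the mean rises from about 56.64 to about 59.83, above most of the values.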

Computation of the mean, median and mode from grouped data

Data which are arranged in a frequency distribution are called grouped data. When the
number of items is too large, it is best to compute the measures of central tendency and
variability using the frequency distribution.

The Quantiles

The quantiles are a natural extension of the median concept in that they are values which
divide a set of data into equal parts. While the median divides the distribution into two parts, the
quantiles divide it into four, or ten, or one hundred equal parts. The quantiles which divide the
distribution into four parts are called quartiles; those which divide the distribution into ten parts are
called deciles; and those which divide the distribution into one hundred parts are called percentiles.

How to Find Quantiles? – ungrouped data

Sample question: Find the number in the following set of data where 20 percent of values fall below
it, and 80 percent fall above:
1, 3, 5, 6, 9, 11, 12, 13, 19, 21, 22, 32, 35, 36, 44, 45, 55, 68, 79, 80, 81, 88, 90, 91, 92, 100, 112, 113,
114, 120, 121, 132, 145, 146, 149, 150, 155, 180, 189, 190

Step 1: Order the data from smallest to largest. The data in the question is already in ascending order.
Step 2: Count how many observations you have in your data set. This particular data set has 40 items.
Step 3: Convert the percentage to a decimal for “q”. We are looking for the number where 20 percent
of the values fall below it, so convert that to 0.20.
Step 4: Insert your values into the formula:
ith observation = q (n + 1)
ith observation = 0.20 (40 + 1) = 8.2

Answer: The ith observation is at position 8.2, so we round down to 8 (remembering that this formula is
an estimate). The 8th number in the set is 13, which is the number where 20 percent of the values fall
below it.
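
The four steps can be sketched in Python as follows. The function name `quantile_by_position` is illustrative, and rounding the position down follows the worked answer above; other textbooks and software interpolate between neighbouring values instead.

```python
import math

def quantile_by_position(values, q):
    """Estimate a quantile with the position rule i = q * (n + 1), rounded down."""
    ordered = sorted(values)                   # Step 1: order the data
    n = len(ordered)                           # Step 2: count the observations
    position = q * (n + 1)                     # Steps 3-4: 1-based position, may be fractional
    index = min(max(math.floor(position), 1), n)  # round down and stay inside the data
    return position, ordered[index - 1]

data = [1, 3, 5, 6, 9, 11, 12, 13, 19, 21, 22, 32, 35, 36, 44, 45, 55, 68, 79, 80,
        81, 88, 90, 91, 92, 100, 112, 113, 114, 120, 121, 132, 145, 146, 149, 150,
        155, 180, 189, 190]

position, value = quantile_by_position(data, 0.20)
print(round(position, 1), value)  # 8.2 13
```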

Summary statistics such as the median, first quartile and third quartile are measurements of position.
This is because these numbers indicate where a specified proportion of the distribution of data lies.
For instance, the median is the middle position of the data under investigation. Half of the data have
values less than the median. Similarly, 25% of the data have values less than the first quartile and 75%
of the data have values less than the third quartile.

This concept can be generalized. One way to do this is to consider percentiles. The 90th percentile
indicates the point where 90% of the data have values less than this number. More generally,
the pth percentile is the number n for which p% of the data is less than n.

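Quartile Formula (grouped data)

$$Q_k = L + \left(\frac{\frac{kn}{4} - cf}{f}\right) i, \quad k = 1, 2, 3$$

Decile Formula (grouped data)

$$D_k = L + \left(\frac{\frac{kn}{10} - cf}{f}\right) i, \quad k = 1, 2, \ldots, 9$$

Percentile Formula (grouped data)

$$P_k = L + \left(\frac{\frac{kn}{100} - cf}{f}\right) i, \quad k = 1, 2, \ldots, 99$$

where L is the lower class boundary of the class containing the quantile, n is the total number of
observations, cf is the “less than” cumulative frequency of the class just below it, f is the frequency
of the class containing the quantile, and i is the size of the class interval.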

Example: Let us consider the frequency distribution that was organized in chapter 2. Solve for the
mean, median and mode of the distribution and interpret.

Class      Frequency   Class          Class      fX       Less than    Greater than   Relative
interval   (f)         boundaries     mark (X)            cum. freq.   cum. freq.     freq. (%)
                                                          (<CF)        (>CF)
170-172    3           169.5-172.5    171        513      100          3              3
167-169    6           166.5-169.5    168        1,008    97           9              6
164-166    10          163.5-166.5    165        1,650    91           19             10
161-163    18          160.5-163.5    162        2,916    81           37             18
158-160    23          157.5-160.5    159        3,657    63           60             23
155-157    18          154.5-157.5    156        2,808    40           78             18
152-154    10          151.5-154.5    153        1,530    22           88             10
149-151    4           148.5-151.5    150        600      12           92             4
146-148    3           145.5-148.5    147        441      8            95             3
143-145    2           142.5-145.5    144        288      5            97             2
140-142    2           139.5-142.5    141        282      3            99             2
137-139    1           136.5-139.5    138        138      1            100            1
Total (Σ)  100                                   15,831                               100

a. To solve for the mean:

$$\bar{x} = \frac{\sum fX}{n} = \frac{15{,}831}{100} = 158.31$$

On average, the height of the 100 students is 158.31 cm (this now serves as the
representative height of the 100 students).
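
The fX column and its total can be reproduced with a short Python sketch. The variable names are illustrative; the class marks and frequencies are taken from the table, listed from the lowest class (137-139) to the highest (170-172).

```python
# Class marks 138, 141, ..., 171 and their frequencies, lowest class first.
marks = list(range(138, 172, 3))
freqs = [1, 2, 2, 3, 4, 10, 18, 23, 18, 10, 6, 3]

n = sum(freqs)                                        # total number of observations
total_fx = sum(f * x for f, x in zip(freqs, marks))   # sum of the fX column
grouped_mean = total_fx / n

print(n, total_fx, round(grouped_mean, 2))  # 100 15831 158.31
```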

b. To solve for the median:

$$\frac{n}{2} = \frac{100}{2} = 50$$

The median is the mean of the 50th and 51st observations when the data are arranged into an array,
and these two observations fall within the class interval 158-160, as indicated by the “less than”
cumulative frequency. Hence, the median class interval is 158-160, with 157.5 as its lower class
boundary. The “less than” cumulative frequency of the class below it is 40, the frequency of the median
class is 23, and the size of the class interval (i) is 3. Therefore,

$$\text{Median} = 157.5 + \left(\frac{50 - 40}{23}\right) 3 = 158.8$$

This means that half of the 100 students (50) have heights greater than 158.8 cm and the other 50
students have heights lower than 158.8 cm.
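
The search for the median class and the interpolation inside it can be sketched as below. The function name `grouped_median` and the lowest-class-first ordering are assumptions of this sketch.

```python
def grouped_median(lower_boundaries, freqs, width):
    """Interpolated median of grouped data; classes must be listed lowest first."""
    n = sum(freqs)
    target = n / 2                  # n/2, the position being located
    cum_below = 0                   # "less than" cumulative frequency of the classes below
    for lb, f in zip(lower_boundaries, freqs):
        if f > 0 and cum_below + f >= target:
            # This is the median class: interpolate within it.
            return lb + (target - cum_below) / f * width
        cum_below += f

lower_boundaries = [136.5 + 3 * k for k in range(12)]  # 136.5, 139.5, ..., 169.5
freqs = [1, 2, 2, 3, 4, 10, 18, 23, 18, 10, 6, 3]

print(round(grouped_median(lower_boundaries, freqs, 3), 1))  # 158.8
```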

c. To solve for the mode:

The modal class is the interval 158-160 since it has the greatest frequency. The lower
boundary of the modal class is 157.5. Δ1 = 23 – 18 = 5 and Δ2 = 23 – 18 = 5. The size of the class
interval is 3. Hence,

$$\text{Mode} = 157.5 + \left(\frac{\Delta_1}{\Delta_1 + \Delta_2}\right) 3 = 157.5 + \left(\frac{5}{5 + 5}\right) 3 = 159$$

The mode of the heights of the 100 students is 159 cm.
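
A Python sketch of the same computation follows. It assumes Δ1 and Δ2 are the differences between the modal-class frequency and the frequencies of the classes just below and just above it, which matches the numbers used above; the function name is illustrative.

```python
def grouped_mode(lower_boundaries, freqs, width):
    """Mode of grouped data from the modal class and its two neighbours.

    Classes must be listed lowest first. A missing neighbour counts as
    frequency 0 (an assumption of this sketch).
    """
    k = max(range(len(freqs)), key=lambda j: freqs[j])   # index of the modal class
    below = freqs[k - 1] if k > 0 else 0
    above = freqs[k + 1] if k < len(freqs) - 1 else 0
    d1 = freqs[k] - below    # delta 1
    d2 = freqs[k] - above    # delta 2
    return lower_boundaries[k] + d1 / (d1 + d2) * width

lower_boundaries = [136.5 + 3 * k for k in range(12)]  # 136.5, 139.5, ..., 169.5
freqs = [1, 2, 2, 3, 4, 10, 18, 23, 18, 10, 6, 3]

print(grouped_mode(lower_boundaries, freqs, 3))  # 159.0
```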

To illustrate finding the quartiles, let us consider the same data about the heights of 100 students.
Since there are 100 observations, the first quartile lies between the 25th and 26th observations and the
third quartile lies between the 75th and 76th observations. Hence, the first quartile is within the class
interval 155-157 and the third quartile is within the class interval 161-163. Hence, the first and third
quartiles are:

$$Q_1 = 154.5 + \left(\frac{25 - 22}{18}\right) 3 = 155$$

and

$$Q_3 = 160.5 + \left(\frac{75 - 63}{18}\right) 3 = 162.5$$

A similar process is applied in the computation of deciles (D1, D2, D3, …, D9) and percentiles (P1, P2,
P3, P4, P5, …, P99).
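
Because quartiles, deciles and percentiles differ only in the number of equal parts, one function can cover all three. The sketch below is a hedged illustration: the name `grouped_quantile` and its `k`/`parts` parameters are assumptions, and classes are listed from lowest to highest.

```python
def grouped_quantile(lower_boundaries, freqs, width, k, parts):
    """k-th quantile of grouped data when the distribution is cut into `parts` parts.

    parts=4 gives quartiles, parts=10 deciles, parts=100 percentiles.
    """
    n = sum(freqs)
    target = k * n / parts          # e.g. 25 for Q1 when n = 100
    cum_below = 0                   # cumulative frequency of the classes below
    for lb, f in zip(lower_boundaries, freqs):
        if f > 0 and cum_below + f >= target:
            return lb + (target - cum_below) / f * width
        cum_below += f

lower_boundaries = [136.5 + 3 * j for j in range(12)]  # 136.5, 139.5, ..., 169.5
freqs = [1, 2, 2, 3, 4, 10, 18, 23, 18, 10, 6, 3]

q1 = grouped_quantile(lower_boundaries, freqs, 3, 1, 4)
q3 = grouped_quantile(lower_boundaries, freqs, 3, 3, 4)
d5 = grouped_quantile(lower_boundaries, freqs, 3, 5, 10)
print(round(q1, 2), round(q3, 2), round(d5, 1))  # 155.0 162.5 158.8
```

The 5th decile coincides with the median, and the 25th percentile with the first quartile.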

Measures of Variability

Variability refers to how spread apart the scores of the distribution are or how much the scores vary
from each other. When descriptive statistics are presented, there is usually at least one measure of
central tendency and at least one measure of variability reported. While measures of central
tendency are useful statistics for summarizing the scores in a distribution, they are not sufficient. Two
distributions may have identical means and medians yet be quite different in other ways. There is a
need, therefore, for measures that researchers can use to describe the variability that exists within a
distribution.

The Range
The range is the difference between the largest and smallest values in a set of values.

For example, consider the following numbers: 1, 3, 4, 5, 5, 6, 7, 11. For this set of numbers, the range
would be 11 - 1 or 10.
The Interquartile Range (IQR)
The interquartile range (IQR) is a measure of variability, based on dividing a data set into quartiles.

Quartiles divide a rank-ordered data set into four equal parts. The values that divide each part are
called the first, second, and third quartiles; and they are denoted by Q1, Q2, and Q3, respectively.

• Q1 is the "middle" value in the first half of the rank-ordered data set.
• Q2 is the median value in the set.
• Q3 is the "middle" value in the second half of the rank-ordered data set.
The interquartile range is equal to Q3 minus Q1. For example, consider the following numbers: 1, 2, 3, 4,
5, 6, 7, 8.

Q2 is the median of the entire data set - the middle value. In this example, we have an even number of
data points, so the median is equal to the average of the two middle values. Thus, Q2 = (4 + 5)/2 or Q2
= 4.5. Q1 is the middle value in the first half of the data set. Since there are an even number of data
points in the first half of the data set, the middle value is the average of the two middle values; that is,
Q1 = (2 + 3)/2 or Q1 = 2.5. Q3 is the middle value in the second half of the data set. Again, since the
second half of the data set has an even number of observations, the middle value is the average of the
two middle values; that is, Q3 = (6 + 7)/2 or Q3 = 6.5. The interquartile range is Q3 minus Q1, so
IQR = 6.5 - 2.5 = 4.

The interquartile range indicates the distance between the two values which determine the middle
50% of all observations within the distribution. One-half this distance is called the semi-interquartile
range or the quartile deviation (QD). Thus,

$$QD = \frac{Q_3 - Q_1}{2}$$

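
The range, quartiles, IQR and quartile deviation for the two small examples can be sketched in Python. The "median of each half" rule is the one described above; leaving the overall median out of both halves when n is odd is an assumption of this sketch, since the text only works an even-sized example.

```python
def median(values):
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    return ordered[mid] if n % 2 else (ordered[mid - 1] + ordered[mid]) / 2

def quartiles(values):
    """Return (Q1, Q2, Q3) using the median of the lower and upper halves."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    upper = ordered[half:] if n % 2 == 0 else ordered[half + 1:]  # skip the median when n is odd
    return median(lower), median(ordered), median(upper)

range_data = [1, 3, 4, 5, 5, 6, 7, 11]
print(max(range_data) - min(range_data))   # 10

q1, q2, q3 = quartiles([1, 2, 3, 4, 5, 6, 7, 8])
iqr = q3 - q1
qd = iqr / 2
print(q1, q2, q3, iqr, qd)                 # 2.5 4.5 6.5 4.0 2.0
```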
Mean Deviation

The mean deviation measures the average deviation of the values from the arithmetic mean.
It gives equal weight to the deviation of every observation. The mean deviation is used in
determining the extent of the differences or variabilities among the members of a group. It is also an
indicator of how compact the group is on a certain measure.

The formula to calculate the mean deviation for the given data set is given below.
Mean Deviation = [Σ |X – x̅ |]/n
Here,
Σ represents the addition of values
X represents each value in the data set
x̅ represents the sample mean
n represents the number of data values
|| represents the absolute value, which ignores the “-” symbol

Example 1:
Determine the mean deviation for the data values 5, 3,7, 8, 4, 9.
Solution:
Given data values are 5, 3, 7, 8, 4, 9.
First, find the mean for the given data:
Mean, x̅ = ( 5+3+7+8+4+9)/6
x̅ = 36/6
x̅ = 6
Therefore, the mean value is 6.
Now, subtract the mean from each data value and take the absolute value of each difference
(ignore the “-” sign, if any):
|5 – 6| = 1
|3 – 6| = 3
|7 – 6| = 1
|8 – 6| = 2
|4 – 6| = 2
|9 – 6| = 3
Now, the obtained data set is 1, 3, 1, 2, 2, 3.
Finally, find the mean value for the obtained data set
Therefore, the mean deviation is
= (1+3 + 1+ 2+ 2+3) /6
= 12/6
=2
Hence, the mean deviation for 5, 3,7, 8, 4, 9 is 2.
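
The same steps in Python, as a minimal sketch (the function name is illustrative):

```python
def mean_deviation(values):
    """Average absolute deviation of the values from their arithmetic mean."""
    n = len(values)
    m = sum(values) / n                        # the mean
    return sum(abs(x - m) for x in values) / n

data = [5, 3, 7, 8, 4, 9]
print(sum(data) / len(data))   # 6.0
print(mean_deviation(data))    # 2.0
```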
For grouped data,
Mean Deviation = [Σ f|X – x̅|]/n
Here,
Σ represents the addition of values
f represents the frequency of a class interval
X represents the midpoint or class mark of a class interval
x̅ represents the sample mean
n represents the total number of observations
|| represents the absolute value, which ignores the “-” symbol

Example:

Class interval   Frequency (f)   Class mark (X)   |X - x̅| or |X – 158.31|   f|X - x̅|
170-172          3               171              12.69                      38.07
167-169          6               168              9.69                       58.14
164-166          10              165              6.69                       66.9
161-163          18              162              3.69                       66.42
158-160          23              159              0.69                       15.87
155-157          18              156              2.31                       41.58
152-154          10              153              5.31                       53.1
149-151          4               150              8.31                       33.24
146-148          3               147              11.31                      33.93
143-145          2               144              14.31                      28.62
140-142          2               141              17.31                      34.62
137-139          1               138              20.31                      20.31
Total (Σ)        100                                                         490.8

The computed mean is 158.31, and the mean deviation is

$$\frac{\sum f|X - \bar{x}|}{n} = \frac{490.8}{100} = 4.908$$

This number means that some values are greater than the mean and some are lesser, but on average
each value differs from the mean by the representative value of 4.908.
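
The table's last column and its total can be checked with a short Python sketch. Variable names are illustrative; class marks and frequencies are listed from the lowest class to the highest.

```python
marks = list(range(138, 172, 3))                    # class marks 138, 141, ..., 171
freqs = [1, 2, 2, 3, 4, 10, 18, 23, 18, 10, 6, 3]   # matching frequencies

n = sum(freqs)
grouped_mean = sum(f * x for f, x in zip(freqs, marks)) / n

# Each absolute deviation is weighted by its class frequency before summing.
total_abs_dev = sum(f * abs(x - grouped_mean) for f, x in zip(freqs, marks))
grouped_md = total_abs_dev / n

print(round(total_abs_dev, 1), round(grouped_md, 3))  # 490.8 4.908
```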

Variance and Standard Deviation

The variance is equal to the sum of the squared deviations about the mean divided by the
number of observations. The standard deviation is the square root of the average of the squared
deviations of each observation from the mean; that is, it is the square root of the variance.
They are used when the mean is the preferred measure of central tendency. They show whether or
not the values are grouped closely around the mean of the distribution. The symbols for the sample and
population variances are s² and σ², respectively. Variance is frequently discussed by researchers as
an indicator of how much variability there is in an entire distribution of values. The standard
deviation is used to determine how far the data are from the mean.
If the values are clustered tightly about their mean, the standard deviation is small and if the
values become more and more scattered about the mean, the standard deviation of these sets is
large.
If the data points are further from the mean, there is a higher deviation within the data set; thus, the
more spread out the data, the higher the standard deviation. A low standard deviation indicates that
the values tend to be close to the mean of the set, while a high standard deviation indicates that the
values are spread out over a wider range.
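
A small numeric illustration, using Python's standard `statistics` module: `pvariance` and `pstdev` divide the sum of squared deviations by n, while `variance` and `stdev` divide by n − 1, the usual convention for a sample. The six scores are illustrative values only.

```python
import statistics

scores = [5, 3, 7, 8, 4, 9]   # illustrative values; their mean is 6

# Squared deviations from the mean: 1, 9, 1, 4, 4, 9 (sum = 28)
pop_var = statistics.pvariance(scores)    # 28 / 6
samp_var = statistics.variance(scores)    # 28 / 5

print(round(pop_var, 4), round(statistics.pstdev(scores), 4))
print(round(samp_var, 4), round(statistics.stdev(scores), 4))
```

The sample figures come out slightly larger than the population figures because the same sum is divided by a smaller number.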

When to use the sample or population standard deviation

We are normally interested in knowing the population standard deviation because our population
contains all the values we are interested in. Therefore, you would normally calculate the population
standard deviation if: (1) you have the entire population or (2) you have a sample of a larger
population, but you are only interested in this sample and do not wish to generalize your findings to
the population. However, in statistics, we are usually presented with a sample from which we wish to
estimate (generalize to) a population, and the standard deviation is no exception to this. Therefore, if
all you have is a sample, but you wish to make a statement about the population standard deviation
from which the sample is drawn, you need to use the sample standard deviation. Confusion can often
arise as to which standard deviation to use due to the name "sample" standard deviation incorrectly
being interpreted as meaning the standard deviation of the sample itself and not the estimate of the
population standard deviation based on the sample.

What type of data should you use when you calculate a standard deviation?

The standard deviation is used in conjunction with the mean to summarize continuous data, not
categorical data. In addition, the standard deviation, like the mean, is normally only appropriate when
the continuous data are not significantly skewed and have no outliers.

Examples of when to use the sample or population standard deviation

Q. A teacher sets an exam for her students. The teacher wants to summarize the results the students
attained as a mean and standard deviation. Which standard deviation should be used?

A. Population standard deviation. Why? Because the teacher is only interested in this class of
students' scores and nobody else.

Q. A researcher has recruited males aged 45 to 65 years old for an exercise training study to
investigate risk markers for heart disease (e.g., cholesterol). Which standard deviation would most
likely be used?

A. Sample standard deviation. Although not explicitly stated, a researcher investigating health related
issues will not simply be concerned with just the participants of their study; they will want to show
how their sample results can be generalized to the whole population (in this case, males aged 45 to
65 years old). Hence, the use of the sample standard deviation.

Q. One of the questions on a national census survey asks for respondents' age. Which standard
deviation would be used to describe the variation in all ages received from the census?

A. Population standard deviation. A national census is used to find out information about the
nation's citizens. By definition, it includes the whole population. Therefore, a population standard
deviation would be used.

What are the formulas for the standard deviation?

The sample standard deviation formula is:

$$s = \sqrt{\frac{\sum (X - \bar{x})^2}{n - 1}}$$

where,

s = sample standard deviation
Σ = sum of...
X = each score (value)
x̅ = sample mean
n = number of scores (values) in the sample.

The population standard deviation formula is:

$$\sigma = \sqrt{\frac{\sum (X - \mu)^2}{n}}$$

where,

σ = population standard deviation
Σ = sum of...
X = each score (value)
μ = population mean
n = number of scores (values) in the population.

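Example: Compute the standard deviation of the heights of 100 students in the activity. Because the
data are grouped, each squared deviation is multiplied by its class frequency before summing.

Class interval   Frequency (f)   Class mark (X)   (X - x̅) or (X – 158.31)   (X - x̅)²   f(X - x̅)²
170-172          3               171              12.69                      161.0361    483.1083
167-169          6               168              9.69                       93.8961     563.3766
164-166          10              165              6.69                       44.7561     447.5610
161-163          18              162              3.69                       13.6161     245.0898
158-160          23              159              0.69                       0.4761      10.9503
155-157          18              156              -2.31                      5.3361      96.0498
152-154          10              153              -5.31                      28.1961     281.9610
149-151          4               150              -8.31                      69.0561     276.2244
146-148          3               147              -11.31                     127.9161    383.7483
143-145          2               144              -14.31                     204.7761    409.5522
140-142          2               141              -17.31                     299.6361    599.2722
137-139          1               138              -20.31                     412.4961    412.4961
Total (Σ)        100                                                                     4,209.3900

If treated as sample:

$$s = \sqrt{\frac{\sum f(X - \bar{x})^2}{n - 1}} = \sqrt{\frac{4{,}209.39}{100 - 1}} = \sqrt{42.5191} \approx 6.52$$

If treated as population:

$$\sigma = \sqrt{\frac{\sum f(X - \bar{x})^2}{n}} = \sqrt{\frac{4{,}209.39}{100}} = \sqrt{42.0939} \approx 6.49$$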
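
For grouped data, the squared deviation of each class mark is weighted by the class frequency before summing, in the same way the grouped mean deviation weights |X − x̅| by f. A minimal Python sketch for the 100 heights (variable names are illustrative; classes are listed lowest first):

```python
import math

marks = list(range(138, 172, 3))                    # class marks 138, 141, ..., 171
freqs = [1, 2, 2, 3, 4, 10, 18, 23, 18, 10, 6, 3]   # matching frequencies

n = sum(freqs)
grouped_mean = sum(f * x for f, x in zip(freqs, marks)) / n

# Frequency-weighted sum of squared deviations from the mean.
ss = sum(f * (x - grouped_mean) ** 2 for f, x in zip(freqs, marks))

s = math.sqrt(ss / (n - 1))    # treated as a sample
sigma = math.sqrt(ss / n)      # treated as a population

print(round(ss, 2), round(s, 2), round(sigma, 2))  # 4209.39 6.52 6.49
```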

Coefficient of Variation

The coefficient of variation (relative standard deviation) is a statistical measure of the
dispersion of data points around the mean. The coefficient of variation (CV) is the ratio of the
standard deviation to the mean. The higher the coefficient of variation, the greater the level of
dispersion around the mean. It is generally expressed as a percentage. Without units, it allows for
comparison between distributions of values whose scales of measurement are not comparable.
When we are presented with estimated values, the CV relates the standard deviation of the estimate
to the value of this estimate. The lower the value of the coefficient of variation, the more precise the
estimate.

Mathematically, the standard formula for the coefficient of variation is expressed in the following
way:

$$CV = \frac{\sigma}{\mu} \times 100\% \quad \text{or} \quad CV = \frac{s}{\bar{x}} \times 100\%$$

where:

σ – the population standard deviation
μ – the population mean
s – the sample standard deviation
x̅ – the sample mean

Two sets of data with known means and standard deviations may be compared quantitatively by
taking the coefficient of variation of each group.

Example 1. Suppose a set of data has mean = 32 and s = 5, and another set has mean = 26 and s = 4.

For the first set,

$$CV = \frac{5}{32} \times 100 = 15.625\%$$

For the second set,

$$CV = \frac{4}{26} \times 100 = 15.38\%$$

Since the CV of the second group is smaller, the second group is the more consistent of the two. While
its mean is a little lower than that of the first group, its values are less variable relative to the mean.
Thus, a higher mean does not always imply a better set of values. The standard deviation, together with
the mean, gives a better description of the set of data.
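
The two ratios in Example 1 can be checked with a few lines of Python. The function name is illustrative, and rejecting a non-positive mean is an assumption added here because the ratio is only meaningful for positive data.

```python
def coefficient_of_variation(sd, mean):
    """Standard deviation as a percentage of the mean."""
    if mean <= 0:
        raise ValueError("the CV is only meaningful for a positive mean")
    return sd / mean * 100

cv_first = coefficient_of_variation(5, 32)
cv_second = coefficient_of_variation(4, 26)
print(round(cv_first, 3), round(cv_second, 2))  # 15.625 15.38
```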

Example 2.
A researcher is comparing two multiple-choice tests with different conditions. In the first test, a
typical multiple-choice test is administered. In the second test, alternative choices (i.e. incorrect
answers) are randomly assigned to test takers. The results from the two tests are:

           Regular Test    Randomized Answers
Mean       59.9            44.8
SD         10.2            12.7

Trying to compare the two test results is challenging. Comparing standard deviations doesn’t really
work, because the means are also different. Calculating the coefficient of variation helps to make
sense of the data:

           Regular Test    Randomized Answers
Mean       59.9            44.8
SD         10.2            12.7
CV (%)     17.03           28.35

Looking at the standard deviations of 10.2 and 12.7, you might think that the tests have similar
results. However, when you adjust for the difference in the means, the difference in variability
becomes much clearer:
Regular test: CV = 17.03%
Randomized answers: CV = 28.35%
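
The CV row can be reproduced from the means and standard deviations of the two tests. In this sketch the dictionary layout is just one convenient way to hold the values.

```python
# (mean, SD) for each test, as given in Example 2.
tests = {
    "Regular test": (59.9, 10.2),
    "Randomized answers": (44.8, 12.7),
}

# CV = SD / mean * 100, rounded to two decimals as in the table.
cv = {name: round(sd / mean * 100, 2) for name, (mean, sd) in tests.items()}
for name, value in cv.items():
    print(name, value)
```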

The coefficient of variation can also be used to compare variability between different measures. For
example, you can compare IQ scores to scores on the Woodcock-Johnson III Tests of Cognitive
Abilities.

Note: The Coefficient of Variation should only be used to compare positive data on a ratio scale. The CV has little or no
meaning for measurements on an interval scale. Examples of interval scales include temperatures in Celsius or Fahrenheit,
while the Kelvin scale is a ratio scale that starts at zero and cannot, by definition, take on a negative value (0 degrees Kelvin
is the absence of heat).

Assessment Task

From the same data considered in the activity, organize the heights of the 100 students into a
frequency distribution with a class interval of 5. The highest value must be the upper limit of the
highest class interval.

Class      Tally   Frequency   Class      fX    Less than    (X - x̅)   (X - x̅)²
interval           (f)         mark (X)         cum. freq.
                                                (<CF)
168-172                        170              100
163-167                        165

Total (Σ)          100

Solve the following: (show corresponding solutions)

1. mean
2. median
3. mode
4. first quartile
5. third quartile
6. 5th decile
7. 7th decile
8. 25th percentile
9. 85th percentile
10. standard deviation
