Measures of CT and Dispersion

The document provides an example of sales data from Amazon without context. It then discusses the importance of context when analyzing data by considering questions like who, what, where, when, why and how. Finally, it provides the sales data example with additional context answering these questions.

Uploaded by

Langalia Kandarp


Scales of measurement,

Measures of Central
Tendency and Dispersion
Quantitative Methods
Prof. Sonia Nangalia
Amazon Data Example
10675489 B000001OAA 10.99 Chris G.
Samuel P. Orange County 10783489 12837593
Canada Garbage 16.99 Ohio
B000002BK9 312 Monique D. Y
Boston 15.98 Kansas 902
B000068ZVQ Bad Blood Nashville N
Chicago N 11.99 N
B00000I5Y6 440 15783947 413
Massachusetts Katherine H. Illinois Let Go

• Data doesn’t always have to be numerical

• Numbers don’t always represent numerical quantities
• Data is meaningless without context
• Consider Who, What, When, Where, Why and How to
add context
Amazon Data Example
PO Number | Name         | Ship To       | Price | Area Code | Previous Purchase | Gift? | ASIN       | Artist
10675489  | Katherine H. | Ohio          | 10.99 | 440       | Nashville         | N     | B00000I5Y6 | Kansas
10783489  | Samuel P.    | Illinois      | 16.99 | 312       | Orange County     | Y     | B000002BK9 | Boston
12837593  | Chris G.     | Massachusetts | 15.98 | 413       | Bad Blood         | N     | B000068ZVQ | Chicago
15783947  | Monique D.   | Canada        | 11.99 | 902       | Let Go            | N     | B000001OAA | Garbage

• Who?   • Where?
• What?  • When?
• Why?   • How?
Types of Data
• Categorical (or Qualitative): discrete
– Nominal
– Ordinal
• Quantitative: discrete or continuous
– Interval
– Ratio

Four ‘levels of measurement’
• Nominal data. These are purely qualitative data. All
you can do to present these data neatly is putting
each observation into one of a number of categories.
That is why they are also called ‘categorical’ data.
The different categories of the variable involved
cannot be ranked from ‘high’ to ‘low’. Examples:
religion, eye colour.
• Ordinal data. These data record some kind of
ranking. It is possible to say whether one
observation is ‘bigger’ or ‘better’ than another. But
due to the subjectivity or the lack of precision of the
measurement it is not possible to state ‘how much
bigger’ or ‘better’. Examples: a ranking of favourite bands;
‘rare’, ‘medium’, ‘well done’ steaks.
• Interval data. These data can be ranked on a scale
that uses a fixed unit of measurement. Thus, you can
say ‘how much more’ one observation is compared
to another. However, it does not make sense to
express one observation as a ratio of another.
Example: temperature in °C or °F, where zero is an
arbitrary point on the scale rather than ‘no temperature’.

• Ratio data. With these data you can sensibly express
one observation as a ratio of another. Examples:
distance, time, income.
EXERCISE
• Classify the following data using the variable
types on the previous slide:
– Temperatures of the 30 days in June

– The hair colour of first year students

– The number of DVDs sold by a music store each day

– Social class codings of A, B, C1, C2, D, E.

– Species of butterfly
EXERCISE

– How could a customer’s age be collected so it

is recorded as qualitative data?

– How could a customer’s age be collected so it

is recorded as quantitative data?
Summary Statistics

• A summary measure is a single value that

describes a characteristic of a sample of data.
For example, if we want to know something about where
the centre of the data is located (i.e., average) we would
calculate a summary measure of location, e.g., the
mean, median or mode.
• Two characteristics important to decision makers:
– Central Tendency
– Dispersion
Central Tendency
• Middle point of a distribution.
• Also called a measure of location.
Dispersion
• The spread of the data in a distribution.
or
• The extent to which the observations are
scattered.
Arithmetic Mean (average)
• Most common measure of central tendency.
• Best for making predictions.
• Symbolized as:
– $\bar{x}$ for the mean of a sample
– $\mu$ for the mean of a population
Calculating mean for ungrouped data

Add up all the values and divide by the number of values:

$$\bar{x} = \frac{\sum x}{n}$$
Example

• Eleven Geography students were asked how much they spent on

travel each week (in £).

16 20 24 11 20 15 18 22 10 14 17

• Find the mean travel spend for these students:
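The calculation can be checked with a few lines of Python. This is only a sketch of the "add up and divide" rule for the eleven travel-spend values; the variable names are illustrative and are not part of the slides.

```python
from statistics import mean

# Weekly travel spend (in £) of the eleven Geography students
travel_spend = [16, 20, 24, 11, 20, 15, 18, 22, 10, 14, 17]

total = sum(travel_spend)   # sum of all values
n = len(travel_spend)       # number of values
x_bar = total / n           # arithmetic mean

print(total, n, x_bar)      # 187 11 17.0

# The standard library gives the same answer
assert x_bar == mean(travel_spend)
```

The eleven values sum to 187, so the mean spend is £17.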

Mean – Grouped Data
Example: The following table gives the frequency distribution of the number
of orders received each day during the past 50 days at the office of a mail-order
company. Calculate the mean.

Number of orders | f
10 – 12          | 4
13 – 15          | 12
16 – 18          | 20
19 – 21          | 14
Total            | n = 50

Solution: x is the midpoint of each class, found by adding the class limits and dividing by 2.

Number of orders | f      | x  | fx
10 – 12          | 4      | 11 | 44
13 – 15          | 12     | 14 | 168
16 – 18          | 20     | 17 | 340
19 – 21          | 14     | 20 | 280
Total            | n = 50 |    | Σfx = 832

$$\bar{x} = \frac{\sum fx}{n} = \frac{832}{50} = 16.64$$
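The same grouped-mean calculation can be sketched in Python. The representation of each class as a `(lower limit, upper limit, frequency)` tuple and the function name `grouped_mean` are assumptions made for illustration only.

```python
# Each class is (lower limit, upper limit, frequency), taken from the orders table
orders = [(10, 12, 4), (13, 15, 12), (16, 18, 20), (19, 21, 14)]

def grouped_mean(classes):
    """Mean of grouped data: sum of f * midpoint divided by total frequency."""
    n = sum(f for _, _, f in classes)
    # The midpoint x of a class is (lower limit + upper limit) / 2
    sum_fx = sum(f * (lo + hi) / 2 for lo, hi, f in classes)
    return sum_fx / n

print(grouped_mean(orders))  # 16.64
```

Because every observation in a class is replaced by the class midpoint, the result is an approximation of the mean of the raw data.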
• Advantages:
– Easy to understand and calculate.
– Based on all observations, so it is a good representative of the data.
– Capable of further algebraic treatment.
• Disadvantages:
– Affected by extreme values.
– Sometimes gives absurd results, like 4.4 children per family.
– Cannot be calculated when the data contain open-ended class intervals.
Median
• Middle-most Value
• 50% of observations are above the Median, 50%
are below it
To compute the median
• First arrange the data in ascending or descending order.
• Use $\frac{n+1}{2}$ to find the position of the middle value.
• If the data contain an odd number of items, the middle item is the median.
• If there is an even number of items, the median is the average of the two middle items.
Example

• Eleven Geography students were asked how much they

spent on travel each week (in £).

16 20 24 11 20 15 18 22 10 14 17
• Find the median travel spend for these students:
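The steps for finding the median can be sketched as follows. The function name `median_by_position` is hypothetical; it follows the sort-then-locate-position rule described for ungrouped data.

```python
def median_by_position(values):
    """Median via the (n + 1) / 2 position rule on sorted data."""
    ordered = sorted(values)
    n = len(ordered)
    if n % 2 == 1:
        # Odd n: (n + 1) / 2 is a whole number, the 1-based middle position
        position = (n + 1) // 2
        return ordered[position - 1]
    # Even n: average the two items either side of position (n + 1) / 2
    lower = ordered[n // 2 - 1]
    upper = ordered[n // 2]
    return (lower + upper) / 2

travel_spend = [16, 20, 24, 11, 20, 15, 18, 22, 10, 14, 17]
print(sorted(travel_spend))              # [10, 11, 14, 15, 16, 17, 18, 20, 20, 22, 24]
print(median_by_position(travel_spend))  # 17
```

With eleven values the middle position is (11 + 1) / 2 = 6, and the sixth value in the ordered list is £17.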
Median for grouped data
Step 1: Construct the cumulative frequency distribution.
Step 2: Identify the class that contains the median. The median class is the
first class whose cumulative frequency is at least n/2.
Step 3: Find the median by using the following formula:

$$\text{Median} = L_m + \left( \frac{\frac{n}{2} - F}{f_m} \right) i$$

Where:
$n$ = the total frequency
$F$ = the cumulative frequency before the median class
$f_m$ = the frequency of the median class
$i$ = the class width
$L_m$ = the lower boundary of the median class
Example
Time to travel to work | Frequency
1 – 10                 | 8
11 – 20                | 14
21 – 30                | 12
31 – 40                | 9
41 – 50                | 7

Solution:
1st Step: Construct the cumulative frequency distribution

Time to travel to work | Frequency | Cumulative Frequency
1 – 10                 | 8         | 8
11 – 20                | 14        | 22
21 – 30                | 12        | 34
31 – 40                | 9         | 43
41 – 50                | 7         | 50

$$\frac{n}{2} = \frac{50}{2} = 25$$

so the median class is the 3rd class (21 – 30), the first class whose cumulative frequency reaches 25.
So, $F = 22$, $f_m = 12$, $L_m = 20.5$ and $i = 10$.
Therefore,

$$\text{Median} = L_m + \left( \frac{\frac{n}{2} - F}{f_m} \right) i = 20.5 + \left( \frac{25 - 22}{12} \right) 10 = 23$$

Thus, 25 persons take less than 23 minutes to travel to work and another 25 persons
take more than 23 minutes to travel to work.
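The three steps for the grouped median can be sketched in Python. This is an illustrative helper, not a library routine: the name `grouped_median` and the tuple layout are assumptions, and class boundaries are taken midway between adjacent class limits (so the 21 – 30 class has lower boundary 20.5 and width 10). At least two classes with equal spacing are assumed.

```python
from itertools import accumulate

def grouped_median(classes):
    """classes: list of (lower limit, upper limit, frequency) in ascending order."""
    freqs = [f for _, _, f in classes]
    n = sum(freqs)
    cumulative = list(accumulate(freqs))      # Step 1: cumulative frequencies
    gap = classes[1][0] - classes[0][1]       # e.g. 11 - 10 = 1 between classes
    for k, (lo, hi, f_m) in enumerate(classes):
        if cumulative[k] >= n / 2:            # Step 2: first class reaching n/2
            F = cumulative[k - 1] if k > 0 else 0
            L_m = lo - gap / 2                # lower boundary of the median class
            i = (hi + gap / 2) - L_m          # class width between boundaries
            return L_m + (n / 2 - F) / f_m * i  # Step 3: the formula

travel_time = [(1, 10, 8), (11, 20, 14), (21, 30, 12), (31, 40, 9), (41, 50, 7)]
print(grouped_median(travel_time))  # 23.0
```

Here n/2 = 25, F = 22, f_m = 12, L_m = 20.5 and i = 10, so the median is 20.5 + (3/12) × 10 = 23 minutes.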
• Advantages:
– Easy to calculate and understand.
– Not affected by extreme values.
• Disadvantages:
– Arranging the data in order is time consuming when there
is a large number of elements.
Mode
• Mode is the value that is repeated most
often in the data set.
• A distribution with two modes is called bimodal; one with
several modes is called multimodal.
Example

• Eleven Geography students were asked how much they

spent on travel each week (in £).
16 20 24 11 20 15 18 22 10 14 17

• Find the mode travel spend for these students:
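A short sketch of finding the mode by counting how often each value occurs. The helper name `modes` is hypothetical; it returns a list so that bimodal or multimodal data can be reported too.

```python
from collections import Counter

def modes(values):
    """Return every value that occurs with the highest frequency."""
    counts = Counter(values)
    top = max(counts.values())
    return sorted(v for v, c in counts.items() if c == top)

travel_spend = [16, 20, 24, 11, 20, 15, 18, 22, 10, 14, 17]
print(modes(travel_spend))     # [20]
print(modes([1, 1, 2, 2, 3]))  # [1, 2]  (a bimodal example)
```

Only £20 appears more than once in the travel data, so it is the single mode.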

Mode - Grouped data
• Mode is the value that has the highest frequency in a data set.
• For grouped data, the modal class is the class with the highest frequency.
• To find the mode for grouped data, use the following formula:

$$\text{Mode} = L_{mo} + \left( \frac{\Delta_1}{\Delta_1 + \Delta_2} \right) i$$

Where:
$i$ is the class width
$\Delta_1$ is the difference between the frequency of the modal class and the frequency
of the class immediately before it
$\Delta_2$ is the difference between the frequency of the modal class and the frequency
of the class immediately after it
$L_{mo}$ is the lower boundary of the modal class
Example
Time to travel to work | Frequency
1 – 10                 | 8
11 – 20                | 14
21 – 30                | 12
31 – 40                | 9
41 – 50                | 7

Solution:
Based on the table, the modal class is 11 – 20, so
$L_{mo} = 10.5$, $\Delta_1 = (14 - 8) = 6$, $\Delta_2 = (14 - 12) = 2$ and $i = 10$

$$\text{Mode} = 10.5 + \left( \frac{6}{6 + 2} \right) 10 = 18$$
• Advantages:
– It is simple to understand and calculate.
– It is not unduly affected by extreme values.

• Disadvantages:
– When a data set contains two, three or many modes, they are
difficult to interpret and compare.
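The grouped-data mode formula, applied to the travel-to-work table, can be sketched in Python. The function name `grouped_mode` and the tuple layout are assumptions for illustration. Boundaries are taken midway between adjacent class limits, and a missing neighbouring class (when the modal class is first or last) is treated as having frequency 0, which is also an assumption.

```python
def grouped_mode(classes):
    """classes: list of (lower limit, upper limit, frequency) in ascending order."""
    freqs = [f for _, _, f in classes]
    k = freqs.index(max(freqs))            # modal class: highest frequency
    lo, hi, f_mo = classes[k]
    gap = classes[1][0] - classes[0][1]    # e.g. 11 - 10 = 1 between classes
    L_mo = lo - gap / 2                    # lower boundary of the modal class
    i = (hi + gap / 2) - L_mo              # class width between boundaries
    before = freqs[k - 1] if k > 0 else 0  # assumption: 0 if no class before
    after = freqs[k + 1] if k < len(freqs) - 1 else 0
    d1 = f_mo - before                     # Delta 1
    d2 = f_mo - after                      # Delta 2
    return L_mo + d1 / (d1 + d2) * i

travel_time = [(1, 10, 8), (11, 20, 14), (21, 30, 12), (31, 40, 9), (41, 50, 7)]
print(grouped_mode(travel_time))  # 18.0
```

With L_mo = 10.5, Δ1 = 6, Δ2 = 2 and i = 10, the formula evaluates to 10.5 + (6/8) × 10 = 18 minutes.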
Dispersion
• Central Tendency doesn’t tell us everything
• Dispersion/Deviation/Spread tells us a lot about
how a variable is distributed.
• We are most interested in the Standard Deviation
($\sigma$) and the Variance ($\sigma^2$)
Importance and application of Dispersion
• It gives additional information that enables us to
judge the reliability of our measure of central
tendency.
• It enables us to compare dispersions of various
samples.
• Used by Financial Analysts to know the
dispersion in a firm’s earnings.
• Quality control experts use it to analyse the
dispersion of a product’s quality levels.
Variance and Standard Deviation
• Both describe how far, on average, the observations in the
data set lie from the mean of the distribution. The variance is
the average squared distance; the standard deviation is its
square root, expressed in the original units.
Denoted by
                   | Sample | Population
Standard Deviation | $s$    | $\sigma$
Variance           | $s^2$  | $\sigma^2$
Calculating variance
• Divide the sum of the squared distances
between the mean and each observation
in the population by the total number of
observations
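A minimal sketch of this calculation, reusing the eleven travel-spend values from the earlier examples and treating them as a whole population purely for illustration. The function name `population_variance` is hypothetical; `statistics.pvariance` is the standard-library equivalent.

```python
from statistics import pvariance

def population_variance(values):
    """Mean of the squared distances between each observation and the mean."""
    n = len(values)
    mu = sum(values) / n
    return sum((x - mu) ** 2 for x in values) / n

travel_spend = [16, 20, 24, 11, 20, 15, 18, 22, 10, 14, 17]
sigma_sq = population_variance(travel_spend)
print(round(sigma_sq, 2))          # 17.45
print(round(sigma_sq ** 0.5, 2))   # 4.18 (standard deviation)

# Cross-check against the standard library
assert abs(sigma_sq - pvariance(travel_spend)) < 1e-9
```

The squared distances from the mean of 17 add up to 192, and 192 / 11 ≈ 17.45. For a sample rather than a population, the divisor would be n − 1 instead of n.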
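Variance and Standard Deviation - Grouped Data

Population variance:

$$\sigma^2 = \frac{\sum fx^2 - \frac{\left( \sum fx \right)^2}{N}}{N}$$

Variance for sample data:

$$s^2 = \frac{\sum fx^2 - \frac{\left( \sum fx \right)^2}{n}}{n - 1}$$

Standard deviation:
Population: $\sigma = \sqrt{\sigma^2}$

Sample: $s = \sqrt{s^2}$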
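As an illustration, the grouped-data formulas can be applied to the mail-order table used earlier for the grouped mean (50 days of orders). The function name `grouped_variance` and its `sample` flag are assumptions; x is the class midpoint, as in the grouped mean.

```python
from math import sqrt

def grouped_variance(classes, sample=True):
    """classes: list of (lower limit, upper limit, frequency)."""
    n = sum(f for _, _, f in classes)
    mids = [((lo + hi) / 2, f) for lo, hi, f in classes]  # (midpoint x, f)
    sum_fx = sum(f * x for x, f in mids)
    sum_fx2 = sum(f * x * x for x, f in mids)
    numerator = sum_fx2 - sum_fx ** 2 / n
    # Divide by n - 1 for a sample, by N for a population
    return numerator / (n - 1 if sample else n)

orders = [(10, 12, 4), (13, 15, 12), (16, 18, 20), (19, 21, 14)]
s2 = grouped_variance(orders)                    # sample variance
sigma2 = grouped_variance(orders, sample=False)  # population variance
print(round(s2, 2), round(sqrt(s2), 2))          # 7.58 2.75
print(round(sigma2, 2), round(sqrt(sigma2), 2))  # 7.43 2.73
```

Here Σfx = 832 and Σfx² = 14216, so the numerator is 14216 − 832²/50 = 371.52 in both cases; only the divisor differs.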
• Other measures of dispersion
– Range
– Quartile deviations

Range = value of highest observation − value of lowest observation
Range
• Advantages
– Easy to calculate and understand.
• Disadvantages
– Ignores how the observations between the two extremes vary.
– Affected by extreme values.
– Cannot be calculated for data with an open-ended class.
INTERFRACTILE RANGE

• A measure of the spread between two fractiles
in a frequency distribution, i.e. the difference
between the values of two fractiles.

• Fractiles have special names depending on how many equal
parts they divide the data into: quartiles (4), deciles (10),
percentiles (100).
Interquartile range
• The difference between the first and the
third quartile.

• IQR = Q3-Q1
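A brief sketch of the range and the interquartile range for the travel-spend data from the earlier examples. Quartile conventions differ between textbooks and software; this example uses `statistics.quantiles` with its default "exclusive" method, so other conventions may give slightly different quartiles.

```python
from statistics import quantiles

travel_spend = [16, 20, 24, 11, 20, 15, 18, 22, 10, 14, 17]

# Range: highest observation minus lowest observation
data_range = max(travel_spend) - min(travel_spend)

# Quartiles: the three cut points that split the ordered data into four parts
q1, q2, q3 = quantiles(travel_spend, n=4)
iqr = q3 - q1

print(data_range)   # 14
print(q1, q3, iqr)  # 14.0 20.0 6.0
```

The range uses only the two extreme values, whereas the IQR describes the spread of the middle half of the data.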
RELATIVE DISPERSION: COEFFICIENT OF VARIATION
• The standard deviation cannot be the sole
basis for comparing the variation in two data sets,
because it depends on the units and size of the values.

• A relative measure gives the magnitude of
the deviation relative to the magnitude of the
mean.
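The coefficient of variation is commonly defined as the standard deviation divided by the mean, expressed as a percentage. The sketch below assumes that definition and uses the sample standard deviation; the function name and the pence conversion are illustrative only.

```python
from statistics import mean, stdev

def coefficient_of_variation(values):
    """Sample standard deviation as a percentage of the mean."""
    return stdev(values) / mean(values) * 100

travel_spend = [16, 20, 24, 11, 20, 15, 18, 22, 10, 14, 17]  # in £
travel_pence = [x * 100 for x in travel_spend]               # same data in pence

cv_pounds = coefficient_of_variation(travel_spend)
cv_pence = coefficient_of_variation(travel_pence)

print(round(stdev(travel_spend), 2), round(stdev(travel_pence), 2))  # 4.38 438.18
print(round(cv_pounds, 1), round(cv_pence, 1))                       # 25.8 25.8
```

Changing the unit multiplies the standard deviation by 100 but leaves the coefficient of variation unchanged, which is why a relative measure is more suitable for comparing data sets measured on different scales.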
Some concepts
• Skewness
• Kurtosis
• Chebychev’s Theorem
EXERCISE DA20: Symmetrical Distribution
EXERCISE DA20: Positive or Right Skew
EXERCISE DA20: Negative or Left Skew
Thank You
