0% found this document useful (0 votes)

54 views20 pages

Descriptive Stats for GIS Students

This document provides an overview of a lecture on descriptive statistics. It discusses univariate descriptive statistics like measures of central tendency (mean, median, mode) and dispersion (range, interquartile range, variance, standard deviation). It also discusses bivariate descriptive statistics like correlation. Examples of calculating various univariate statistics and interpreting skewness, kurtosis, and frequency distributions are provided. Scatterplots and calculating correlation to analyze relationships between two variables are also introduced.

Uploaded by

Fanelo Felicity

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views20 pages

Descriptive Stats for GIS Students

Uploaded by

Fanelo Felicity

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

2021/08/25

Lecture 4

GIS220:
Descriptive statistics

Prof Gregory Breetzke

[email protected]
Room 1-19, Geography

Lecture overview

• What are descriptive statistics?

• Types of descriptive statistics
– Univariate
– Bivariate
• Examples

1
2021/08/25

Descriptive statistics

• Provide an initial entry point

• Some research questions can satisfactory be answered

using descriptive statistics

Types of descriptive statistics

• Univariate and bivariate statistics

– U: mean, mode, range, standard deviation
– B: correlation coefficient

2
2021/08/25

Types of descriptive statistics

UNIVARIATE

• Measures of central tendency

– Mean
– Mode
– Median
• Measures of dispersion
– Range
– Interquartile range
– Variance
– Standard deviation

The mean

• The mean is a measure of central value

– What most people mean by “average”
– Sum of a set of numbers divided by the number
of numbers in the set

3
2021/08/25

The median
• Middlemost or most central item in the set of
ordered numbers; it separates the distribution
into two equal halves
• If odd, then n is the middle value of sequence
– if X = [1,2,4,6,9,10,12,14,17]
– then 9 is the median
• If even, then n, average of 2 middle values
– if X= [1,2,4,6,9,10,11,12,14,17]
– then 9.5 is the median; i.e., (9+10)/2
• Median is not affected by extreme values

The mode
• The mode is the most frequently occurring
number in a distribution
– if X = [1,2,4,7,7,7,8,10,12,14,17]
– then 7 is the mode
• Easy to see in a simple frequency distribution
• Possible to have no modes or more than one
mode
– bimodal and multimodal
• Don’t have to be exactly equal frequency
– major mode, minor mode
• Mode is not affected by extreme values

4
2021/08/25

When to use what…?

• Mean is a great measure. But, there are time when its
usage is inappropriate or impossible
– Nominal data: Mode
– The distribution is bimodal: Mode
– You have ordinal data: Median or mode
– Are a few extreme scores: Median

Dispersion
• Dispersion
– How tightly clustered or how
variable the values are in a data
set
• Example
– Data set 1: [0,25,50,75,100]
– Data set 2: [48,49,50,51,52]
– Both have a mean of 50, but data
set 1 clearly has greater variability than data set 2

5
2021/08/25

Range
• The difference between the maximum and
minimum values in a set
• Example
– Data set 1: [1,25,50,75,100]; R: 100-1 = 99
– Data set 2: [48,49,50,51,52]; R: 52-48 = 4
– The range ignores how data are distributed and
only takes the extreme scores into account

• RANGE = (Xlargest –Xsmallest)

Quartiles
• Split ordered data into four quarters

= first quartile = (25th percentile)

= second quartile = Median (50th percentile)
= third quartile = (75th percentile)

6
2021/08/25

Interquartile range (IQR)

• Difference between third and first quartiles
– Interquartile Range = Q3-Q1

• Spread in middle 50%

• Not affected by extreme values

• The IQR is used to measure how spread out the data points in a set
are from the mean of the data set

• The higher the IQR, the more spread out the data points

• The smaller the IQR, the more bunched up the data points are
around the mean

• It is best used with other measurements such as the median and

total range to build a complete picture of a data set’s tendency to
cluster around its mean.

Example

• Given the set of values: 27, 18, 19, 12, 15, 1,

2, 6, 5, 9, 7, find the…
– Mean
– Median
– Range
– Interquartile range

7
2021/08/25

Standard deviation
• Let X = [3, 4, 5 ,6, 7]
– X=5
– (X - X) = [-2, -1, 0, 1, 2]
• Subtract x from each number in X
– (X - X)2 = [4, 1, 0, 1, 4]
• Squared deviations from the mean
– – S (X - X)2 = 10
• Sum of squared deviations from the mean (SS)
– S (X - X)2 /n-1 = 10/5 = 2.5
• Average squared deviation from the mean
– S (X - X)2 /n-1 = 2.5 = 1.58
• Square root of averaged squared deviation

Standard deviation
• Most South African employers issue raises based on
percent of salary
• Why do supervisors think the most fair raise is a
percentage raise?
• Answer:
1)Because higher paid persons get the most money.
2)The easiest thing to do is raise everyone’s salary by a fixed
percent.
• If your budget went up by 5%, salaries can go up by 5%.
• The problem is that the flat percent raise gives
unequal increased rewards

8
2021/08/25

Standard deviation
• Acme Toilet Cleaning Services
• Salary Pool: R200,000

Incomes:
• President: R100K; Manager: R50K; Secretary: R40K; and
Toilet Cleaner: R10K
• Mean: R50K - These can be considered
• Range: R90K “measures of inequality”

• Variance: R1,050,000,000
• Standard Deviation: R32.4K
• Now, let’s apply a 5% raise

Standard deviation
• After a 5% raise, the pool of money increases by R10K to
R210,000

• Incomes:
– President: R105K; Manager: R52.5K; Secretary: R42K; and Toilet Cleaner:
R10.5K
– Mean: R52.5K –went up by 5%
– Range: R94.5K –went up by 5%
– Variance: R1,157,625,000
– Standard Deviation: R34K –went up by 5%

• The flat percentage raise increased

inequality. The top earner got 50% of
the new money. The bottom earner
got 5% of the new money. Measures of
inequality went up by 5%.

9
2021/08/25

Skew
• Skewness is a measure of the asymmetry of the
probability distribution
• Roughly speaking, a distribution has positive skew
(right-skewed) if the right (higher value) tail is
longer and a negative skew (left-skewed) if the left
(lower value) tail is longer (confusing the two is a
common error)

Skew

10
2021/08/25

Kurtosis

• A high kurtosis distribution has a sharper "peak"

and fatter "tails", while a low kurtosis distribution
has a more rounded peak with wider "shoulders".

11
2021/08/25

Frequency distributions
• Symmetrical distribution
– Approximately equal numbers of observations above and
below the middle
• Skewed distribution
– One side is more spread out that the other, like a tail
– Direction of the skew
• Positive or negative (right or left)
• Side with the fewer scores
• Side that looks like a tail

Symmetrical vs. skewed distributions

12
2021/08/25

Types of descriptive statistics

BIVARIATE

• Correlation
– linear pattern of relationship between one variable (x) and
another variable (y) –an association between two variables
• Relative position of one variable correlates with relative
distribution of another variable
• Warning:
– No proof of causality
– Cannot assume x causes y

Scatterplots and correlation

• A scatter plot (or scatter diagram) is used to show
the relationship between two variables
– Scatter diagram plots pairs of bivariate observations (x, y)
on the X-Y plane
– Y is called the dependent variable
– X is called an independent variable
• Correlation analysis is used to measure strength of
the association (linear relationship) between two
variables
– Only concerned with strength of the
relationship
– No causal effect is implied

13
2021/08/25

Types of correlation
• Positive correlation
– High values of X tend to be associated with high values of Y.
– As X increases, Y increases
• Negative correlation
– High values of X tend to be associated with low values of Y.
– As X increases, Y decreases
• No correlation
• No consistent tendency for values on Y to increase or
decrease as X increases

14
2021/08/25

15
2021/08/25

Applications

Individual vs Group (Neighbourhood)

16
2021/08/25

What type of relationship?

Scatterplot:Video Games and Alcohol Consumption

20
Average Number of Alcoholic Drinks

18
16
14
Per Week

12
10
8
6
4
2
0
0 5 10 15 20 25
Average Hours of Video Games Per Week

What type of relationship?

Scatterplot: Video Games and Test Score

100
90
80
70
Exam Score

60
50
40
30
20
10
0
0 5 10 15 20
Average Hours of Video Games Per Week

17
2021/08/25

Each point represents something or

some PLACE!!

18
2021/08/25

19
2021/08/25

Practical 1

Date: Thursday 26th August 1130-1430 (Posted on Thursday)

Location: Remotely or on-campus (Brown & Orange & Red IT labs)

Assistance: Thursdays 1130-1420 and Thursdays 14:00-16:00 by

appointment via Doodle

Due: Thursday 9th September at 1130 (upload on ClickUp)

Task: Sampling exercise and gaining familiarity with GeoDa and

ArcPro

Software: Excel, GeoDa and ArcPro

Interpreting Test Score: Online Workshop 8602 Aiou
100% (1)
Interpreting Test Score: Online Workshop 8602 Aiou
39 pages
Data Analysis and Data Visualization Basics 2
No ratings yet
Data Analysis and Data Visualization Basics 2
50 pages
Lecture-6-7-8-Descriptive Statistics-Dispersion
No ratings yet
Lecture-6-7-8-Descriptive Statistics-Dispersion
42 pages
Chapter 4 Measures of Dispersion (Variation)
No ratings yet
Chapter 4 Measures of Dispersion (Variation)
34 pages
Basic Statistics
No ratings yet
Basic Statistics
7 pages
Bioepi Lesson 6. Descriptive Statistics
No ratings yet
Bioepi Lesson 6. Descriptive Statistics
38 pages
Lecture 2-Descriptive Statistics
No ratings yet
Lecture 2-Descriptive Statistics
74 pages
Chapter 2 Descriptive Statistics
No ratings yet
Chapter 2 Descriptive Statistics
3 pages
Lecture Notes 2 - Descriptive Statistics-1720598791715
No ratings yet
Lecture Notes 2 - Descriptive Statistics-1720598791715
21 pages
RRU5903 (850Mhz) - Technical Specifications
No ratings yet
RRU5903 (850Mhz) - Technical Specifications
8 pages
Unit 4 & 5 8614
No ratings yet
Unit 4 & 5 8614
58 pages
Day 3 Educational Statistics
No ratings yet
Day 3 Educational Statistics
37 pages
Q & A - Unit 1 - Introduction To Statistics
No ratings yet
Q & A - Unit 1 - Introduction To Statistics
20 pages
Descriptive Statistics - Measures of Spread: April 2014
No ratings yet
Descriptive Statistics - Measures of Spread: April 2014
5 pages
Descriptive Statistics - Measures of Spread: April 2014
No ratings yet
Descriptive Statistics - Measures of Spread: April 2014
5 pages
Source: Pllnu4Dk9H04Wqyrebvzx4?Fr Yfp-T-701-S &toggle 1&cop Mss&Ei Utf8&Fp - Ip PH&P Types of Descriptive Statistics
No ratings yet
Source: Pllnu4Dk9H04Wqyrebvzx4?Fr Yfp-T-701-S &toggle 1&cop Mss&Ei Utf8&Fp - Ip PH&P Types of Descriptive Statistics
51 pages
Dsbda Unit 2
No ratings yet
Dsbda Unit 2
155 pages
Analytics Compendium (Incl Stats)
No ratings yet
Analytics Compendium (Incl Stats)
31 pages
C32 - PFRS 5 Noncurrent Asset Held For Sale
No ratings yet
C32 - PFRS 5 Noncurrent Asset Held For Sale
4 pages
Windows Movie Maker
100% (2)
Windows Movie Maker
6 pages
Measures of Central Tendency - 1-1
No ratings yet
Measures of Central Tendency - 1-1
24 pages
Descriptive Statistics & Data Prep
No ratings yet
Descriptive Statistics & Data Prep
113 pages
Descriptive Statistics Course Guide
No ratings yet
Descriptive Statistics Course Guide
50 pages
District Test On The Circular Flow Model-1-1
100% (2)
District Test On The Circular Flow Model-1-1
7 pages
Week 12
No ratings yet
Week 12
37 pages
Statistics Basics for Data Science
100% (1)
Statistics Basics for Data Science
27 pages
Measures of Central Tendency Guide
No ratings yet
Measures of Central Tendency Guide
32 pages
Day 01-Basic Statistics
No ratings yet
Day 01-Basic Statistics
36 pages
Final Measures of Dispersion DR Lotfi
No ratings yet
Final Measures of Dispersion DR Lotfi
54 pages
Module I. Basic Calculations. Average, Standard Deviation by Excel
No ratings yet
Module I. Basic Calculations. Average, Standard Deviation by Excel
48 pages
Introduction To Statistics Lecture 7
No ratings yet
Introduction To Statistics Lecture 7
32 pages
Statistical Analysis - Descriptive Stat
No ratings yet
Statistical Analysis - Descriptive Stat
6 pages
Statistics in Psychology
No ratings yet
Statistics in Psychology
15 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
Geo - Statistics 1 1 1
No ratings yet
Geo - Statistics 1 1 1
51 pages
8614.educational Statitics Unit 4
No ratings yet
8614.educational Statitics Unit 4
34 pages
Descriptive Stats for Students
No ratings yet
Descriptive Stats for Students
21 pages
Hns 2321 Biostatistics Descritive Statistics
No ratings yet
Hns 2321 Biostatistics Descritive Statistics
35 pages
Basic Functions of Excel
No ratings yet
Basic Functions of Excel
76 pages
Stats Lecture 1
No ratings yet
Stats Lecture 1
45 pages
BTK - A318 - A319 - A320 - A321 - AMM - 01-Feb-2020 - J. AIDS MCDU Functions
No ratings yet
BTK - A318 - A319 - A320 - A321 - AMM - 01-Feb-2020 - J. AIDS MCDU Functions
44 pages
Biostats Lesson 3
No ratings yet
Biostats Lesson 3
6 pages
Basic Concepts of Statistics
No ratings yet
Basic Concepts of Statistics
43 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
f592b059 1643454320549
No ratings yet
f592b059 1643454320549
39 pages
Bharat Parekh
100% (1)
Bharat Parekh
3 pages
Business Statistics ASSIGNMENT
No ratings yet
Business Statistics ASSIGNMENT
4 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
22 pages
Statistical Measures 2024 (Part 2) - Word
No ratings yet
Statistical Measures 2024 (Part 2) - Word
8 pages
Unit-3 DS Students
No ratings yet
Unit-3 DS Students
35 pages
Statistics: Types, Data, and Measures
No ratings yet
Statistics: Types, Data, and Measures
6 pages
Data Visualizations: Histograms
No ratings yet
Data Visualizations: Histograms
27 pages
Why Study Dispersion?: Spread of The Data
No ratings yet
Why Study Dispersion?: Spread of The Data
31 pages
Stats Week 1 PDF
No ratings yet
Stats Week 1 PDF
6 pages
Statistics 3: DR Taher
No ratings yet
Statistics 3: DR Taher
38 pages
Assignment 2
No ratings yet
Assignment 2
19 pages
Lectures 11 12 13 - Engineering Statistics 2017 - Handouts
No ratings yet
Lectures 11 12 13 - Engineering Statistics 2017 - Handouts
97 pages
SINGLE VARIABLE Notes 5.3 Year 10
No ratings yet
SINGLE VARIABLE Notes 5.3 Year 10
9 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
37 pages
Basic Conducting Online Lesson Plan 3 31
No ratings yet
Basic Conducting Online Lesson Plan 3 31
1 page
How To Post Bail For Your Temporary Liberty
No ratings yet
How To Post Bail For Your Temporary Liberty
4 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
5 pages
Bookkeeping (Second Part)
100% (3)
Bookkeeping (Second Part)
38 pages
Statistical Machine Learning
100% (1)
Statistical Machine Learning
12 pages
2019 GGY 283 Semester Test 1
No ratings yet
2019 GGY 283 Semester Test 1
2 pages
Descriptive Stat
No ratings yet
Descriptive Stat
13 pages
Installation Manual For EG2233/EG3333/EG8406/EG3355/EG3388 RF EAS Systems
No ratings yet
Installation Manual For EG2233/EG3333/EG8406/EG3355/EG3388 RF EAS Systems
8 pages
Procedure For Design and Development
No ratings yet
Procedure For Design and Development
8 pages
2021 L5 GGY283 Data Models 2
No ratings yet
2021 L5 GGY283 Data Models 2
19 pages
BE Module 5
No ratings yet
BE Module 5
16 pages
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
No ratings yet
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
35 pages
Insurance Premium Rates Guide
No ratings yet
Insurance Premium Rates Guide
6 pages
2021 L4 GGY283 Data Models 1
No ratings yet
2021 L4 GGY283 Data Models 1
28 pages
Practical Exercise 2: Introduction To Online Geospatial Technology
No ratings yet
Practical Exercise 2: Introduction To Online Geospatial Technology
10 pages
Middle East Real Estate Predictions - Dubai
No ratings yet
Middle East Real Estate Predictions - Dubai
28 pages
Cao Wang FTA EMA
No ratings yet
Cao Wang FTA EMA
5 pages
Hydraulic Sealing Surface Insights
No ratings yet
Hydraulic Sealing Surface Insights
7 pages
Database Normalization Exercise
No ratings yet
Database Normalization Exercise
1 page
Spray Booth Design English
No ratings yet
Spray Booth Design English
7 pages
OM Unit - III-1
No ratings yet
OM Unit - III-1
29 pages
Crime Mapping for Police Planning
No ratings yet
Crime Mapping for Police Planning
7 pages
2021 - Lecture 6 - Spatial Autocorrelation I - Slides
No ratings yet
2021 - Lecture 6 - Spatial Autocorrelation I - Slides
17 pages
The Individual Team Member - Personality
No ratings yet
The Individual Team Member - Personality
20 pages
Test November, Questions and Answers Test November, Questions and Answers
No ratings yet
Test November, Questions and Answers Test November, Questions and Answers
9 pages
Hartley Oscillator
No ratings yet
Hartley Oscillator
4 pages
STD Blanket MSDS FOR TURBINE INSULATION
No ratings yet
STD Blanket MSDS FOR TURBINE INSULATION
7 pages
2021 - Lecture 8 - Spatial Analysis of Points - Slides
No ratings yet
2021 - Lecture 8 - Spatial Analysis of Points - Slides
16 pages
LiDAR for Mapping and Surveying
No ratings yet
LiDAR for Mapping and Surveying
29 pages
P6 - GGY283 - 2021 - Instructions and Notes
No ratings yet
P6 - GGY283 - 2021 - Instructions and Notes
23 pages
Geogia Hotel Ghana LTD Vrs Silver Star Auto LTD (J4 34 of 2012) 2012 GHASC 54 (4 December 2012)
No ratings yet
Geogia Hotel Ghana LTD Vrs Silver Star Auto LTD (J4 34 of 2012) 2012 GHASC 54 (4 December 2012)
26 pages
LESSON PLAN FORMAT HAND TOOLS Arbelle
No ratings yet
LESSON PLAN FORMAT HAND TOOLS Arbelle
2 pages
P8 Ggy283 2021
No ratings yet
P8 Ggy283 2021
16 pages
Test 15 January 2016 Questions
No ratings yet
Test 15 January 2016 Questions
16 pages
2021 - Lecture 5 - ESDA
No ratings yet
2021 - Lecture 5 - ESDA
17 pages
Informatics 271 Semester Test
No ratings yet
Informatics 271 Semester Test
25 pages
GMA220 SUT3 30mar2021 EMR Principles
No ratings yet
GMA220 SUT3 30mar2021 EMR Principles
41 pages
Department of Informatics INF 315 Lecture 3: Soft Issues Involved in IT Project Management
No ratings yet
Department of Informatics INF 315 Lecture 3: Soft Issues Involved in IT Project Management
55 pages
GMA220 - SUT5 - Image Enhancement - 19 - 20April2021FN
No ratings yet
GMA220 - SUT5 - Image Enhancement - 19 - 20April2021FN
47 pages
Essential of Financial Accounting
No ratings yet
Essential of Financial Accounting
8 pages
Self-Reflection On Instructional Coaching (1) 2
No ratings yet
Self-Reflection On Instructional Coaching (1) 2
3 pages
TN206
No ratings yet
TN206
37 pages
Business Functions - Chaper 1-9-162-191
No ratings yet
Business Functions - Chaper 1-9-162-191
30 pages
Cryptanalysis of (EC)DSA with Lattice Techniques
No ratings yet
Cryptanalysis of (EC)DSA with Lattice Techniques
11 pages
Oilfield Chemical Solutions
No ratings yet
Oilfield Chemical Solutions
13 pages
HUAWEI MateView GT Quick Start Guide - (01, En-Us, Zhuque)
No ratings yet
HUAWEI MateView GT Quick Start Guide - (01, En-Us, Zhuque)
41 pages

Descriptive Stats for GIS Students

Uploaded by

Descriptive Stats for GIS Students

Uploaded by

2021/08/25

Prof Gregory Breetzke

• What are descriptive statistics?

• Provide an initial entry point

• Some research questions can satisfactory be answered

Types of descriptive statistics

• Univariate and bivariate statistics

Types of descriptive statistics

• Measures of central tendency

• The mean is a measure of central value

When to use what…?

• RANGE = (Xlargest –Xsmallest)

= first quartile = (25th percentile)

Interquartile range (IQR)

• Spread in middle 50%

• Not affected by extreme values

• It is best used with other measurements such as the median and

• Given the set of values: 27, 18, 19, 12, 15, 1,

• The flat percentage raise increased

• A high kurtosis distribution has a sharper "peak"

Symmetrical vs. skewed distributions

Types of descriptive statistics

Scatterplots and correlation

Individual vs Group (Neighbourhood)

What type of relationship?

What type of relationship?

Each point represents something or

Date: Thursday 26th August 1130-1430 (Posted on Thursday)

Location: Remotely or on-campus (Brown & Orange & Red IT labs)

Assistance: Thursdays 1130-1420 and Thursdays 14:00-16:00 by

Due: Thursday 9th September at 1130 (upload on ClickUp)

Task: Sampling exercise and gaining familiarity with GeoDa and

Software: Excel, GeoDa and ArcPro

You might also like