Chapter One - Two Quantitative Geography
Chapter One - Two Quantitative Geography
If we let X equal the number of points earned by one math student at the end of a term, then X
is a numerical variable.
If we let Y be a person's party affiliation, then some examples of Y include Republican,
Democrat, and Independent.
Y is a categorical variable. We could do some math with values of X (calculate the average
number of points earned, for example), but it makes no sense to do math with values of Y
(calculating an average party affiliation makes no sense). Data are the actual values of the
variable.
They may be numbers or they may be words.
Datum is a single value.
Two words that come up often in statistics are mean and proportion.
1.2 . Measurement Level
The recording of the data involves special problems in geography, one of which is scale and
level of aggregation. There are four 1evels of measurement.
These are
Nominal,
Ordinal
Interval
Ratio.
A. Nominal Level
Nominal 1evel is the weakest data measurement. Numbers may be used to represent
an item or characteristic. Examples include: a college may designate majors by
numbers, i.e., BA in Geography =1, BA in History = 2, BA in Mathematics = 3 or gender
may be designated as male =1 and female =2.
Note that such data should not be treated as numerical, since relative size has no
meaning.
Thus, you can simply name or designate the gender category as female and male.
Measurement on nominal scale consists of simply placing each individual into one of a
number of different categories.
Quantitative Techniques and Spatial Analysis 2021 3
It doesn't give you any further information other than the designation of the individual
in the category.
In Nominal level Data that can only be classified into categories and cannot be
arranged in an ordering scheme.
Examples: Eye color, gender, religious affiliation. The categories are mutually exclusive, i.e. an
individual, object, or measurement is included in only one category. The categories are
exhaustive, i.e. an individual, object, or measurement appears in one of the categories.
B. Ordinal Level
Ordinal level involves data that may be arranged in some order, but differences
between data values cannot be determined or are meaningless. Example: During a taste
test of three soft drinks, Coca Cola was ranked number 1, Sprite was ranked number 2,
and Fanta was ranked number 3.
Measurement on an ordinal scale involves putting individuals into an order, ranking
them according to some criterion.
Some data are inherently of an ordinal nature, for example, data on preferences. In
rapidly expanding field of perception studies, geographers are often asking people to
rank towns, parts of the country, or photographs of different types of scenery in order
of preference.
Ordinal data is one step up from nominal data in the sense that it enables us to say
whether one individual comes before or after another along a scale.
The main difference between ordinal data and nominal data is that ordinal data contain
both an equality (=) and a greater-than (>) relationship, whereas the nominal data
contain only an equality (=) relationship.
In Ordinal level Numbers are used to rank. An example is wind forces at sea. Simple
arithmetic operations are not meaningfully applied to ordinal data. Another example is
excellent, good, fair and poor.
In summary, the properties of ordinal level data are:
1. The data categories are mutually exclusive and exhaustive.
2. Data categories are ranked or ordered according to the particular trait they possess.
C. Interval Level
Quantitative Techniques and Spatial Analysis 2021 4
Experimental data are data that are generated in the course of a controlled scientific
experiment. Data is analyzed using techniques such as calculation, reasoning,
discussion, presentation, visualization, or other forms of post-analysis.
Data can be qualitative or quantitative.
Qualitative data is descriptive information (it describes something)
Quantitative data is numerical information (numbers)
1.3.2. Spatial and Non-Spatial Data
Spatial data, also known as geospatial data, is a term used to describe any data related
to or containing information about a specific location on the Earth’s surface.
Non-spatial data, on the other hand, is data that is independent of geographic location.
This blog post will highlights the basics and difference between both spatial and non-
spatial data.
Spatial Data
It answers where things are.
It describes the absolute and relative location of geographical objects.
It is stored in a shape file or geodatabase.
Generally multi-dimensional and auto-correlated.
Satellite maps and scanned images help to obtain spatial data.
Relationships among spatial attributes are implicit. For example, boundaries 1 and 2 could be
neighbors, but cannot be explicitly represented.
Types of spatial data: Raster Data – Composed of grids or pixels and identified by rows and
columns.
Vector Data – Composed of points, lines, and polygons.
Examples of spatial data are maps, photographs, satellite images, scanned images, Roads
Rivers, contours, etc.
Non-spatial Data
It answers what and how much things are.
Characteristics of geographical features that are qualitative or quantitative in nature.
It is stored in a database table.
Generally one-dimensional and independent.
Quantitative Techniques and Spatial Analysis 2021 7
Forest managers, fire departments, environmental groups, and online media helps to obtain
non-spatial data.
Relationships among non-spatial attributes are explicit. For example, two different attributes
may be a part of, a subclass of, a member of, or represented in the form of arithmetic values
or orders.
Types of non-spatial data: Nominal Data, Ordinal Data, Interval Data, Ratio Data
Examples of non-spatial data are names, phone numbers, area, postal code, rainfall,
population, etc.
1.3.3. Sources of data
Sources of Data
The sources of data can be classified into two types: statistical and non-statistical.
? Statistical sources refer to data that is gathered for some official purposes, incorporate
censuses, and officially administered surveys.
? Non-statistical sources refer to the collection of data for other administrative purposes or
for the private sector.
The following are the two sources of data:
1. Internal sources
When data is collected from reports and records of the organization itself, they are known
as the internal sources.
For example, a company publishes its annual report’ on profit and loss, total sales, loans,
wages, etc.
2. External sources
When data is collected from sources outside the organization, they are known as the
external sources.
Types of Data
A) Primary data
Primary data means first-hand information collected by an investigator.
It is collected for the first time.
It is original and more reliable.
For example, the population census conducted by the government of India after every ten
years is primary data.
Methods of Collecting Primary Data
Quantitative Techniques and Spatial Analysis 2021 8
Unit two
2. Methods of Data Presentation
Raw statistical data must be presented in a suitable and summarized form without any loss
of relevant information so that it can be efficiently used for decision making.
Whenever there is a need to present statistical data, graphic aids can help communicate
this information to your audience more quickly.
The two graphic aids mostly used in research reports are tables and graphs.
Besides making the report easier to read and understand, graphic aids make attractive the
physical appearance of your research. .
Several types of statistical/data presentation tools (graphic aids) exist. These include bar
graphs, histograms, scattered diagrams, pie-chart, line graphs and tables.
2.1. Table
Tabulation is the process of summarizing classified or grouped data in the form of a table
so that it is easily understood and an investigator is quickly able to locate the desired
information.
A table is systematic arrangement of classified data in columns and rows.
Thus, a statistical table makes it Possible for the investigator to present a huge mass of
data in a detailed and orderly form.
Quantitative Techniques and Spatial Analysis 2021 9
It facilitates comparison and often reveals certain patterns in data which are otherwise not
obvious.
Classification and Tabulation, as a matter of fact, are not two distinct processes. Actually
they go together.
Before tabulation data are classified and then displayed under different columns and rows
of a table.
Advantages of Tabulation
Statistical data arranged in a tabular form serve the following objectives:
1. It simplifies complex data and the data presented are easily understood.
2. It facilitates comparison of related facts.
3. It facilitates computation of various statistical measures like measures of central
tendency, Dispersion and correlation.
4. It presents facts in minimum possible space unnecessary repetitions and
explanations are avoided. Moreover, the needed information can be easily located
5. Tabulated data are good for references and they make it easier to present the
information in the Form of graphs and diagrams.
2.2. Histogram
Histogram is simply a bar graph where the bar lengths are determined by the frequencies in
each class of a grouped frequency distribution.
Notice how the bar diagram above is represented by the histogram below having eight
interconnected bars that represent the numbers of farmers in each of the quantities of crop
production distribution.
Quantitative Techniques and Spatial Analysis 2021 10
In this graph, only one dependent variable is tracked, so there is only a single line
connecting all data points on the graph.
All points on the graph relate to the same item, and the only purpose of the graph is to
track the changes of that variable over time.
This graph cannot be used to compare the variable to another variable because only
variable is charted.
A pie chart (or a circle chart) is a circular statistical graphic, which is divided into slices
to illustrate numerical proportion.
In a pie chart, the arc length of each slice (and consequently its central angle and area) is
proportional to the quantity it represents.
Pie charts are very widely used in the business world and the mass media.
However, they have been criticized and many experts recommend avoiding them as
research has shown it is difficult to compare different sections of a given pie chart, or to
compare data across different pie charts. Pie charts can be replaced in most cases by
other plots such as the bar
2.7. Map
A map is a symbolic depiction emphasizing relationships between elements of some space, such
as objects, regions, or themes.
Many maps are static, fixed to paper or some other durable medium, while others are dynamic or
interactive.
The space being mapped may be two dimensional, such as the surface of the earth, three
dimensional, such as showing in relief maps or the interior of the earth, or even more abstract
spaces of any dimension, such as arise in modeling phenomena having many independent
variables.
Although the earliest maps known are of the heavens, geographic maps of territory have a very
long tradition and exist from ancient times.
Quantitative Techniques and Spatial Analysis 2021 15
Unit three
3. Measure of Central Tendency
Measures of Central Tendency Measures of central tendency provide us with a summary that
describes some central or middle point of the data. There are five important measures of central
tendency, viz).
3.1. Arithmetic Mean
Arithmetic mean is also called ‘mean’ or ‘average’. It is denoted by abar above the variable
being averaged. It is defined as the sum of all observations divided by the number of
observations
To find mean of ungrouped data it is obtained by adding the raw scores and dividing the sum by
the number of items. Suppose the raw scores are x1, x2, x3,…, xn
Quantitative Techniques and Spatial Analysis 2021 16
X=58
54 48 58 50 25 47 75 46 60 70
67 68 39 35 56 66 33 62 65 67
Suggested answer:
x f fx
10 3 30
Quantitative Techniques and Spatial Analysis 2021 18
15 6 90
20 5 100
25 4 100
30 2 60
Tota 20 380
l
380
20 =19
Exercise
The company pays its daily laborers 6.50. 7.50 Or 8.50 birr per day. There are 26 daily
laborers.
14 are paid at the 6.50-birr rate. 10 at 7.50-birr rate and 2 at the 8.50 birr rate.
What is the mean of these daily laborers?
Example 4. Calculate the mean marks in the distribution below.
Suggested answer
Quantitative Techniques and Spatial Analysis 2021 19
2. GM cannot be calculated when any of the observation in the data set is either negative or
zero
Quantitative Techniques and Spatial Analysis 2021 22
3.4. Median
Median: The midpoint of the values after they have been ordered from the smallest to the
largest, or the largest to the smallest. There are as many values above the median as below it in
the data array.
Note: For an even set of numbers, the median will be the arithmetic average of the two middle
numbers.
Example: Compute the median for the following data. The age of a sample of five college
students is: 21,25,19,20, and 22.
Arranging the data in ascending order gives: 19, 20,21,22,25. Thus, the median is 21.
The height of four basketball players, in inches, is 76, 73, 80, and 75.
Arranging the data in ascending order gives: 73, 75, 76, and 80. Thus the median is 75.5
Exercise 3.
1. The age at marriage for a sample of eight females in Addis Ababa is given below. 17, 24,
23, 16, 21, 8, 22 and 19. What is the median age at marriage?
The median is the halfway point in a data set. The symbol for the median is MD .Before one can
find the median; the data must be arranged in order. When the data set is order, it is a data array.
The median either will be a specific value in the data set of will fall between two values.
Steps in computing the median of a data array
1. Arrange the median of a Data array
2. Find the midpoints of the array.
Example
The weights of seven grade 9 students are 45, 50, 55, 48, 56, 49, and 47, .
Find the median
Step 1. Arrange the data in order 45, 47, 48, 49, 50, 55, 56,
22 | P a g e
Quantitative Techniques and Spatial Analysis 2021 23
Median for Grouped Data: To find the median for grouped data, first identify the class interval
which contains the median value or no observation (of the data set. To find such class interval,
23 | P a g e
Quantitative Techniques and Spatial Analysis 2021 24
Example
Using the frequency distortion given below, find the median
Class Frequency (f)
5.5- 10.5 1
10.5- 15.5 2
15.5- 20.5 3
20.5- 25.5 5
25.5- 30.5 4
30.5- 35.5 3
35.5- 40.5 2
N=20
24 | P a g e
Quantitative Techniques and Spatial Analysis 2021 25
The procedure for finding the median for grouped data is shown here.
A B C
Class Frequency (f) Cumulative Frequency (CF)
5.5- 10.5 1 1
10.5- 15.5 2 3
15.5- 20.5 3 6
20.5- 25.5 5 11
25.5- 30.5 4 15
30.5- 35.5 3 18
35.5- 40.5 2 20
N=20
Step 3. Find the class that contains the tenth value by using the cumulative
frequency distribution. This class is called the median class: it contains the
median.
A B C
Class Frequency (f) Cumulative frequency (CF)
5.5- 10.5 1 1
10.5- 15.5 2 3
15.5- 20.5 3 6 cƒ
Lm 20.5- 25.5 5 ƒ 11 Median
25.5- 30.5 4 15
30.5- 35.5 3 18
35.5- 40.5 2 20
N=20
25 | P a g e
Quantitative Techniques and Spatial Analysis 2021 26
Exercise
Let us assume that you have collected a data from a factory employing 80 workers during your
project work for this course. Let your data indicate that the daily wage of 20 workers is less than
10 Eth. Birr, of 30 workers is 10 to 20 Eth. Birr, of 14 workers is from 20 to 30 Eth. Birr, of 7
workers is 30 to 40 Eth. Birr and of the remaining 9 workers ranges from 40 to 50 Eth. Birr.
Then, calculate the median wage of the worke
26 | P a g e
Quantitative Techniques and Spatial Analysis 2021 27
3.5. Mode
The mode of a set of data (numbers) is the value that occurs most frequently.
The mode may not exist and even if it does exist it may not be unique.
Properties of Mode
1. Mode is indeterminate in ungrouped data possessing two or more observations occurring
for the maximum number of times. By definition, there is no mode in a set of data
wherein observations occur with the same frequency.
2. In the case of grouped data, mode is the same as the mid-point of the modal class when
the immediately preceding and succeeding classes have equal frequencies.
3. When the class preceding the modal class has higher frequency than that succeeding it.
The value of mode is more than the mid-point of the modal class. And vice versa.
4. Like median, the existence of unequal and open ended classes does not interfere with the
Computation of mode. However, mode becomes indeterminate where the modal class is
an open-ended class.
Further, mode is the least used measure of central tendency as compared to mean and median
Example
1. The weight gains of 10 experimental rats fed on a special diet were 1, 0, 3, 7, 11, 5, 9, 1,
and 4, find the modal weight gain.
2. A sample of 7 measurements of the thickness of a copper wire was 0.100, 0.010, 0.009,
0.008, 0.007, 0.009, 0.009 find the modal thickness?
3. The time in minutes that 12 patients spent in a doctor’s office were 20, 15, 18, 22, 10, 12,
16, 17, 19, 21, 23, 13 find the modal time spent in the doctor’s office.
Solution
1. Mode =1 2. Modes are = 0.008, 0.009 3. No mode Remark A distribution having only
one mode is said to be bimodal and a distribution that have two modes is called
bimodal. Notation The mode of a set of numbers X1, X2, X3, Xn is usually denoted
by xˆ (x hat).
27 | P a g e
Quantitative Techniques and Spatial Analysis 2021 28
Where
L, is the lower limit and Cm the width of the modal class
1 represents the difference, ·ignoring plus/minus sign; between the frequency of the modal
class and the class preceding it, and
2 is the difference between the frequency of the modal class and the class succeeding it
Illustration: For the distribution given in the table below, the computation of the mode
is as' shown herein. Since (1080 - 1089) class has the maximum (25) frequencies, it is the
modal class. With L. = 1080,
28 | P a g e
Quantitative Techniques and Spatial Analysis 2021 29
Exercise
For the following data set, find and interpret the result of median and mode
Age in year Number of birth Cumulative number of birth
14.5-19.5 677 677
19.5-24.5 1908 2585
24.5-29.5 1737 4332
29.5-34.5 1040 5362
34.5-39.5 294 5656
39.5-44.5 91 5747
44.5-49.5 16 5763
29 | P a g e
Quantitative Techniques and Spatial Analysis 2021 30
30 | P a g e