Demography 1
Demography 1
UNIT I
Demography: Demography is the scientific study of human population dynamics. It encompasses the
study of size, structure and distribution of population, and how populations change over time due to births,
deaths, migration and ageing. Demographic analysis can relate to whole societies or to groups defined by
criteria such as education, nationality, religion and ethnicity. In most countries demography is regarded
as a branch of either economics or sociology. Formal demography limits its object of study to the
measurement of population process, which is the more broad field of population studies to analyze the
relationship between economic, social, cultural and biological processes influencing the population.
Vital Statistics: Vital Statistics can be defined as the total process of registering, compiling and reporting
of the aggregate of vital events which have to do with an individual’s entrance into life (birth) or exit from
life (death) and change in the social and civil status (marriage, divorce, adoption, etc.) that may occur
during a specified duration of time, among the members of a population residing within a country or any
delimited territory during the same period.
Vital Statistics is defined as that branch of Biometry which deals with data and the laws of human
mortality, morbidity and demography.
Methods of obtaining Vital Statistics: the various methods of obtaining Vital Statistics are given below:
i) Registration method: The most important source of obtaining vital statistics data
is the registration method which consists in continuous and permanent recording of vital events pertaining
to births, deaths, marriages, migration, etc. These, data in addition to their statistical utility also have their
value as legal documents. Registration of births provide information on the place of birth, sex, age and
religion of the parents, legitimacy, number of previous issues& their sexes , father’s occupation and birth
place of parents. Similarly, death registration furnishes information on place of death, sex, age, marital
status, number of issues, birth place, occupation, and cause of death. Similar information is also obtained
with respect to marriages & migrants.
ii) Census Method: Almost in all the countries all over the world population census
is conducted at regular intervals of time, usually ten years. Census consists of complete enumeration of
the population of the particular area under study and collecting information from individual regarding age,
sex, marital status, occupation, religion and other economic and social characteristics. The main drawback
of the census method is that it provides vital statistics only for the census years and fails to give any
information about the vital events in the intercensal period.
1
ST18405DCE
UNIT I
Population: Population is a group of items, units or subjects which is under reference of study. A
population is constantly in a state of flux. During this process the size of population varies and develops
potentialities for unlimited growth. At any given time t , the growth of population in absolute numbers is
largely determined by its own size at the moment. Thus symbolically, if Pt is the population of defined
territory at time t , then
Pt +Dt = Pt + DPt , the population after the lapse of Dt , will depend upon its initial size Pt & the
duration Dt . If Pt is large, then the increase DPt will also tend to be large in absolute numbers and hence
the chances of Pt + Dt being different from Pt will be great even when Dt is small. On the other hand, if Pt
is very small, then even long interval Dt may lapse without any change being recorded. For empirical
analysis the interval of time usually chosen to study changes in a fairly large population is one year.
Cohort: A cohort is the aggregate of all units that experience a particular demographic event during a
specified time interval. Population can also be sub-divided on the basis of the year of birth of individuals.
All those who are born during a given year form a ‘birth cohort’ or simply the ‘cohort’ of that year. For
example, those who are age 20 in 2005, are the survivors of 1985. Thus, a population at a given time is
the aggregate of the survivors of different cohorts of people born during different years.
Experience shows that different cohorts are not alike in their characteristics as they would have
passed through different social, economic & demographic conditions. For this reason, cohort-wise analysis
of demographic data is necessary to understand the process of changes over time.
The term cohort can be used in respect of other conditions also. A marriage cohort then means all
those, who were married during a given period.
Radix: Rates and ratios are averages expressed as fractions. Death rate, obtained, for example
mathematically, is the average number of deaths per person &is always a value between 0&1. Such values
are always can be confusing and meaningless for ordinary use & give a false sense of accuracy. To avoid
these demerits, rates and ratios are conveniently expressed as per 100 persons or 1000 persons instead of
per person. The multiplier 100, 1000 etc., used to round off decimals is called the radix. Radix could be
thought of as the minimum sample size of the population that would be necessary to obtain a reliable
estimate of a particular rate or ratio. Usually, the radix chosen to describe the sex ratio is 100 & for death
rate it is taken as 1000. For mathematical manipulations the radix is kept as one.
Demographic Data: The major source of information on the demographic process of a country is the
census and the vital statistics registration system. Demographic sample surveys are carried out when these
2
ST18405DCE
UNIT I
data are not adequate. Secondary data from governmental and other agencies can at times provide useful
information on certain aspects of population not covered by census and vital statistics.
The census is the official survey conducted periodically by the governments to count the people
of its country & collect the information about them. Because of the cost involved & the elaborate legal &
administrative arrangements that extent over several years before and after the actual count, censuses are
conducted only once in 10 years. Therefore, for intervening periods, the size of populations will have to
be estimated.
Statistics on the vital events such as birth, death and marriages are usually maintained by other
agencies of the government. The network of offices all over the country registers the information on these
events, when the event had occurred. The national agency collects these details and publishes annually or
monthly, the consolidated statistics. The total number of events and certain characteristics of the
individuals associated with the events are made available in these reports.
Demographers may at times find that secondary sources such as school enrolment registers,
electoral rolls, and employment/unemployment records, could also provide needed data. When the above
sources are found inadequate and when specific problems have to be studied intensively, special surveys
are carried out.
Demographic Balancing Equation: If birth and death are the two most fundamental demographic
processes, migration is probably the third. The size of the world’s population is (at least at present)
completely determined by birth and death rates, but the population in any particular region or locale is
also determined by the net migration. These three processes are expressed in the demographic balancing
equation – the increase (or decrease) in a population as the algebraic sum of births, deaths, immigration,
and emigration. In general, we can write the difference between two population growths at two time points
0 and t as equal to
Pt - P0 = B - D + I - E (1)
where B, D, I, and E represent the number of births, deaths, immigrations and emigrations in absolute
numbers. Equation (1) is called the balancing equation of population growth and is very fundamental in
concept as it is an equation relating population change to its determinants.
Example: The Demographic Balancing Equation for the United States (From Mcfalls, 1991) (Numbers
in thousands)
Starting Population + (Births-Deaths) + (Immigration-Emigration) = Ending Population
3
ST18405DCE
UNIT I
The expression on the right side of (1) is known as error of closure in a different context, i.e. when the
vital statistics or migration data are not completely reliable.
If rates are more dependable then growth of the period arrived at by comparing population totals
could be checked with the growth rates calculated using vital rates. Thus if
1 æP ö
r= logçç t ÷÷
t è P0 ø
is the rate of growth base upon population totals and
r¢ = b - d + i - e
is the average rate of the intercensal period based on birth, death and migration rates, then, when counts
are correct, r and r ¢ will be close to each other.
4
ST18405DCE
UNIT I
Chandrasekharan-Deming Formula: Several kinds of checks are available to verify the accuracy of
published reports. The method suggested by Chandrasekharan and Deming to evaluate the coverage of
vital registration and to estimate the undercount is different from the conventional approaches. The method
is described below: -
The Chandrasekharan-Deming method envisages a second agency collecting on vital events for a
pre-assigned period. The second agency could even be a retrospective survey. The two sets of data are
matched event and the results could be categorized into a 2 ´ 2 table as follows:
Registration
Registered Not-registered Total
Information A C A+C
Independent gathered
Agency Could not B D B+D
detect
Total A+B C+D N=A+B+C+D
Here A is the number of events that were registered and was also gathered by the independent agency. B
is the number of events that were registered but failed to be detected by the agency. C is the number of
events detected by the agency which went unregistered. Thus, (A+B) is the total number of events
registered and (A+C) is the total number of events collected by the agency. Naturally, there is the
possibility that a few events could have been missed by both the Registration & the Agency, and D is that
unknown number. The problem is of estimating D and once D is known, the amount of undercount could
be estimated.
If constant proportion of undercount is assumed for both agencies, or in statistical terminology, if
the chance of detecting an event is independent of not detecting that event for both agencies, then we have
A C A+C
= =
A+ B C + D N
On rearranging and simplifying, we can get
BC
Dˆ =
A
where D̂ is the estimate of D. as the right side of the above equation is known, D is easily estimated. This
simple formula of Chandrasekharan and Deming has turned ot to be very useful in evaluating vital
5
ST18405DCE
UNIT I
there is no universal tendency to report ages as 25, 30, 35, etc., the numerator should be approximately
1/5th of the total population in the range of 23 to 62 years given in the denominator, because ‘0’ and ‘5’
are just 2 out of 10 digits 0, 1, …, 9. Thus if there is no age heaping at 0 and 5, then the index have a value
of 100, as the theoretical minimum. The other extreme is obtained when there is complete age heaping;
that is, when the entire population is concentrated at ages 25, 30… etc. with none reporting their ages as
23, 24, 26, … , 29,31, …etc; in which case the index will have a value of 500.
6
ST18405DCE
UNIT I
Studies of age have shown that preference for age is not limited to 0 & 5 but to some other digits
also but with no particular pattern. In such situations the index as defined by Whipple is not helpful.
Further, the index considers only the arbitrary interval 23 to 62 years and not the entire life span. The most
serious drawback however, is that Whipple’s index does not take into consideration the decreasing nature
of the age distribution due to depletion by death. In other words population of younger ages would receive
greater weight-age than at the older ages which would bias the index. To over these drawbacks Myer has
suggested a modified index.
Myer’s Index: Myer’s approach is to derive a ‘blended population’ to overcome the decreasing nature of
the population curve and to calculate heaping at each age using the entire life span.
Blended population
The blended population is calculated as follows. The first is to write the population in a matrix
form and sum the rows as given below:
Population Sum of each row
at each age From 1 to 9 From 2 to 9
P10 P20 … P90 X0 Y0
P11 P21 … P91 X1 Y1
P12 P22 … P92 X2 Y2
…………………………..... ……….. ………..
7
ST18405DCE
UNIT I
1 2X1 +8Y1 B1
2 3X2 +7Y2 B2
… … ………..
… …. ………..
9 10X9 +0Y9 B9
9
Note that the total weights are same for each digit. Any Bj must be 10% of åB
j =0
j when there is
Bj
Where bj = 9
´ 100
åB
j =0
j
When there is no absolute age heaping at any digit each b j will be 10 and so the index will be equal to
zero. The extreme is when there is age heaping at one digit with all other digits being not preferred in
which case Myer’s index will be take the value 180.
Myer’s index too is not above criticism. Like the Whipple’s index this one too does not have any
sound theoretical basis other than perhaps, rectilinearity of the age distribution which as we know is not
very realistic. And digit preference is not the only tendentious bias in reported age distribution, and this
index cannot assess the other biases.
There are several other measures like Baachi’s index and Ramachandran’s index to measure age
heaping. But Myer’s index is the simplest among them and is still in common use particularly because
other indices have not established their superiority.
Example: Using age distribution data for a West African country (1960), determine the extent of heaping
on
(a) Digit ‘0’ and ‘5’.
(b) Digit ‘0’ only
(c) Digit ‘5’ only.
Age Population Age Population
23 38687 45 44654
8
ST18405DCE
UNIT I
Example: Use the Myer’s blended index to assess the quality of age data given below-
9
ST18405DCE
UNIT I
Digit 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
0 386 417 307 190 89 80 50 40 10
1 133 93 90 89 41 35 33 10 2
2 341 227 100 40 30 25 15 6 4
3 223 160 90 46 28 22 8 5 1
4 201 138 50 38 25 10 8 5 3
5 298 238 201 154 148 65 25 2 3
6 198 105 70 49 31 18 12 6 4
7 166 86 50 45 25 9 5 4 2
8 255 100 90 40 28 10 5 5 2
9 132 85 40 20 8 6 2 2 2
Solution:
Dig Sum Weight Product Sum Weight Produc Blende Percen Deviation Remarks
it (10- s (20-99) s t d sum t Dist from 10%
99) bj bj-10
0 1569 1 1569 1183 9 10647 12216 21.7 11.7 Preference
1 526 2 1052 393 8 3144 4196 7.5 -2.5 Avoidance
2 788 3 2364 447 7 3129 5493 9.8 -0.2 Avoidance
10
ST18405DCE
UNIT I
M .I = å | b j - 10 |= 39.3
Dependency Ratio: A measure of the portion of a population which is composed of dependents (people
who are too young or too old to work). The dependency ratio is equal to the number of individuals aged
below 15 or above 64 divided by the number of individuals aged 15 to 64, expressed as a percentage. A
rising dependency ratio is a concern in many countries that are facing an aging population, since it
becomes difficult for pension and social security systems to provide for a significantly older, non-working
population.
A worked example should make this clearer. Pakistan, which is a developing country, has 41% of its
population less than 15 years and 4% above 64 years, the dependency ratio is given by
41 + 4 45
Dependency ratio = ´ 100 = ´ 100 = 81.8%
100 - (41 + 4) 55
New Zealand, a developed country, has23% of its population less than 15 years and 12% above 64 years,
the dependency ratio is given by
23 + 12 35
Dependency ratio = ´ 100 = ´ 100 = 53.8%
100 - (23 + 12) 65
Countries that have a high dependency ratio have more people who are not of working age and few who
are working and paying taxes. The higher the number, the more people need looking after.
11