Module 1 - 4
MODULE-1
INTRODUCTION TO STATISTICS
Definitions
“Statistics are numerical statements of facts in any department of enquiry placed in relation
to each other” – Bowley
“Statistics are the classified facts representing the conditions of the people in a state,
especially those facts which can be stated in numbers or in tables of numbers or in any tabular
or classified arrangement” – Webster
Limitations of statistics
1. Statistics does not study qualitative phenomena
2. Statistics does not study individuals
3. Statistical laws are not exact
4. Statistics is liable to be misused
COLLECTION OF DATA:
Primary Data:
“The data which are originally collected by an investigator or agency for the first time for any statistical
investigation & used by them in the statistical analysis are termed as primary data.”
1. Direct personal investigation: - In this method the investigator collects the data personally from
the source concerned. The investigator has to go to the field personally to make
enquiries & collect the information from the respondents. This method should be used only if
the investigation is local, i.e. confined to a single locality or area.
2. Indirect oral interviews: - In this method the information about a person is collected by interviewing
personal friends, relatives or neighbors who know that person thoroughly well. In these types of enquiries,
factual data on different problems are collected by interviewing persons who are directly or indirectly
concerned with the subject matter of the enquiry & who are in possession of the requisite
information. A list of questions is prepared & put to these persons, known as witnesses, & their answers
are recorded. This procedure is usually adopted by the enquiry committees or commissions appointed by the
government.
3. Information received through local agencies: - In this method the information is not collected
formally by the investigator. Instead, the investigator appoints local agents, called
correspondents, in different parts of the field of enquiry. The correspondents in
different regions collect the information in their own ways & submit their reports
periodically to the central or head office, where the data are processed for final analysis. This
technique of data collection is usually employed by newspaper or periodical agencies which require
information in different fields like sports, economic trends, business, stock & share markets, policies
& so on.
4. Mailed questionnaire method: - This method consists in preparing a questionnaire which is mailed to
the respondents with a request for a quick response within the specified time. A very polite covering
note is attached, explaining in detail the aims & objectives of collecting the information & also the
operational definitions of various terms & concepts used in the questionnaire.
Respondents are requested to extend their full co-operation by furnishing the correct replies &
returning the questionnaire duly filled in, in time. The information supplied is kept strictly confidential. In this method
the questionnaire is the only medium of communication between the investigator & the respondents.
Secondary Data:
The data which have already been collected & processed by some agency or person & are taken over from there
& used by any other agency for their statistical work are termed as secondary data.
Methods of collecting secondary Data:-
1. Published data
2. Unpublished data
Averages are the measures which condense a huge unwieldy set of numerical data into a single
numerical value which is representative of the entire distribution.
Averages are sometimes referred to as measures of central tendency.
“Averages are statistical constants which enable us to comprehend in a single effort the significance
of the whole” – A.L Bowley.
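Ex: the arithmetic mean of 3, 5, 10, 15, 19 and 25 is
$\bar{X} = \frac{3 + 5 + 10 + 15 + 19 + 25}{6} = \frac{77}{6} = 12.83$
In general, if $X_1, X_2, \dots, X_n$ are the given ‘n’ observations, then their arithmetic mean, usually
denoted by $\bar{X}$, is
$\bar{X} = \frac{X_1 + X_2 + \dots + X_n}{n} = \frac{\sum X}{n}$
In case of a frequency distribution,
$\bar{X} = \frac{\sum fX}{N} = \frac{\sum fX}{\sum f}$
where $N = \sum f$ is the total frequency.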
1. Multiply each value of X or the mid value of the class (in case of grouped or continuous
frequency distribution) by the corresponding frequency f.
2. Obtain the total of the products obtained in step 1 to get ∑fX
3. Divide the total obtained in step 2 by N, the total frequency.
$\bar{X} = \frac{\sum fX}{N} = \frac{865}{9} = 96.1$
2. The following is the frequency distribution of the number of telephone calls received in 245
successive one- minute intervals at an exchange.
Number of calls 0 1 2 3 4 5 6 7
Frequency 14 21 25 43 51 40 39 12
Obtain the mean number of calls per minute.
Sol:
No of calls (X) Frequency fx
0 14 0
1 21 21
2 25 50
3 43 129
4 51 204
5 40 200
6 39 234
7 12 84
Total N = 245 ∑fX = 922
$\bar{X} = \frac{\sum fX}{N} = \frac{922}{245} = 3.763$ calls/min
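The three steps for the mean of a frequency distribution can be checked with a short script. The following is a minimal Python sketch, not part of the original notes; the function name `frequency_mean` is an illustrative assumption. It reuses the telephone call data from the solved problem above.

```python
def frequency_mean(values, freqs):
    """Arithmetic mean of a frequency distribution: sum(f * X) / sum(f)."""
    total_fx = sum(f * x for x, f in zip(values, freqs))  # steps 1 and 2: sum of f * X
    n = sum(freqs)                                        # total frequency N
    return total_fx / n                                   # step 3: divide by N

calls = [0, 1, 2, 3, 4, 5, 6, 7]
freqs = [14, 21, 25, 43, 51, 40, 39, 12]
mean_calls = frequency_mean(calls, freqs)
print(round(mean_calls, 3))  # 3.763
```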
Step deviation method is a method of computing the A.M which consists in taking the deviations
(differences) of the given observations from any arbitrary value A and dividing them by the common
class width h. The formula to calculate the mean is
$\bar{X} = A + h \cdot \frac{\sum fd}{N}$
1. Compute d = (X – A)/h, ‘A’ being any arbitrary number and ‘h’ the common magnitude
of the classes.
2. Multiply ‘d’ by the corresponding frequency ‘f’ to get fd
3. Find the sum of the products obtained in step 2 to get ∑fd
4. Divide the sum obtained in step 3 by N, the total frequency
5. Multiply the value obtained in step 4 by ‘h’.
6. Add ‘A’ to the value obtained in step 5
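The six steps above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `step_deviation_mean` is hypothetical, the deviation is taken as d = (X - A)/h, and the choice A = 25, h = 5 is arbitrary. The data are the English marks from the first problem below; the result is compared with the direct method to show that both agree.

```python
def step_deviation_mean(values, freqs, a, h):
    """Mean by the step deviation method: A + h * sum(f*d) / N, with d = (X - A) / h."""
    d = [(x - a) / h for x in values]                 # step 1
    sum_fd = sum(f * di for f, di in zip(freqs, d))   # steps 2 and 3
    n = sum(freqs)
    return a + h * (sum_fd / n)                       # steps 4, 5 and 6

marks = [5, 10, 15, 20, 25, 30, 35, 40, 45, 50]
students = [20, 43, 75, 67, 72, 45, 39, 9, 8, 6]

mean_step = step_deviation_mean(marks, students, a=25, h=5)
mean_direct = sum(f * x for x, f in zip(marks, students)) / sum(students)
print(round(mean_step, 2), round(mean_direct, 2))
```

Any other choice of A gives the same mean; choosing A near the middle of the distribution simply keeps the numbers small when working by hand.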
PROBLEMS
2. Calculate the mean of the following marks obtained by students in English using direct
method & step deviation method.
Marks 5 10 15 20 25 30 35 40 45 50
No of students 20 43 75 67 72 45 39 9 8 6
5. Calculate the average marks by the step deviation method from the following data.
Marks 0-10 10-20 20-30 30-40 40-50 50-60
No of students 42 44 58 35 26 15
6. From the following data of income distribution calculate the arithmetic mean. It is given
that the total income of persons in the highest group is Rs 435 and none is earning less
than Rs 20
Income (Rs) No of persons
Below 30 16
Below 40 36
Below 50 61
Below 60 76
Below 70 87
Below 80 95
80 & above 5
7. Find the missing frequency from the following series, if the value of the Arithmetic
average is 33
X 10 12 60 70 40
Y 5 10 ? 2 5
8. From the following data find the missing frequency when the mean is 15.38
Size 10 12 14 16 18 20
Frequency 3 7 ? 20 8 5
9. A certain number of salesmen were appointed in different territories and the following
data were compiled from their sales reports. If the average sale is believed to be Rs 19920,
find the missing frequency.
Sales (‘000) 4-8 8-12 12-16 16-20 20-24 24-28 28-32 32-36 36-40
No of salesmen 11 13 16 14 ? 9 17 6 4
10. Find the missing frequencies of the following series, if the arithmetic average is 39.5 and
the total number of items is 100
11. From the following frequency distribution of 100 families the mean is 50. Find the unknown
frequency f1 and f2 for classes 20-40 and 60-80
Expenditure 0-20 20-40 40-60 60-80 80-100
No of families 14 f1 27 f2 15
MEDIAN
“The median is that value of the variable which divides the group in two equal parts, one part
comprising all the values greater and the other, all the values less than median”
Calculation of median
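Case i) Frequency distribution: In case of a frequency distribution where the variable takes the
values X1, X2, …, Xn with respective frequencies f1, f2, …, fn, the cumulative frequency
distribution facilitates the calculations. The steps involved are:
1. Prepare the less than cumulative frequency (c.f.) distribution.
2. Find N/2, where N = ∑f is the total frequency.
3. Find the c.f. just greater than N/2.
4. The value of X corresponding to that c.f. is the median.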
PROBLEMS
1. Eight coins were tossed together and the number of heads resulting was noted. The
operation was repeated 256 times and the frequency distribution of the number of heads
is given below.
No of heads 0 1 2 3 4 5 6 7 8
Frequency 1 9 26 59 72 52 29 7 1
Sol:
Computation of median
X F Less than cf
0 1 1
1 9 10
2 26 36
3 59 95
4 72 167
5 52 219
6 29 248
7 7 255
8 1 256
∑f = N = 256
N/2 = 256/2 = 128
The c.f. just greater than 128 is 167 and the value of X corresponding to 167 is 4. Hence the median
number of heads is 4.
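The cumulative frequency procedure used in this solution can be sketched in Python. This is an illustrative sketch only; the function name `discrete_median` is an assumption, and the data are the coin tossing frequencies from the problem above.

```python
from itertools import accumulate

def discrete_median(values, freqs):
    """Median of a discrete frequency distribution via the less-than c.f."""
    cum = list(accumulate(freqs))   # less than cumulative frequencies
    half = cum[-1] / 2              # N / 2
    for x, cf in zip(values, cum):
        if cf > half:               # first c.f. just greater than N / 2
            return x

heads = [0, 1, 2, 3, 4, 5, 6, 7, 8]
freqs = [1, 9, 26, 59, 72, 52, 29, 7, 1]
median_heads = discrete_median(heads, freqs)
print(median_heads)  # 4
```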
Case ii) Continuous frequency distribution
Median = $L + \frac{h}{f}\left(\frac{N}{2} - C\right)$
Where
L is the lower limit of the median class
f is the frequency of the median class
h is the magnitude or width of the median class
N is the total frequency
C is the c.f. of the class preceding the median class
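As a rough illustration of the formula for a continuous frequency distribution, the sketch below locates the median class from the cumulative frequencies and then interpolates inside it. The function name `grouped_median` is hypothetical, and the data (classes 0-10, 10-20, ..., 50-60 with a common width of 10) are borrowed from the marks distribution in the arithmetic mean problems, purely as sample input.

```python
from itertools import accumulate

def grouped_median(lower_limits, width, freqs):
    """Median = L + (h / f) * (N/2 - C) for equal-width continuous classes."""
    n = sum(freqs)
    cum = list(accumulate(freqs))
    for i, cf in enumerate(cum):
        # Median class: first class whose c.f. reaches N/2. (If a c.f. equals N/2
        # exactly, this class and the next one give the same median value.)
        if cf >= n / 2:
            c = cum[i - 1] if i > 0 else 0   # c.f. of the preceding class
            return lower_limits[i] + width * (n / 2 - c) / freqs[i]

lower_limits = [0, 10, 20, 30, 40, 50]
freqs = [42, 44, 58, 35, 26, 15]
median_marks = grouped_median(lower_limits, 10, freqs)
print(round(median_marks, 2))  # 24.14
```

Here N = 220, so N/2 = 110; the median class is 20-30 with L = 20, f = 58 and C = 86.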
3. The following table gives the marks obtained by 50 students in economics. Find the
median.
Marks 10-14 15-19 20-24 25-29 30-34 35-39 40-44 45-49
No of students 4 6 10 5 7 3 9 6
4. The following table shows the age distribution of persons in a particular region.
5. Find the missing frequency from the following distribution of daily sales of shops, given
that the median sale of shops is Rs 2400
6. In the frequency distribution of 100 families given below, the number of families
corresponding to expenditure groups 20-40 and 60-80 are missing from the table.
However the median is known to be 50. Find the missing frequencies.
8. Calculate the median from the following data. 7 marks (July 2008) (Jan 2007)
10. Expenditure of 1000 families is given as under; the median of the distribution is Rs 87.
Calculate the missing frequencies.
Expenditure 40-59 60-79 80-99 100-119 120-139
No of families 50 ? 500 ? 50
MODE
Mode is the value which occurs most frequently in a set of observations and around which the
other items of the set cluster densely.
According to A.M. Tuttle, “mode is the value which has the greatest frequency density in its
immediate neighborhood”
In case of a continuous frequency distribution, the class corresponding to the maximum frequency
is called the modal class and the value of mode is obtained by the formula
Mode = $L + \frac{h(f_1 - f_0)}{2f_1 - f_0 - f_2}$
Where
L is the lower limit of the modal class
h is the width of the modal class
f1 is the frequency of the modal class
f0 is the frequency of the class preceding the modal class
f2 is the frequency of the class succeeding the modal class
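A small Python sketch of the mode of a continuous frequency distribution follows. It assumes the usual interpolation formula Mode = L + h(f1 - f0)/(2·f1 - f0 - f2), where h is the class width; the function name `grouped_mode` is hypothetical and the sample data (classes 0-10, ..., 50-60) are taken from the marks distribution in the arithmetic mean problems.

```python
def grouped_mode(lower_limits, width, freqs):
    """Mode = L + h * (f1 - f0) / (2*f1 - f0 - f2) for equal-width classes."""
    i = freqs.index(max(freqs))                      # modal class (first maximum)
    f1 = freqs[i]
    f0 = freqs[i - 1] if i > 0 else 0                # preceding class frequency
    f2 = freqs[i + 1] if i < len(freqs) - 1 else 0   # succeeding class frequency
    return lower_limits[i] + width * (f1 - f0) / (2 * f1 - f0 - f2)

lower_limits = [0, 10, 20, 30, 40, 50]
freqs = [42, 44, 58, 35, 26, 15]
mode_marks = grouped_mode(lower_limits, 10, freqs)
print(round(mode_marks, 2))  # 23.78
```

The sketch treats a missing neighbouring class as having frequency 0, which is one common convention when the modal class is the first or last class.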
1. Find the value of mean, mode and median from the data given below.
2. Find the value of mean, mode and median from the data given below. 12 marks (Jan
2009,Jan 2010)
Weight (kg) 93-97 98-102 103-107 108-112 113-117 118-122 123-127 128-132
No of students 3 5 12 17 14 6 3 1
3. The median and mode of the following wage distribution are known to be Rs 33.5
and Rs 34 respectively. Three frequency values from the table are missing. Find
out those values. 5 marks (July 2008, Jan 2009)
DISPERSION
“Dispersion is the measure of the variation of the items” – A.L. Bowley.
“Dispersion is a Measure of the extent to which the individual items vary” – L R Connor.
Measures of Dispersion:
The various measures of dispersion are
1. Range
2. Quartile deviation
3. Mean deviation
4. Standard deviation
I. RANGE
Range is defined as the difference between the two extreme observations of the
distribution i.e. the greatest (L) and the smallest (S) observation of the distribution.
Range = L – S
Co-efficient of Range is the ratio of the difference between the two extreme observations of the
distribution to their sum.
Co-efficient of Range = $\frac{L - S}{L + S}$
Ex: 1. Calculate the range and the co efficient of range of A’s Monthly earnings for a year
Range = L-S
= 17500-13900
= Rs. 3,600
Co-efficient of Range = $\frac{L - S}{L + S} = \frac{17500 - 13900}{17500 + 13900} = \frac{36}{314} = 0.115$
Solution: -
Since age is a continuous variable, convert the given classes into continuous classes.
The first class is 15.5 – 20.5 and the last class is 30.5 – 35.5
Largest value (L) = 35.5, smallest value (S) = 15.5
Range = 35.5 – 15.5
= 20 yrs
Co-efficient of Range = $\frac{35.5 - 15.5}{35.5 + 15.5} = \frac{20}{51} = 0.39$
Quartile deviation is a measure of dispersion based on the upper quartile Q3 and the lower quartile Q1.
Inter-quartile range = Q3 – Q1
Quartile deviation is obtained from the inter-quartile range on dividing by 2 and hence is also known
as the semi inter-quartile range. Thus
Quartile Deviation (Q.D) = $\frac{Q_3 - Q_1}{2}$
For comparative studies of variability of two distributions we need a relative measure which is
known as the co-efficient of quartile deviation and is given by
Coefficient of Q.D = $\frac{Q_3 - Q_1}{Q_3 + Q_1}$
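The three quartile based measures can be illustrated with a short Python sketch. It assumes that Q1 and Q3 of a continuous frequency distribution are found in the same way as the median, with N/4 and 3N/4 in place of N/2; this interpolation rule and the function name `grouped_quantile` are assumptions, not taken from the notes. The sample data are the classes 0-15, 15-30, ..., 90-105 of the first problem below.

```python
from itertools import accumulate

def grouped_quantile(lower_limits, width, freqs, fraction):
    """Value below which `fraction` of the total frequency lies (equal-width classes)."""
    target = fraction * sum(freqs)          # N/4 for Q1, 3N/4 for Q3
    cum = list(accumulate(freqs))
    for i, cf in enumerate(cum):
        if cf >= target:                    # class containing the quartile
            c = cum[i - 1] if i > 0 else 0  # c.f. of the preceding class
            return lower_limits[i] + width * (target - c) / freqs[i]

lower_limits = [0, 15, 30, 45, 60, 75, 90]
freqs = [8, 26, 30, 45, 20, 17, 4]

q1 = grouped_quantile(lower_limits, 15, freqs, 0.25)
q3 = grouped_quantile(lower_limits, 15, freqs, 0.75)
iqr = q3 - q1                       # inter-quartile range
qd = iqr / 2                        # quartile deviation
coeff_qd = (q3 - q1) / (q3 + q1)    # coefficient of quartile deviation
print(q1, q3, iqr, qd, round(coeff_qd, 3))
```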
1. Find inter-quartile range, quartile deviation and coefficient of quartile deviation for the
following distribution:
Class Interval 0-15 15-30 30-45 45-60 60-75 75-90 90-105
F 8 26 30 45 20 17 4
2. Find inter-quartile range, quartile deviation and coefficient of quartile deviation for the
following distribution:
Marks 10-20 20-30 30-40 40-50 50-60 60-70 70-80 80-90
No of students 60 45 120 25 90 80 120 60
“Mean Deviation is the Average amount of scatter of items in a distribution from either mean or
the median, ignoring the signs of the deviation. The average that is taken of the scatter is an
arithmetic mean which accounts for the fact that this measure is often called the mean deviation”
MD (about Mean) = $\frac{1}{N}\sum f\,|x - M|$
MD (about Median) = $\frac{1}{N}\sum f\,|x - M_d|$
MD (about Mode) = $\frac{1}{N}\sum f\,|x - M_o|$
Short cut Method of computation of Mean Deviation:-
Relative Measure of Mean Deviation: The relative measure of dispersion is called the co-efficient
of mean deviation and is given by
Co-efficient of M.D = M.D / (average about which it is calculated, i.e. Mean, Median or Mode)
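A minimal Python sketch of the mean deviation about the mean and its coefficient is given here. The function name `mean_deviation` is hypothetical; class mid values are used for x, and the sample data are the class intervals 50-100, ..., 300-350 of the first problem below.

```python
def mean_deviation(mid_values, freqs, about):
    """Mean deviation: (1/N) * sum(f * |x - about|), ignoring the signs of deviations."""
    n = sum(freqs)
    return sum(f * abs(x - about) for x, f in zip(mid_values, freqs)) / n

mid_values = [75, 125, 175, 225, 275, 325]   # mid values of 50-100, ..., 300-350
freqs = [7, 18, 25, 31, 15, 4]

mean = sum(f * x for x, f in zip(mid_values, freqs)) / sum(freqs)
md_mean = mean_deviation(mid_values, freqs, about=mean)
coeff_md = md_mean / mean   # coefficient of M.D about the mean
print(mean, md_mean, round(coeff_md, 3))
```

Passing the median or the mode as `about` gives the mean deviation about those averages instead.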
PROBLEMS:-
1. Calculate the Mean deviation from Mean for the following data
Class interval 50 - 100 100 - 150 150 -200 200-250 250 - 300 300 - 350
Frequency 7 18 25 31 15 4
3. Find the Mean deviation from the Mean for the following data
4. Calculate Mean deviation from the Median for the following data
5. From the following series determine the value of the mean deviation and its co-efficient
from the median
Marks 0-10 10-20 20-30 30-40 40-50 50-60 60-70
No of students 4 8 11 15 12 6 3
6. From the following frequency distribution find the mean deviation and co-efficient of MD
from mode.
Heights No of workers
100-104 4
105-109 14
110-114 60
115-119 138
120-124 206
125-129 298
130-134 380
135-139 476
140-144 500
145-149 430
150-154 260
155-159 128
160-164 66
165-169 28
170-174 12
STANDARD DEVIATION:
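$\sigma = \sqrt{\frac{\sum (x - \bar{x})^2}{N}}$
$\sigma = \sqrt{\frac{\sum f(x - \bar{x})^2}{N}}$
Variance is the mean of the squared deviations about the mean of a series. Variance is the square
of the S.D and it is denoted by $\sigma^2$.
$\sigma^2 = \frac{\sum f(x - \bar{x})^2}{N}$
The mean square deviation about any value A is
$s^2 = \frac{\sum f(x - A)^2}{N}$
where A is any arbitrary number.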
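The variance and standard deviation of a frequency distribution can be sketched in Python as below. The helper name `mean_square_deviation` and the sample data (mid values 75, 125, ..., 325 with frequencies 7, 18, 25, 31, 15, 4) are illustrative assumptions. The last lines also check the standard identity σ² = s² − (x̄ − A)², which is not stated in the notes but links the mean square deviation about an arbitrary A to the variance.

```python
from math import sqrt

def mean_square_deviation(values, freqs, about):
    """(1/N) * sum(f * (x - about)^2); equals the variance when `about` is the mean."""
    n = sum(freqs)
    return sum(f * (x - about) ** 2 for x, f in zip(values, freqs)) / n

mid_values = [75, 125, 175, 225, 275, 325]
freqs = [7, 18, 25, 31, 15, 4]

mean = sum(f * x for x, f in zip(mid_values, freqs)) / sum(freqs)
variance = mean_square_deviation(mid_values, freqs, about=mean)
sd = sqrt(variance)

A = 175                                            # arbitrary origin
s2 = mean_square_deviation(mid_values, freqs, about=A)
variance_from_A = s2 - (mean - A) ** 2             # sigma^2 = s^2 - (mean - A)^2
print(round(variance, 2), round(sd, 2))
```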
PROBLEMS
1. Calculate the Mean and SD from the following data. 6 marks (January 2010)
Coefficient of Variation
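Coefficient of variation is the percentage variation in the mean, the standard deviation being considered
as the total variation in the mean.
For comparing the variability of two distributions we compute the coefficient of variation for each
distribution. The distribution with the smaller coefficient of variation is the more stable one.
C.V = $\frac{\sigma}{\text{Mean}} \times 100$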
1. From the prices X and Y of shares A and B respectively given below, state which share is
more stable in value
Price of share X 55 54 52 53 56 58 52 50 51 49
Price of share Y 108 107 105 105 106 107 104 103 104 101
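As a hedged illustration of how the coefficient of variation is used for comparison, the sketch below computes C.V = (σ / Mean) × 100 for the two share price series of the problem above. It uses `statistics.pstdev`, the population standard deviation (divisor N), to match the formula for σ in these notes; the helper name `coefficient_of_variation` is an assumption.

```python
from statistics import mean, pstdev

def coefficient_of_variation(data):
    """C.V = (sigma / mean) * 100, with sigma computed using divisor N."""
    return pstdev(data) / mean(data) * 100

share_x = [55, 54, 52, 53, 56, 58, 52, 50, 51, 49]
share_y = [108, 107, 105, 105, 106, 107, 104, 103, 104, 101]

cv_x = coefficient_of_variation(share_x)
cv_y = coefficient_of_variation(share_y)
more_stable = "Y" if cv_y < cv_x else "X"
print(round(cv_x, 2), round(cv_y, 2), more_stable)
```

The series with the smaller C.V is taken as the more stable one.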
UNIT-2
CORRELATION AND REGRESSION
“The correlation is a statistical tool which studies the relationship between two variables”
“When the relationship is a quantitative nature, the appropriate statistical tool for
discovering and measuring the relationship and expressing it in a brief formula is known as
correlation” – Croxton & Cowden.
Two variables are said to be correlated if a change in one variable results in a corresponding
change in the other variable.
TYPES OF CORRELATION:
1. Positive & Negative correlation: Correlation is said to be positive or direct if the values of the
two variables deviate in the same direction i.e. if an increase (or decrease) in the value of one
variable results in a corresponding increase (or decrease) in the value of the other variable.
Ex: Height and weights
The family income and expenditure
Correlation is said to be negative or inverse if the variables deviate in the opposite direction
i.e. if the increase or decrease in the value of one variable results in corresponding decrease or
increase in the value of other variable.
Ex. Price and demand of a commodity.
2. Linear & Non-linear correlation: The correlation between two variables is said to be linear if a
unit change in one variable results in a constant change in the value of the other variable.
Ex.
X 1 2 3 4 5
Y 5 8 11 14 17
The correlation between two variables is said to be non-linear if a unit change in one variable
results in a change in the other variable that is not at a constant rate.
Ex:
X 1 2 3 4 5
Y 5 7 10 11 13
SCATTER DIAGRAM:
If the points on the scatter diagram lie on a straight line rising from the lower left hand corner
towards the upper right hand corner, the correlation is perfect and positive i.e. +1
3. No correlation:
CORRELATION PROBLEMS
2. Find if there is any significant correlation between height and weight given below
Height in inches 57 59 62 63 64 65 55 58 57
Weight in kgs 113 117 126 126 130 129 111 116 112
X 6 8 12 15 18 20 24 28 31
Y 10 12 15 15 18 25 22 26 28
4. Making use of the data given below calculate the coefficient of correlation r12
Case A B C D E F G H
X1 10 6 9 10 12 13 11 9
X2 9 4 6 9 11 13 8 4
X 19 21 23 25 32
Y 65 66 65 68 75
6. Compute Karl Pearson’s coefficient of correlation in the following series relating to cost
of living and wages:
7. Compute Karl Pearson’s coefficient of correlation for the following ages of husbands and
wives at the time of their marriage
Age of husband (in years) 23 27 28 28 28 30 30 33 35 38
Age of wife (in years) 18 20 22 27 21 29 27 29 28 29
8. Calculate Karl Pearson’s coefficient of correlation for the following data using 20 as the
working mean for price and 70 as the working mean for demand
Price 14 16 17 18 19 20 21 22 23
Demand 84 78 70 75 66 67 62 58 60
9. Calculate Karl Pearson’s coefficient of correlation for the following data using 44 and 26
respectively as the origin of x and Y
X 43 44 46 40 44 42 45 42 38 40 42 57
Y 29 31 19 18 19 27 27 29 41 30 26 10
When actual mean is not a whole number but a fraction or when the series is large we cannot use
direct method, so we use the assumed mean method.
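The assumed mean method can be sketched in Python as follows. The notes do not state the formula, so the sketch assumes the standard form r = (N·∑dx·dy − ∑dx·∑dy) / √[(N·∑dx² − (∑dx)²)(N·∑dy² − (∑dy)²)], where dx and dy are deviations from the assumed means; the function name `pearson_r` is hypothetical. The data are the price and demand figures of problem 8, with 20 and 70 as working means.

```python
from math import sqrt

def pearson_r(xs, ys, assumed_x=0.0, assumed_y=0.0):
    """Karl Pearson's r from deviations about assumed (working) means."""
    n = len(xs)
    dx = [x - assumed_x for x in xs]
    dy = [y - assumed_y for y in ys]
    sum_dxdy = sum(a * b for a, b in zip(dx, dy))
    num = n * sum_dxdy - sum(dx) * sum(dy)
    den = sqrt((n * sum(a * a for a in dx) - sum(dx) ** 2)
               * (n * sum(b * b for b in dy) - sum(dy) ** 2))
    return num / den

price = [14, 16, 17, 18, 19, 20, 21, 22, 23]
demand = [84, 78, 70, 75, 66, 67, 62, 58, 60]

r_assumed = pearson_r(price, demand, assumed_x=20, assumed_y=70)
r_origin = pearson_r(price, demand)   # deviations taken from zero
print(round(r_assumed, 3))
```

The value of r does not depend on the working means chosen, which is why any convenient whole numbers can be used.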
11. Calculate the co-efficient of correlation between the sales and expenses from the following data
Sales (lakhs) 50 50 55 60 65 65 65 60 60
Expenses 11 13 14 16 16 15 15 14 13
12. Calculate the co-efficient of correlation taking 31 and 25 as assumed mean for x & y series for
calculation purpose.
X 23 27 28 29 30 31 33 35 36
Y 18 22 23 24 25 26 28 29 30
13. The following table is the distribution of total population and those who are totally and
partially blind among them. Find out if there is any relation between age and blindness.
Age (years) 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80
No. of persons (‘000) 100 60 40 36 24 11 6 3
Blind 55 40 40 40 36 22 18 15
14. Calculate the coefficient of correlation between age group and mortality from the
following data.
Age group 0-20 20-40 40-60 60-80 80-100
Rate of mortality 350 280 540 760 900
15. Find Karl Pearson’s co-efficient of correlation between the age and playing habit of the people from
the following information. Also mention what your calculated value indicates.
Age 15-20 20-25 25-30 30-35 35-40 40-45
Probable error-
It is a measure used to find out the reliability (or) the significance of the co-efficient of
correlation.
If the value of “r” is less than the probable error then “r” is not significant. If “r” is
more than 6 times the probable error then “r” is significant.
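The significance rule above can be sketched in Python. The notes do not give the formula for the probable error, so this sketch assumes the standard one, P.E = 0.6745 × (1 − r²) / √N. The function names and the sample values of r and N are hypothetical.

```python
from math import sqrt

def probable_error(r, n):
    """Probable error of r, assuming P.E = 0.6745 * (1 - r^2) / sqrt(N)."""
    return 0.6745 * (1 - r ** 2) / sqrt(n)

def significance(r, n):
    """Apply the rule: |r| < P.E -> not significant; |r| > 6 * P.E -> significant."""
    pe = probable_error(r, n)
    if abs(r) < pe:
        return "not significant"
    if abs(r) > 6 * pe:
        return "significant"
    return "inconclusive"   # between P.E and 6 * P.E the rule gives no verdict

print(round(probable_error(0.8, 10), 4), significance(0.8, 10))
print(round(probable_error(0.1, 10), 4), significance(0.1, 10))
```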
PROBLEMS
16. Find Karl Pearson’s coefficient of correlation from the following series of marks secured
by 10 students in a class test in mathematics and statistics.
Marks in math’s 45 70 65 30 90 40 50 75 85 60
Marks in statistics 35 90 70 40 95 40 60 80 80 50
Y 18 16 14 12 10 6 8
19. The ranks of the same 15 students in 2 subjects A and B are given below.
The 2 numbers (denoting the ranks of the same students in a and b respectively)
(1,10) (2,7) (3,2) (4,6) (5,4) (6,8) (7,3) (8,1) (9,11) (10,15) (11,9) (12,5) (13,14)
(14,12) (15,13)
Use spearman’s formula to find the rank correlation coefficient.
20. 10 competitors in a beauty contest are ranked by 3 judges in the following order
Judge 1 1 6 5 10 3 2 4 9 7 8
Judge 2 3 5 8 4 7 10 2 1 6 9
Judge 3 6 4 9 8 1 2 3 10 5 7
Use the rank correlation coefficient to determine which pair of judges has the nearest approach to
common tastes in beauty.
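For untied ranks, Spearman's rank correlation coefficient is commonly computed as 1 − 6∑d² / (n(n² − 1)), where d is the difference between the two ranks given to the same item. The notes refer to Spearman's formula without stating it, so this form is an assumption. The sketch below applies it to the three judges of problem 20; the names `spearman_rank` and `closest_pair` are illustrative.

```python
from itertools import combinations

def spearman_rank(ranks_a, ranks_b):
    """Spearman's rank correlation for untied ranks: 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    n = len(ranks_a)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

judges = {
    1: [1, 6, 5, 10, 3, 2, 4, 9, 7, 8],
    2: [3, 5, 8, 4, 7, 10, 2, 1, 6, 9],
    3: [6, 4, 9, 8, 1, 2, 3, 10, 5, 7],
}

# Coefficient for every pair of judges: (1, 2), (1, 3), (2, 3)
coeffs = {(i, j): spearman_rank(judges[i], judges[j])
          for i, j in combinations(judges, 2)}
closest_pair = max(coeffs, key=coeffs.get)
for pair, value in coeffs.items():
    print(pair, round(value, 3))
print("nearest approach:", closest_pair)
```

The pair with the highest coefficient is the one whose rankings agree most closely.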
REGRESSION ANALYSIS
In regression analysis there are two types of variables. The variable whose value is
influenced or is to be predicted is called the dependent variable and the variable which influences the
value or is used for prediction is called the independent variable.
LINES OF REGRESSION:
Line of regression is the line which gives the best estimate of one variable for any given value of
the other variable. In case of two variables X and Y, we shall have two lines of regression, one of
Y on X and the other of X on Y.
Line of regression of Y on X is the line which gives best estimate for the value of Y for
any specified value of X.
Line of regression of X on Y is the line which gives the best estimate for the value of X
for any specified value of Y.
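The two lines of regression can be sketched in Python using deviations from the actual means. The notes do not state the formulas, so the sketch assumes the usual least squares coefficients b_yx = ∑(x − x̄)(y − ȳ) / ∑(x − x̄)² and b_xy = ∑(x − x̄)(y − ȳ) / ∑(y − ȳ)². The function name `regression_lines` is hypothetical; the data are the X and Y values 1 to 5 and 2, 5, 3, 8, 7 that appear in the regression problems.

```python
def regression_lines(xs, ys):
    """Return (slope, intercept) of Y on X and of X on Y, using actual means."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    b_yx = sxy / sxx                               # regression coefficient of Y on X
    b_xy = sxy / syy                               # regression coefficient of X on Y
    y_on_x = (b_yx, y_bar - b_yx * x_bar)          # Y = intercept + b_yx * X
    x_on_y = (b_xy, x_bar - b_xy * y_bar)          # X = intercept + b_xy * Y
    return y_on_x, x_on_y

X = [1, 2, 3, 4, 5]
Y = [2, 5, 3, 8, 7]
(b_yx, a_yx), (b_xy, a_xy) = regression_lines(X, Y)
print(f"Y on X: Y = {a_yx:.2f} + {b_yx:.2f} X")
print(f"X on Y: X = {a_xy:.2f} + {b_xy:.2f} Y")
```

The first line is used to estimate Y for a given X, the second to estimate X for a given Y; the two lines coincide only when the correlation is perfect.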
REGRESSION PROBLEMS
2. Calculate the two regression equations of X on Y and Y on X from the data given below,
taking deviations from the actual means of X and Y
Price 10 12 13 12 16 15
Demand 40 38 43 45 37 43
Also estimate likely demand when the price is Rs 20
X 1 2 3 4 5
Y 2 5 3 8 7
Marks in economics (X) 25 28 35 32 31 36 29 38 34 32
Marks in statistics (Y) 43 46 49 41 36 32 31 30 33 39
5. Price indices of cotton and wool are given below for the 12 months of a year. Obtain the
equation of line of regression between the indices
MODULE-4
TIME SERIES ANALYSIS
Time series is an arrangement of statistical data in a chronological order i.e. in accordance with
its time of occurrence. Thus a time series is a set of quantitative reading of some variable
recorded at equal intervals of time. The intervals may be an hour, a day or a week or month or a
year.
E.g.: hourly temperature reading, daily sales in a shop, weekly sales in a market, monthly
production in an industry, yearly agricultural production.
It is very important in economic business planning, research work etc. because of the
following reasons.
1. It helps in understanding past behavior and it will help in estimating the future behavior.
2. It helps in planning and forecasting.
3. Comparison between the data of one period and that of another period is possible.
4. Helps to evaluate the progress in any field of economics and business activity.
There are a large number of forces affecting a time series, as a result of which there are fluctuations
in the time series. There are 4 basic types of variation and these are called components or elements of
time series.
Components
1. SECULAR TREND: The general tendency of time series data to increase or decrease
during a long period of time is called the secular trend. Trend may be upward or
downward.
E.g. Increase in population, production, prices etc.
Decrease in deaths.
[Figure: upward and downward trends of Y plotted against time, with linear trend lines Y = a + bx]
When the rate of growth of a time series remains constant in the long run, it is known as a linear
trend. It can be expressed as Y = a + bx
When the long run growth of a time series is not at a constant rate, it is called a non-linear
trend.
By measuring the trend we can find out the direction of a long term series, i.e. whether it is growing
or declining. The reason for measurement of trend is to find out the characteristics of the
series.
For e.g.: we can compare the growth in the agricultural production in one state with the
agricultural production in another state.
The following are the 4 methods which can be used for determining the trend.
1. Compute the total of the values of the 1st 3 years and place the 3 years total against the middle year.
2. Leave the 1st year value, add up the values of the next 3 years and place the 3 years
total against the middle year.
3. This process must be continued until the last year's value is taken for calculating moving
averages.
4. The three yearly totals must be divided by 3 and placed in the next column; this is the
trend value of moving average.
The formulae for calculating 3 yearly and 5 yearly moving averages are as follows, where a, b, c, d, e, f, ...
are the values for successive years.
3 years: $\frac{a+b+c}{3}, \frac{b+c+d}{3}, \frac{c+d+e}{3}, \dots$
5 years: $\frac{a+b+c+d+e}{5}, \frac{b+c+d+e+f}{5}, \dots$
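The moving average procedure for an odd period can be sketched in Python as below. The function name `moving_average` is hypothetical; the data are the gross revenue figures for 2000-2009 from the problems that follow. Each average is placed against the middle year of its period.

```python
def moving_average(years, values, period):
    """Moving averages for an odd period, keyed by the middle year of each window."""
    half = period // 2
    result = {}
    for i in range(len(values) - period + 1):
        window = values[i:i + period]                   # e.g. a, b, c then b, c, d ...
        result[years[i + half]] = sum(window) / period
    return result

years = list(range(2000, 2010))
revenue = [3, 6, 10, 8, 7, 12, 14, 14, 18, 19]

ma3 = moving_average(years, revenue, 3)
ma5 = moving_average(years, revenue, 5)
for year, value in ma3.items():
    print(year, round(value, 2))
```

No trend value is obtained for the first and last year with a 3 yearly average, or for the first two and last two years with a 5 yearly average.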
PROBLEMS:
Years 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990
No of students 15 18 17 20 23 25 29 33 36 40
2. Gross revenue data (Rs.in million) for a travel agency for a 10 year period is as follows.
Years 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
Revenue 3 6 10 8 7 12 14 14 18 19
Calculate 3 yearly moving averages for the revenue earned
3. Calculate three yearly and five yearly moving averages for the following data and
comment on the results.
Years 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000
Y 242 250 252 249 253 255 251 257 260 265 262
If the period of the moving average is even (4, 6, 8, ...), the totals cannot be placed against any year.
For a 4 yearly period the middle position, 2.5, lies between the 2nd and 3rd year, so the total should be
placed between the 2nd and 3rd year. We must center the moving averages in order to place them
against a year.
1. Compute the total of the values of the 1st 4 years & place the total in b/w the 2nd & 3rd years.
2. Leave the 1st year value, compute the total of the next 4 years & place the total in b/w
the 3rd & 4th year.
3. This process must be continued until the last year is taken into account.
4. Add the first two 4 yearly totals & place the sum against the middle year (3rd year).
5. Leave the first 4 yearly total, add the next two 4 yearly totals & place the sum against the 4th year.
6. This method must be continued until all the 4 yearly totals are used.
7. Divide each of the above sums by 8 (each is the total of two 4 yearly totals) & put it in the next
column. This is the trend value.
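The centering steps above can be sketched in Python. This is a minimal illustration, with the hypothetical function name `centered_moving_average`; the data are the 1989-1997 values from the problem that follows.

```python
def centered_moving_average(years, values, period=4):
    """Centered moving average for an even period (steps 1 to 7 above)."""
    # Steps 1-3: moving totals, each falling between two years
    totals = [sum(values[i:i + period]) for i in range(len(values) - period + 1)]
    trend = {}
    for i in range(len(totals) - 1):
        paired = totals[i] + totals[i + 1]            # steps 4-6: sum of two adjacent totals
        middle_year = years[i + period // 2]          # year the paired sum is centered on
        trend[middle_year] = paired / (2 * period)    # step 7: divide by 8 when period = 4
    return trend

years = list(range(1989, 1998))
values = [506, 620, 1036, 673, 588, 696, 1116, 738, 663]

trend = centered_moving_average(years, values)
for year, value in trend.items():
    print(year, value)
```

With nine years of data, the centered 4 yearly trend values are available only for 1991 to 1995.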
PROBLEMS:
2. From the following data, calculate the trend values using four yearly moving average
Years 1989 1990 1991 1992 1993 1994 1995 1996 1997
Values 506 620 1036 673 588 696 1116 738 663
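The method of least squares can be used to fit a linear or a non-linear trend, that is a straight
line trend or a parabolic trend. The straight line trend is $Y_c = a + bX$. When X is measured as
deviations from the middle of the period, so that $\sum X = 0$,
$a = \frac{\sum Y}{N}$ & $b = \frac{\sum XY}{\sum X^2}$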
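A minimal Python sketch of the straight line trend by least squares is shown here. It assumes that X is taken as the deviation of each year from the mean year, so that ∑X = 0 and the simplified formulas a = ∑Y/N and b = ∑XY/∑X² apply. The function name `fit_linear_trend` is hypothetical; the data are the salesmen figures for 1990-1994 from the problems below.

```python
def fit_linear_trend(years, values):
    """Fit Yc = a + b*X with X = year - mean(year), so that sum(X) = 0."""
    n = len(years)
    origin = sum(years) / n                    # middle of the period
    xs = [t - origin for t in years]           # deviations; their sum is zero
    a = sum(values) / n                        # a = sum(Y) / N
    b = sum(x * y for x, y in zip(xs, values)) / sum(x * x for x in xs)  # b = sum(XY) / sum(X^2)
    return a, b, origin

years = [1990, 1991, 1992, 1993, 1994]
salesmen = [28, 38, 46, 40, 56]

a, b, origin = fit_linear_trend(years, salesmen)
trend_values = [a + b * (t - origin) for t in years]
estimate_1995 = a + b * (1995 - origin)
print(round(a, 2), round(b, 2), round(estimate_1995, 2))
```

For an even number of years the origin falls between two years; the sketch still works because X is measured from the mean year, although hand calculations often use half year units instead.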
PROBLEMS:
1. Calculate the trend values by the method of least square from the following data given
below and estimate the sales for 1993.
2. Fit a linear trend to the following data by least square method and also estimate the
production for the year 2007.
3. The sales of a company in millions of rupees for the years 1994-2001 are given below.
Fit the linear trend equation and also estimate the sales for the year 1993.
4. Fit a straight line trend to the following data using the method of least squares and
calculate the production for the year 2001
5. The following table shows the number of salesmen working in a certain concern.
Year 1990 1991 1992 1993 1994
No of salesmen 28 38 46 40 56
Use the method of least square to fit a straight line trend and estimate the number of salesmen in
1995.
6. Fit a straight line trend to the following data using the method of least squares and project
the probable sales for the next two years.
Seasonal variation:
The objective of studying seasonal variation is to determine the effect of seasonal variation on
the value of a given phenomenon and to eliminate it, i.e. to determine what the value of the
variable would be without the seasonal effect. It is important in deciding the business policy of various firms. The time series data are
recorded monthly, quarterly, weekly, daily or hourly. There will be differences in them due to
seasonal variation. There are 4 methods of measuring it.
1. Average the data for each month or quarter over all the years.
2. Find the totals of each month or quarter.
3. Divide each total by the number of years for which data are given. If we are given monthly
data for 4 years we must 1st get the total for each month for 4 years and divide each total
by 4 to get an average.
4. Take the average of the monthly or quarterly averages as 100 and get the seasonal index as
follows.
Seasonal index = (monthly or quarterly average / average of the monthly or quarterly averages) × 100
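The four steps above can be sketched in Python. The sketch assumes the seasonal index of a quarter is its average expressed as a percentage of the average of all four quarterly averages; the variable names are illustrative. The data are the quarterly production figures for 1984-1988 from the first table below.

```python
production = {
    1984: [3.5, 3.9, 3.4, 3.6],
    1985: [3.5, 4.1, 3.7, 4.0],
    1986: [3.5, 3.9, 3.7, 4.2],
    1987: [4.0, 4.6, 3.8, 4.5],
    1988: [4.1, 4.4, 4.2, 4.5],
}

n_years = len(production)
# Steps 1-3: total for each quarter over all years, divided by the number of years
quarter_avgs = [sum(row[q] for row in production.values()) / n_years for q in range(4)]
# Step 4: take the average of the quarterly averages as 100
grand_avg = sum(quarter_avgs) / 4
seasonal_index = [100 * avg / grand_avg for avg in quarter_avgs]

for name, idx in zip(["I", "II", "III", "IV"], seasonal_index):
    print(name, round(idx, 2))
```

The four quarterly indices add up to 400, which gives a quick check on the arithmetic.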
Quarterly production
Year I II III IV
1984 3.5 3.9 3.4 3.6
1985 3.5 4.1 3.7 4.0
1986 3.5 3.9 3.7 4.2
1987 4.0 4.6 3.8 4.5
1988 4.1 4.4 4.2 4.5
Quarterly production
Year I II III IV
1990 106 124 104 90
1991 84 114 107 88
1992 90 112 101 85
1993 76 94 91 76
1994 80 104 95 83
1995 104 112 102 84
RATIO TO MOVING AVERAGE METHOD: This is an improvement over the Ratio to trend
method as it tries to eliminate the cyclic variations which are mixed up with seasonal indices in
the ratio to trend method. Ratio to moving average is the most widely used method of measuring
seasonal fluctuations.
PROBLEMS:
1. Calculate seasonal indices by the ratio to moving average method from the following data.
Years I Quarter II Quarter III Quarter IV Quarter
1991 68 62 61 63
1992 65 58 66 61
1993 68 63 63 67
2. Calculate the seasonal indices by the ‘ratio to moving average’ method from the following data.
1993 I 86 67.125
II 65 70.875
III 63 70.000
IV 80 75.375
1994 I 90 76.625
II 72 77.625
III 66 79.500
IV 85 81.500