Module 3
Data Analysis
2
Objectives of Data Analysis
• Identify disease baseline
• Analyze and interpret the data received
• Compare analysis results to determine the rising trend
• Consider the possibility of an outbreak
• Determine the strong and weak point of the district
3
Specific Learning Objectives
Identify the techniques of data analysis and importance of
interpreting the same in a scientific manner.
Outline the various sources of data in the district for disease
surveillance and management of this data to draw valid
conclusions.
Identify disease baseline for each District.
Compare analysis results with baseline to identify rising trend of
cases within District(s).
4
Source Of Data Available Under IDSP
Syndromic
case data
Lab Presumptiv
Confirmed e case
case data data
5
Data Presentation
Tabulation
AGE (in
Male Female Total
years)
0-15 14 14 28
15-30 103 50 153
30-45 108 60 168
45-60 102 52 154
60-75 58 29 87
>75 12 10 22
Total 397 215 612
6
Data Presentation
Line Diagram
7
Data Presentation
Simple Bar Graph
8
Data Presentation
Component Bar Diagram
9
Data Presentation
Multiple Bar Diagram
10
Data Presentation
Pie Chart
11
Data Presentation Spot Map
12
Type of Analysis
▫ Routine data analysis – description understaken by time, person,
place with trend analysis.
▫ Checking for crossing of threshold levels by calculation of
baseline values.
▫ Comparison between public and private
▫ Comparison between data from public health sources and lab
data
▫ Comparison between reporting units.
13
Type and Frequency of Data Analysis
Daily
S. (During Monthly/
Type of analysis Weekly Yearly
No. outbreak Quarterly
situations)
1. Description by time, place, person
including trend analysis as and when
applicable
2. Timeliness, Consistency and
Completeness of reports
3. Checking for crossing of threshold
levels
4. Comparison between reporting units
5. Comparison between public and private
6. Comparison between data from public
health sources and lab data
14
Description By Time, Place And
Person
By time
• look for changes in the number of cases over time
• Time intervals may be in years, quarters, months, weeks or other
unit of time
• Date of onset of illness is usually considered to be the most
accurate representation of disease occurrence
By place
• distribution of cases by place of occurrence of case
• helps in identifying hot spots or high-risk areas and enables the
system to do targeted interventions
By person
• Attributes like age, gender, religion used for analysis of data
15
Crossing Baseline Summary Report
• Shows the usual level of the disease which may vary over time
during the season and off season
• Baseline values are calculated as average of each week’s cases in
the previous seasons
• Data bases on Pre-existing National/Internationally developed
thresholds
• Based on historical data
• Baselines and threshold systems are different for different disease
as well as for Block, Districts & above
16
Comparison of Cases Between Reporting Units
in the region
• An important tool for the quality of the data generated by
comparing Incidence rates of the current month between various
reporting units.
• Expectation should be tailored to both sharp increases or decreases
in incidence rates unless some specific interventions are done.
• Done from the Block level and above.
• Important in understanding spread rate
17
Comparison of Cases Reported From Private
and Public Sources
• Every District should make effort to enroll as many private reporting
units.
• The data generated from two independent sources is a good proxy
indicator of the quality of data.
• Regular reporting from private health facilities should be
encouraged.
• There should be continuous efforts of sensitization of health staff at
block, district and State levels.
18
Comparison Of Reports Received From Public
Health Sources And Laboratory Sources
• Comparison increases validity
Data includes:
• The water quality reports and the cases of water borne diseases
• Routine water testing and the resultant outbreak
• The entomological data and the cases of vector borne diseases
19
Data Interpretation
• Increasing trends
• Decreasing trends
• Plateau of the graph
Report are generated at sub-center, PHC/CHC, district and state level
20
Increasing Trends
Could be a potential outbreak
Could be due to better reporting
Could be a change in the detection and reporting due to change in
case definition, establishment of new lab.
• There could be a possible data error
21
Decreasing Trends
Could indicate improved control measures
Could indicate under-reporting because of incomplete reports
• Could be a change in the detection and reporting due to change
in case definition.
• Its seasonal variations, endemicity as well as the quality of data.
22
Plateau of the Graph
Could indicate stable situation.
If the number of cases are consistently on the higher side, this
could be a cause for concern.
• Has to be corroborated with the completeness report.
• The flattening period of the disease
23
Reports to be generated at different levels of
reporting
Level of Health- Manpower responsible for analysis Report Generated
care
Sub-centre ANM, MPHW (M) No report to be generated but
expected to do simple compilation of
numbers and seeing the trend of cases
as per ‘S’ register.
PHC/CHC Medical officer To detect outbreaks and anticipating
disease trends
At District Data Manager, along with All reports 1 – 6 to be generated at
Epidemiologist under supervision of desired interval as in table no 1.
DSO.
At State Data Manager, along with All reports 1 – 6 to be generated at
Epidemiologist under supervision of desired interval as in table no 1.
SSO.
24
Interpretation & Dissemination of Results
• The duty of the surveillance officer is to correctly interpret the various
reports.
• The results of the analysis should be shared regularly on monthly basis
with all the members of the District Surveillance Unit District
Surveillance Committee.
• Interpretation of data is more important to rather than just making
graphs/tables for analysis.
• Data should be shared with the stakeholders responsible to take
appropriate action on the situation
• During outbreak and epidemics, the reports should be shared within a
week / 10 days with district administration, PRI members
25
Interpretation & Dissemination of
Results
• Findings of analyses should be reviewed regularly and feedback provided to
health providers in the community.
• District Data Manager/District Epidemiologist should analyze the data and
appraise it to District Surveillance Officers.
• Data should be cross-checked by DSO so that correct data is sent.
• Health-workers should ensure that data collected in community should
without fear and with utmost sincerity .
• Information can be presented through various tables. - giving the number of
cases as primary, with summarized data and rates.
26
Take Home Messages
• Data analysis is one of the mainstays of the surveillance programme.
• It is important that analyzed report findings to be shared with both higher
authorities and lower levels during action
• Accurate data and reasonable analysis is a powerful tool to identify potential
and real outbreaks, with focused action to prevent morbidity and mortality.
• Information should be shared at all management levels.
• Regular reporting from private health facilities should be encouraged
through continuous efforts of sensitization of health staff at block, district
and State levels