0% found this document useful (0 votes)

44 views6 pages

Data Analysis

Data analysis involves turning collected monitoring data into useful information. It requires understanding appropriate statistical methods and the context of the data collection. Common statistical analyses include comparing data to guidelines, assessing temporal and spatial trends, and modeling relationships between variables.

Uploaded by

EICQ/00154/2020 SAMUEL MWANGI RUKWARO

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views6 pages

Data Analysis

Uploaded by

EICQ/00154/2020 SAMUEL MWANGI RUKWARO

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Data analysis

Data analysis is the component of the monitoring process that turns collected data into
useful information. Analysing water/sediment quality monitoring data improves your
understanding of the system being measured and drives management actions.

Your monitoring program objectives will guide the data analysis, and they will also
determine the study design, the quantity and type of data collected and sometimes the
need for adequate computing power.

Data analysis is quantitative and can be computationally intensive. Undertaking valid

data analysis requires a good understanding of appropriate statistical methods and a
strong appreciation of the context in which data were generated and the inferences that
are required.

We provide guidance in the Water Quality Guidelines on the use of common statistical
methods to analyse water/sediment quality data. We have pitched this information at an
introductory level to help your monitoring team identify suitable methods of analysis and
interpret the results.

Data analysis process

In a typical data analysis process, your interpretation of the analysis outputs may refine
the understanding of the system and lead to changes in monitoring design and the data
that are collected in the future.

Careful data preparation before any analyses should minimise the influence of anomalies
or errors.

Before starting a comprehensive analysis, you must enter, check and securely store the
raw data. This usually involves computer database applications. The checking process
will need to capture and flag data quality issues, such as missing data, detection limits
and data entry errors. You should be able to retrace the steps taken back to the raw data.

It is essential to perform exploratory analyses and interrogation of key variables using a

variety of numerical, statistical and graphical methods. Examining the raw data can yield
value, such as helping you to identify patterns for further scrutiny or raise questions for
investigation.
Useful modelling strategies for data analysis include model-based and probability-based
analyses and inferences that may reflect how the data are collected, and the Bayesian
and frequentist (classical) approaches to data analysis.

If quantifying status or change in water quality is a monitoring objective, then that may
entail comparing sample data with guideline values. We describe how to derive guideline
values using reference data, as well as significance testing and calculation of confidence
or credible intervals to assess monitoring data against those guideline values when the
sample data are spatially and temporally independent. If spatial or temporal
independence is not likely, then this dependence may need to be accounted for in the
analysis.

We discuss analytical options to assess temporal and spatial change, such as when a
single water quality variable is considered at one site over time (temporal analysis
approaches) or at multiple sites for a particular point in time (spatial and regional analysis
approaches).

We also introduce approaches for modelling relationships between multiple variables,

including correlation analysis, multivariate and high-dimensional data analysis
techniques, and regression analyses (both parametric and nonparametric approaches).

After completing data analysis, interpretation and reporting of the key results and findings
help to complete the cycle of meeting the set monitoring program objectives.

The information we provide here is not exhaustive. Complex monitoring studies may
require a greater level of statistical sophistication and input from a professional
statistician.

A useful checklist for water quality monitoring data analysis is presented in Box 1.

Box 1: Checklist for data analysis in a water quality monitoring program

1. Before commencing analysis, have you clearly identified:

a. purpose of the data analysis exercise?
b. parameters to be estimated or hypotheses to be tested?
c. compatible data from different sources (levels of measurement, spatial scale, time
scale)?
d. objectives concerning quality and quantity of data?
e. preferred methods for the statistical or data analysis?
f. assumptions that need to be met for an appropriate application of those methods?
g. data organisation and management considerations (storage media, layout, treatment
of inconsistencies, outliers, missing observations and below detection limit data)?
2. Have data visualisation and summary methods (graphical, numerical, and tabular
summaries) been applied?
3. Have data checks been performed and ‘aberrant’ observations (potential outliers)
been identified?
4. Have statistical assumptions (e.g. non-normality, non-constant variance,
autocorrelation) been checked?
5. Have data been suitably transformed, if necessary?
6. Have data been analysed using previously identified methods. Have alternative
procedures been identified for data not amenable to particular techniques?
7. Have results of analysis been collated into a concise (statistical) summary. Have
statistical diagnostics (e.g. residual checking) been used to support the
appropriateness of the model approach?
8. Has the statistical output been carefully assessed and interpreted in the context of the
objectives?
9. Have the objectives been addressed? If not, you may need to redesign the study,
collect new or additional data, refine the conceptual models and re-analyse the data.

Planning for data analysis

Data types, quantities and methods of statistical analysis need to be considered
collectively and at the early planning stages of any monitoring strategy. You must
make study design decisions about measurement scales, frequency of data collection,
level of replication and spatial and temporal coverage so that data of sufficient quality
and quantity are collected for subsequent statistical analysis.

It is important for your monitoring team to avoid the ‘data rich–information poor’
syndrome of collecting data that will not be subsequently analysed or that do not address
the monitoring program objectives.

Given the costs associated with the data collection process, it is imperative for the
monitoring team to use formal quality assurance and quality control (QA/QC)
procedures to ensure the integrity of the data. These procedures should be supported
by established back-up and archival processes. Seriously consider the archival medium
to be used because rapid advances in computer technology tend to increase the speed
at which equipment becomes obsolete.

QA/QC of any monitoring program should include:

 data analysis
 field sampling and measurement practices
 laboratory analysis.

Before statistically analysing the monitoring data, you should use standard methods of
data summary, presentation and outlier checking to help identify ‘aberrant’ observations.
If undetected, these data values can have profound effects on subsequent statistical
analyses and can lead to incorrect conclusions and flawed decision-making.

Develop a plan of the sequence of actions that the monitoring team will use for the
statistical analysis of the water quality data. Only some of the many available statistical
techniques need be used.

An initial focus of monitoring might be to assess water quality against a guideline value or
to detect trends. An ultimate objective of the data analysis exercise will probably be to
increase your team’s understanding of the natural system under investigation. Improved
understanding should result in more informed decision-making, which in turn will lead to
better environmental management.

One of the most significant challenges for the data analysis phase is to extract a ‘signal’
from an inherently noisy environment.

We can categorise monitoring study designs as:

 descriptive studies, including audit monitoring

 studies for the measurement of change, including assessment of water quality
against a guideline value (which can also be categorised as descriptive)
 studies for system understanding, including cause-and-effect studies and general
investigations of environmental processes.

The statistical requirements for the descriptive studies are less complex than for the other
2 categories in which more detailed inferences are being sought.
Most of the statistical methods that we present here are based on classical tools of
statistical inference (e.g. analysis of variance, t-tests, F-tests). These methods have
served researchers from a variety of backgrounds and disciplines extremely well over
many years but people have concerns about their utility for environmental sampling and
assessment. When measuring natural ecosystems or processes, it is invariably hard to
justify the assumptions that:

 response variables are normally distributed

 variance is constant in space and time
 observations are uncorrelated.

In these cases, remedial action (e.g. data transformations) may overcome some of the
difficulties but it is more probable that an alternative statistical approach is needed.

For example, generalised linear models (GLMs) are often more suitable than classical
analysis of variance (ANOVA) techniques for the analysis of count data because of their
inherent recognition and treatment of a non-normal response variable.

Many researchers have questioned the utility and appropriateness of statistical

significance testing for environmental assessment (e.g. McBride et al. 1993, Johnson
1999); refer to Bayesian versus frequentist approaches.

A large number of introductory and advanced texts of statistical methods are available
(e.g. Ott 1984, Helsel & Hirsch 2002). For a relatively detailed and comprehensive
description of statistical techniques used in water quality management studies, refer to
Helsel & Hirsch (2002) and McBride (2005). More general resources, such as the
Encyclopedia of Environmetrics (El-Shaarawi & Piegorsch 2014), provide extensive
coverage on the development and application of quantitative methods in the
environmental sciences.

Statistical software for data analysis

So many statistical software tools are available and it is beyond the scope of the Water
Quality Guidelines to review them all.

For example, Wikipedia compares various statistical packages. These packages support
different statistical techniques and different features, such as size of datasets handled,
graphical representations, database interfaces, linkages to other software and the level of
expertise needed to use them.

Many software tools provide a high level of functionality and technical sophistication but
they also lend themselves to abuse through blind application. It is important to scrutinise
both the output and the choice of technique by asking yourself:

 Does the analysis make sense?

 Is it consistent with what has been observed in the exploratory data analysis?

If these questions are not routinely asked, then you run the risk that the ‘mental models’
of your monitoring team may have undue influence on the outcome. Results are more
likely to be accepted — even if they occur for the wrong reasons — when they match the
expectations of those who are interpreting the analysis.

Next steps:

 Data preparation and exploration

 Data modelling approaches
 Derivation and assessment against guideline values
 Data analysis considerations for ecosystem receptors
 Assessment of change through data analysis
 Correlation between water quality variables

Sco417 - Gis Exam
67% (3)
Sco417 - Gis Exam
2 pages
Unit 05: Data Preparation & Analysis
100% (1)
Unit 05: Data Preparation & Analysis
26 pages
QAQC - Data Quality VAM
100% (1)
QAQC - Data Quality VAM
38 pages
Water Quality Data Analysis Guide
100% (1)
Water Quality Data Analysis Guide
8 pages
Notes in Environmental Data Analysis
100% (1)
Notes in Environmental Data Analysis
11 pages
ISPFL9 Module2
No ratings yet
ISPFL9 Module2
21 pages
QAQC
No ratings yet
QAQC
16 pages
Interpretation of Data
No ratings yet
Interpretation of Data
12 pages
Lecture Notes On STATISTICAL ANALYSIS OF WATER QUALITY DATA
No ratings yet
Lecture Notes On STATISTICAL ANALYSIS OF WATER QUALITY DATA
14 pages
Maths Project
No ratings yet
Maths Project
23 pages
5 Methods and Means of Analysis: 1 General
No ratings yet
5 Methods and Means of Analysis: 1 General
3 pages
Water Quality Monitoring Guide
100% (1)
Water Quality Monitoring Guide
35 pages
Environmental Monitoring Data Management Systems
No ratings yet
Environmental Monitoring Data Management Systems
72 pages
Public Health Engineering Assignment
No ratings yet
Public Health Engineering Assignment
7 pages
REV 213 - Combined 2023-2024
No ratings yet
REV 213 - Combined 2023-2024
147 pages
Summary - Lifecycle of Data Analysis - 3982
No ratings yet
Summary - Lifecycle of Data Analysis - 3982
7 pages
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
No ratings yet
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
19 pages
Basic Data Analysis
No ratings yet
Basic Data Analysis
16 pages
ANALYSING EVALUATION RESULTS - Class Notes
No ratings yet
ANALYSING EVALUATION RESULTS - Class Notes
9 pages
Estatistica DadosAmbientais
No ratings yet
Estatistica DadosAmbientais
169 pages
Data Analysis for Researchers
No ratings yet
Data Analysis for Researchers
4 pages
5006-9 Usama Junaid
No ratings yet
5006-9 Usama Junaid
18 pages
Citizen Monitoring QA/QC Guide
No ratings yet
Citizen Monitoring QA/QC Guide
2 pages
What Is Data Visualization and Why Is It Important
No ratings yet
What Is Data Visualization and Why Is It Important
18 pages
Groundwater Quality Analysis
No ratings yet
Groundwater Quality Analysis
25 pages
Term2 Datascience Notes
No ratings yet
Term2 Datascience Notes
8 pages
EDA
100% (1)
EDA
9 pages
Executive Masterclass ?? ?????????? ?????? ??? ?????? ????????
No ratings yet
Executive Masterclass ?? ?????????? ?????? ??? ?????? ????????
87 pages
WATERQUALITYANALYSIS
No ratings yet
WATERQUALITYANALYSIS
7 pages
Data Analysis: Analysis of Data Is A Process of Inspecting, Cleaning, Transforming, and Modeling
No ratings yet
Data Analysis: Analysis of Data Is A Process of Inspecting, Cleaning, Transforming, and Modeling
6 pages
Lecture Notes Course 3
No ratings yet
Lecture Notes Course 3
57 pages
Data Analysis for Field Researchers
No ratings yet
Data Analysis for Field Researchers
7 pages
PR2 M8 Fundamentals of Data
No ratings yet
PR2 M8 Fundamentals of Data
9 pages
Chapter 1.3 - Data Collection
No ratings yet
Chapter 1.3 - Data Collection
6 pages
Introduction To Methods of Data Analysis
No ratings yet
Introduction To Methods of Data Analysis
2 pages
Data Analysis A Beginner Guide
No ratings yet
Data Analysis A Beginner Guide
1 page
Abbasi Presentation
No ratings yet
Abbasi Presentation
9 pages
Dcova Framework
No ratings yet
Dcova Framework
7 pages
Analyzing Data - EvalCommunity EvalCommunity
No ratings yet
Analyzing Data - EvalCommunity EvalCommunity
8 pages
Data Analysis 2
No ratings yet
Data Analysis 2
8 pages
Data Analysis Notes
No ratings yet
Data Analysis Notes
29 pages
Unit 2
No ratings yet
Unit 2
58 pages
Preparing To Collect Engineering Data
No ratings yet
Preparing To Collect Engineering Data
15 pages
Data Analysis: Types, Process, Methods, Techniques and Tools
No ratings yet
Data Analysis: Types, Process, Methods, Techniques and Tools
6 pages
Nuclear Decommissioning DQOs
No ratings yet
Nuclear Decommissioning DQOs
72 pages
Rma Midterm Reviewer
No ratings yet
Rma Midterm Reviewer
11 pages
DSBD
No ratings yet
DSBD
23 pages
Week 2 - Data Analytics Life Cycle
No ratings yet
Week 2 - Data Analytics Life Cycle
41 pages
BDA-24 - Lect (3-4) - (Fundamentals of Data Analysis)
No ratings yet
BDA-24 - Lect (3-4) - (Fundamentals of Data Analysis)
15 pages
33007349
No ratings yet
33007349
22 pages
Assignment Statistics Ankit Agarwal May 2024
No ratings yet
Assignment Statistics Ankit Agarwal May 2024
10 pages
Guidance Manual For Optimizing Water Quality Monitoring Program Design
No ratings yet
Guidance Manual For Optimizing Water Quality Monitoring Program Design
88 pages
HMIS Handouts
No ratings yet
HMIS Handouts
10 pages
Data Collection, Analysis & Interpretations-1
No ratings yet
Data Collection, Analysis & Interpretations-1
34 pages
DAC Phase5
No ratings yet
DAC Phase5
12 pages
Methodology For The Development of Organizational Studies
No ratings yet
Methodology For The Development of Organizational Studies
16 pages
Data Science - III
No ratings yet
Data Science - III
94 pages
? What Is Data Analysis
No ratings yet
? What Is Data Analysis
2 pages
Research Lecture 17
No ratings yet
Research Lecture 17
49 pages
Data Analysis and Interpretation PDF
No ratings yet
Data Analysis and Interpretation PDF
13 pages
Research Methods 1
No ratings yet
Research Methods 1
2 pages
Kinap Student Guide and Declaration Form
No ratings yet
Kinap Student Guide and Declaration Form
14 pages
W.R. - Earth Dams and Spillways
No ratings yet
W.R. - Earth Dams and Spillways
20 pages
EECQ - 4242 - Systems Concept
No ratings yet
EECQ - 4242 - Systems Concept
5 pages
Lecture 1 Highway Geometric Design
No ratings yet
Lecture 1 Highway Geometric Design
9 pages
Cse 421 - Foundation Engineering
No ratings yet
Cse 421 - Foundation Engineering
4 pages
Cse 342 - Highway Geometric Design
No ratings yet
Cse 342 - Highway Geometric Design
3 pages
EECQ - 4242 - Probability in Hydrology
No ratings yet
EECQ - 4242 - Probability in Hydrology
13 pages
Cse 342
No ratings yet
Cse 342
4 pages
Lumped Flow Routing in Hydrology
No ratings yet
Lumped Flow Routing in Hydrology
13 pages
Eecq 4141 Tutorial Sheet 2
No ratings yet
Eecq 4141 Tutorial Sheet 2
2 pages
Retaining Walls
No ratings yet
Retaining Walls
20 pages
Eecq 4141 Tutorial Sheet 4
No ratings yet
Eecq 4141 Tutorial Sheet 4
2 pages
Dce 081 - Irrigation and Drainage Engineering
No ratings yet
Dce 081 - Irrigation and Drainage Engineering
3 pages
EECQ 4142 Precipitation
No ratings yet
EECQ 4142 Precipitation
23 pages
Cse 342
No ratings yet
Cse 342
4 pages
Bearing Capacity
No ratings yet
Bearing Capacity
18 pages
CSC 201
No ratings yet
CSC 201
5 pages
Introduction To Engineering Lesson3
No ratings yet
Introduction To Engineering Lesson3
7 pages
Eecq 4141 Tutorial Sheet 1
No ratings yet
Eecq 4141 Tutorial Sheet 1
1 page
Centrifugal Pump - Manometric Head 2
No ratings yet
Centrifugal Pump - Manometric Head 2
26 pages
GGE 4203 Intro To GIS & Remote Sensing Exam - 2021
100% (1)
GGE 4203 Intro To GIS & Remote Sensing Exam - 2021
2 pages
EECQ 4142 Surfacewater
No ratings yet
EECQ 4142 Surfacewater
24 pages
EECQ 4142 Soilwater
No ratings yet
EECQ 4142 Soilwater
13 pages
Spatial Interpolation Notes
100% (1)
Spatial Interpolation Notes
6 pages
7 - GGI - 4203-GIS and Remote Sensing
No ratings yet
7 - GGI - 4203-GIS and Remote Sensing
2 pages

Data Analysis

Uploaded by

Data Analysis

Uploaded by

Data analysis

Data analysis is quantitative and can be computationally intensive. Undertaking valid

Data analysis process

It is essential to perform exploratory analyses and interrogation of key variables using a

We also introduce approaches for modelling relationships between multiple variables,

Box 1: Checklist for data analysis in a water quality monitoring program

1. Before commencing analysis, have you clearly identified:

Planning for data analysis

QA/QC of any monitoring program should include:

We can categorise monitoring study designs as:

 descriptive studies, including audit monitoring

 response variables are normally distributed

Many researchers have questioned the utility and appropriateness of statistical

Statistical software for data analysis

 Does the analysis make sense?

 Data preparation and exploration

You might also like