Chapter 1
OBTAINING DATA
Introduction
Statistics may be defined as the science that deals with the collection, organization, presentation, analysis, and interpretation of data in order to draw judgments or conclusions that help in the decision-making process. The two parts of this definition correspond to the two main divisions of Statistics: Descriptive Statistics and Inferential Statistics. Descriptive Statistics, referred to in the first part of the definition, deals with the procedures that organize, summarize, and describe quantitative data; it seeks merely to describe data. Inferential Statistics, implied in the second part of the definition, deals with making a judgment or conclusion about a population based on the findings from a sample taken from that population.
Intended Learning Outcomes
At the end of this module, it is expected that the students will be able to:
1. Demonstrate an understanding of the different methods of obtaining data.
2. Explain the procedures in planning and conducting surveys and experiments.
Statistical Terms
Before proceeding to the discussion of the different methods of obtaining data, let us first define some statistical terms:
Population or Universe refers to the totality of objects, persons, places, or things used in a particular study: all members of a particular group of objects (items) or people (individuals) who are the subjects or respondents of the study.
Sample is any subset of a population, or a few members of a population.
Data are facts, figures and information collected on some characteristics of a population
or sample. These can be classified as qualitative or quantitative data.
Ungrouped (or raw) data are data that are not organized in any specific way; they are simply the collection of data as they are gathered.
Grouped Data are raw data organized into groups or categories with corresponding frequencies. Organized in this manner, the data are referred to as a frequency distribution.
Parameter is a descriptive measure of a characteristic of a population.
Statistic is a measure of a characteristic of a sample.
Constant is a characteristic or property of a population or sample which is common to all
members of the group.
Variable is a measure, characteristic, or property of a population or sample that may take a number of different values. It differentiates a particular member from the rest of the group and is the characteristic or property that is measured, controlled, or manipulated in research. Variables differ in many respects, most notably in the role they play in the research and in the type of measures that can be applied to them.
1.1 Methods of Data Collection
Collection of data is the first step in conducting a statistical inquiry. It refers to data gathering, a systematic method of collecting and measuring data from different sources of information in order to provide answers to relevant questions. This involves acquiring information from published literature, surveys through questionnaires or interviews, experimentation, documents and records, tests or examinations, and other data-gathering instruments. The person who conducts the inquiry is an investigator, the one who helps in collecting the information is an enumerator, and the information is collected from a respondent. Data can be primary or secondary. According to Wessel, “Data collected in the process of investigation are known as primary data.” These are collected from the primary source for the investigator’s own use. Secondary data, on the other hand, are collected by some other organization for its own use, but the investigator also obtains them for his use. According to M.M. Blair, “Secondary data are those already in existence for some other purpose than answering the question in hand.”
In the field of engineering, the three basic methods of collecting data are the retrospective study, the observational study, and the designed experiment. A retrospective study uses the population or a sample of historical data that has been archived over some period of time. It may involve a significant amount of data, but those data may contain relatively little useful information about the problem: some relevant data may be missing, recording or transcription errors may be present, or other important data may never have been gathered and archived. As a result, statistical analysis of historical data can identify interesting phenomena, but solid and reliable explanations are difficult to obtain.
In an observational study, by contrast, a process or population is observed and disturbed as little as possible, and the quantities of interest are recorded. In a designed experiment, deliberate or purposeful changes are made in the controllable variables of the system or process, the resulting output data are observed, and an inference or decision is made about which variables are responsible for the observed changes in output performance. Experiments designed with basic principles such as randomization
are needed to establish cause-and-effect relationships. Much of what we know in the
engineering and physical-chemical sciences is developed through testing or
experimentation. In engineering, there are problem areas in which no scientific or engineering theory is directly or completely applicable, so experimentation and observation of the resulting data are the only way to solve them. At other times there is a good underlying scientific theory to explain the phenomena of interest, yet tests or experiments are almost always still necessary to confirm the applicability and validity of the theory in a specific situation or environment. Designed experiments are very important in engineering design and development and in the improvement of manufacturing processes, in which statistical thinking and statistical methods play an important role in planning, conducting, and analyzing the data. (Montgomery, et al., 2018)
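As a small sketch of the randomization principle described above, the following Python snippet randomly assigns experimental units to treatment groups. The unit labels and temperature settings are hypothetical, invented only for illustration:

```python
import random

def randomize_assignment(units, treatments, seed=None):
    """Randomly assign each experimental unit to a treatment group.

    Randomization helps establish cause-and-effect relationships by
    preventing systematic bias in which units receive which treatment.
    """
    rng = random.Random(seed)
    shuffled = units[:]          # copy so the original list is untouched
    rng.shuffle(shuffled)
    # Deal the shuffled units round-robin into the treatment groups.
    groups = {t: [] for t in treatments}
    for i, unit in enumerate(shuffled):
        groups[treatments[i % len(treatments)]].append(unit)
    return groups

# Example: six test specimens assigned at random to two temperature settings.
groups = randomize_assignment(["S1", "S2", "S3", "S4", "S5", "S6"],
                              ["150C", "180C"], seed=42)
```

Because the shuffle is random, neither group is systematically favored; the fixed seed here only makes the illustration reproducible.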
1.2 Planning and Conducting Surveys
A survey is a method of asking respondents well-constructed questions. It is an efficient and easy-to-administer way of collecting a wide variety of information. The researcher can stay focused and stick to the questions that interest him and are necessary to his statistical inquiry or study.
However, surveys depend on the respondents' honesty, motivation, memory, and ability to respond, and answers can sometimes yield vague data. Surveys can be done
through face-to-face interviews or self-administered through the use of questionnaires.
The advantages of face-to-face interviews include fewer misunderstood questions, fewer
incomplete responses, higher response rates, and greater control over the environment
in which the survey is administered; also, the researcher can collect additional information
if any of the respondents’ answers need clarifying. The disadvantages of face-to-face
interviews are that they can be expensive and time-consuming and may require a large
staff of trained interviewers. In addition, responses can be biased by the appearance or attitude of the interviewer.
Self-administered surveys are less expensive than interviews. They can be administered in large numbers, do not require many interviewers, and put less pressure on respondents. However, in self-administered surveys, respondents are more likely to stop participating midway through the survey, and the researcher cannot ask them to clarify their answers. Response rates are also lower than in personal interviews.
When designing a survey, the following steps are useful:
1. Determine the objectives of your survey: What questions do you want to answer?
2. Identify the target population sample: Whom will you interview? Who will be the
respondents? What sampling method will you use?
3. Choose an interviewing method: face-to-face interview, phone interview, self-
administered paper survey, or internet survey.
5. Decide what questions you will ask, in what order, and how to phrase them.
5. Conduct the interview and collect the information.
6. Analyze the results by making graphs and drawing conclusions.
In choosing the respondents, sampling techniques are necessary. Sampling is the process of selecting units (e.g., people, organizations) from a population of interest. The sample must be representative of the target population, which is the entire group the researcher is interested in: the group about which the researcher wishes to draw conclusions.
There are two ways of selecting a sample: non-probability sampling and probability sampling.
Non-Probability Sampling
Non-probability sampling is also called judgment or subjective sampling. This
method is convenient and economical but the inferences made based on the findings are
not so reliable. The most common types of non-probability sampling are convenience sampling, purposive sampling, and quota sampling.
In convenience sampling, the researcher obtains information from the respondents who are most conveniently available. This favors the researcher's convenience but can introduce bias into the sample.
In purposive sampling, the selection of respondents is predetermined according to
the characteristic of interest made by the researcher. Randomization is absent in this type
of sampling.
There are two types of quota sampling: proportional and non-proportional. In proportional quota sampling, the major characteristics of the population are represented by sampling a proportional amount of each.
For instance, if you know the population has 40% women and 60% men, and that
you want a total sample size of 100, you will continue sampling until you get those
percentages and then you will stop.
Non-proportional quota sampling is a bit less restrictive. In this method, a minimum number of sampled units is specified for each category, without concern for having numbers that match the proportions in the population.
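The quota idea above can be sketched in Python. This is a hypothetical illustration: the sampling frame, category labels, and quotas are invented, and members are taken in the order encountered (no randomization), as is typical of quota sampling:

```python
def quota_sample(frame, quotas):
    """Fill quotas by taking members in the order encountered (non-random).

    `frame` is a list of (member, category) pairs; `quotas` maps each
    category to the number of members required from it.
    """
    remaining = dict(quotas)
    sample = []
    for member, category in frame:
        if remaining.get(category, 0) > 0:
            sample.append(member)
            remaining[category] -= 1
        if all(v == 0 for v in remaining.values()):
            break  # all quotas filled; stop sampling
    return sample

# Proportional quotas for a sample of 10 from a 40% women / 60% men
# population: keep sampling until 4 women and 6 men are obtained.
frame = [(f"P{i}", "woman" if i % 2 == 0 else "man") for i in range(40)]
picked = quota_sample(frame, {"woman": 4, "man": 6})
```

Once a category's quota is full, further members of that category are skipped, mirroring the "continue sampling until you get those percentages and then stop" rule in the text.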
Probability Sampling
In probability sampling, every member of the population has a known, nonzero chance of being selected as part of the sample. There are several probability sampling techniques. Among
these are simple random sampling, stratified sampling and cluster sampling.
Simple Random Sampling
Simple random sampling is the basic sampling technique where a group of
subjects (a sample) is selected for study from a larger group (a population). Each
individual is chosen entirely by chance and each member of the population has an equal
chance of being included in the sample. Every possible sample of a given size has the
same chance of selection; i.e. each member of the population is equally likely to be
chosen at any stage in the sampling process.
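A minimal sketch of simple random sampling in Python, using a hypothetical population of 100 labeled units; `random.sample` draws without replacement, so every subset of the requested size is equally likely:

```python
import random

population = list(range(1, 101))   # hypothetical population: units labeled 1..100
random.seed(7)                     # fixed seed only to make the draw reproducible
sample = random.sample(population, 10)  # every 10-element subset is equally likely
```

In practice the seed would be omitted (or drawn from a physical source of randomness) so the selection is unpredictable.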
Stratified Sampling
There may often be factors which divide up the population into sub-populations
(groups / strata) and the measurement of interest may vary among the different sub-
populations. This has to be accounted for when a sample from the population is selected
in order to obtain a sample that is representative of the population. This is achieved by
stratified sampling.
A stratified sample is obtained by taking samples from each stratum or sub-group
of a population. When a sample is to be taken from a population with several strata, the
proportion of each stratum in the sample should be the same as in the population.
Stratified sampling techniques are generally used when the population is
heterogeneous, or dissimilar, where certain homogeneous, or similar, sub-populations
can be isolated (strata). Simple random sampling is most appropriate when the entire
population from which the sample is taken is homogeneous. Some reasons for using
stratified sampling over simple random sampling are:
1. the cost per observation in the survey may be reduced;
2. estimates of the population parameters may be wanted for each subpopulation;
3. accuracy may be increased at a given cost.
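The proportional-allocation rule described above can be sketched in Python. The strata and their sizes here are hypothetical; each stratum contributes to the sample in proportion to its share of the population:

```python
import random

def stratified_sample(strata, total_size, seed=None):
    """Draw a proportionally allocated stratified sample.

    `strata` maps each stratum name to its list of members; each
    stratum's share of the sample matches its share of the population
    (up to rounding).
    """
    rng = random.Random(seed)
    population_size = sum(len(members) for members in strata.values())
    sample = {}
    for name, members in strata.items():
        n = round(total_size * len(members) / population_size)
        sample[name] = rng.sample(members, n)  # simple random sample per stratum
    return sample

# Hypothetical population: 60 freshmen and 40 sophomores; sample of 10
# should contain 6 freshmen and 4 sophomores.
strata = {"freshman": [f"F{i}" for i in range(60)],
          "sophomore": [f"S{i}" for i in range(40)]}
s = stratified_sample(strata, 10, seed=1)
```

Within each stratum the draw is a simple random sample, which matches the idea that stratification is applied on top of random selection.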
Cluster Sampling
Cluster sampling is a sampling technique where the entire population is divided
into groups, or clusters, and a random sample of these clusters is selected. All
observations in the selected clusters are included in the sample.
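A brief sketch of cluster sampling in Python, with hypothetical clusters (for example, villages of households); whole clusters are drawn at random and every member of a selected cluster enters the sample:

```python
import random

def cluster_sample(clusters, n_clusters, seed=None):
    """Randomly select whole clusters and include every member of each.

    `clusters` maps each cluster name to its list of members.
    """
    rng = random.Random(seed)
    chosen = rng.sample(list(clusters), n_clusters)  # pick cluster names at random
    sample = []
    for name in chosen:
        sample.extend(clusters[name])  # all observations in a chosen cluster
    return sample

# Hypothetical clusters of unequal sizes, e.g., villages of households.
clusters = {"A": ["A1", "A2", "A3"], "B": ["B1", "B2"],
            "C": ["C1", "C2", "C3", "C4"], "D": ["D1"]}
households = cluster_sample(clusters, 2, seed=3)
```

Note the contrast with stratified sampling: there, some members are drawn from every stratum; here, all members are taken from only the selected clusters.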
1.3 Planning and Conducting Experiments: Introduction to Design of Experiments
The products and processes in the engineering and scientific disciplines are mostly
derived from experimentation. An experiment is a series of tests conducted in a
systematic manner to increase the understanding of an existing process or to explore a
new product or process. Design of Experiments, or DOE, is a tool to develop an
experimentation strategy that maximizes learning using minimum resources. Design of
Experiments is widely and extensively used by engineers and scientists to improve existing processes by maximizing the yield and decreasing the variability, or to develop new products and processes. It is a technique for identifying the "vital few" factors in the most efficient manner and then directing the process to its best setting to meet the ever-increasing demand for improved quality and increased productivity.
The methodology of DOE ensures that all factors and their interactions are systematically investigated, resulting in reliable and complete information. There are five
stages to be carried out for the design of experiments. These are planning, screening,
optimization, robustness testing and verification.
1. Planning
It is important to carefully plan the course of experimentation before embarking on the process of testing and data collection. At this stage, the objectives of the experiment or investigation are identified, and the time and resources available to achieve them are assessed. The team conducting the investigation should be composed of individuals from the different disciplines related to the product or process. They are to identify the possible factors to investigate and the most appropriate responses to measure. A team approach promotes synergy, giving a richer set of factors to study and thus a more complete experiment. Carefully planned experiments always lead to increased understanding of the product or process, and well-planned experiments are easy to execute and analyze using available statistical software.
2. Screening
Screening experiments are used to identify, out of a large pool of potential factors, the important factors that affect the process under investigation. The screening process eliminates unimportant factors so that attention can be focused on the key factors. Screening experiments are usually efficient designs that require few runs and focus on the vital factors rather than on interactions.
3. Optimization
After the important factors affecting the process have been narrowed down, the next step is to determine the best settings of these factors to achieve the objectives of the investigation. Depending on the product or process under investigation, the objective may be to increase yield, to decrease variability, or to find settings that achieve both at the same time.
4. Robustness Testing
Once the optimal settings of the factors have been determined, it is important to make the
product or process insensitive to variations resulting from changes in factors that affect
the process but are beyond the control of the analyst. Such factors are referred to as
noise or uncontrollable factors that are likely to be experienced in the application
environment. It is important to identify such sources of variation and take measures to
ensure that the product or process is made robust or insensitive to these factors.
5. Verification
This final stage involves validating the optimum settings by conducting a few follow-up experimental runs to confirm that the process functions as expected and that all objectives are achieved.