CE 006 - ENGINEERING DATA ANALYSIS 2ND SEMESTER (AY 2019-2020)
STATISTICS
- is the scientific discipline that provides methods to help us POPULATION AND SAMPLE
make sense of data.
1. POPULATION - the entire collection of individuals or objects
- is about understanding the role that variability plays in about which information is desired.
drawing conclusions based on data.
2. SAMPLE - is a subset of the population, selected for the
- is the science of collecting, organizing, presenting, study.
analyzing and interpreting numerical data.
- has the same meaning as the Latin word datum which EXAMPLES
means a fact or information. ( The plural form of datum is 1. POPULATION
data.)
- Scores of entire students of 2nd year standing civil engineering students
- can refer to the mere tabulation of numeric information as in - All children of any age who have older or younger siblings
2. SAMPLE
reports of stock market transactions, or the body - Scores of students in a section
techniques used in processing or analyzing data.
- The 40 children who actually participated in one specific study about
siblings
APPLICATION OF STATISTICS
- Today statistics is a necessity in studies and researches.
- Some of the most important subject areas which make use DEFINITION
of statistical theory and techniques are as follows:
agriculture, automatic data processing, biology, business, 1. VARIABLE
- Any characteristic whose value may change from one individual or object
demography, economics, education, engineering, to another or from one observation to another.
entertainment, environmental studies, fisheries, - Any characteristic to be measured on each individual such as
government, health, insurance, manufacturing, medicine, compressive strength, or specific gravity; or a characteristic to be
observed such as one’s locality.
psychology, research and statistics, social sciences, sports,
etc.
a. INDEPENDENT VARIABLE
- In general, uses of statistics may be described by the - The variable who is the predictor of the dependent variable
b. DEPENDENT VARIABLE
following statements:
- The variable whose value is being predicted
• Statistics can give precise description of data.
EXAMPLE: To predict the value of compressive strength of concrete,
• Statistics can predict the outcome of experiment or the the dependent variable is the compressive strength of concrete
behavior of an individual.
while the independent variable is the amount of cement added to
the mix design.
• Statistics can be used to test a hypothesis.
2. DATA
- A collection of observations either on single variable or simultaneously
CATEGORIES OF STATISTICS on two or more variables.
- Results from taking measurements or observations.
1. DESCRIPTIVE STATISTICS
- Observing variable yields data.
- the branch of statistics that includes methods for - SET OF DATA is a collection of values for a particular variable.
organizing and summarizing data.
- the statistician tries to describe the situation
2. INFERENTIAL STATISTICS
- the branch of statistics that involves generalizing from a UNIVARIATE DATA SET
sample to the population from which the sample was - Consists of observations on a single variable made on
selected and assessing the reliability of such individuals in a sample or population
generalizations.
- its main concern is to analyze the organized data leading TYPES OF UNIVARIATE DATA SET
to prediction or inferences.
1. CATEGORICAL DATA SET (QUALITATIVE)
- Individual observations are categorical responses
EXAMPLES - Represents differences in quality, character, or kind but
not in amount
1. DESCRIPTIVE STATISTICS
- A union leader measures the attitudes of the 248 steelworkers in his - Yields from non-numeric variables (e.g. sex, birthplace,
union chapter towards the proposition that unemployment benefits religious preference, texture, etc)
be extended by asking them if they approve or disapprove of 2. NUMERICAL DATA SET (QUANTITATIVE)
extending such benefits.
- An economists attempts to measure the quality of life in the - Observation is a number
Philippines’ 100 largest metropolitan areas by combining several - Numerical in nature and can be ranked
indices of cost of living, crime, employment, and so on.
- 99% of the population has height between 66 and 74 inches
- Yields from numeric variables (e.g. weight, height, test
2. INFERENTIAL STATISTICS
scores, speed, temperature, etc)
- A clinical psychologist analyzes the dreams of 10 children who have
been physically abused to see whether or not images of the abuse
appear in their dreams.
TYPES OF NUMERICAL DATA
- A researcher is interested in the issue of whether men and women 1. DISCRETE DATA
are paid equal amounts for equal work. Accordingly, he surveys 39 - Possible values of the variable correspond to isolated
companies and compares the pay of male and female workers at
these companies, statistically equating for the company, years of points on the number line.
experience, and job classification.
- Observations are determined by counting
- Wearing seatbelts increases the chance of survival in automobile - Results from variables whose values can be counted
accidents.
using integral values (e.g. number of participants,
number of cars, etc)
ENGR. L.V. YNGENTE 1
CE 006 - ENGINEERING DATA ANALYSIS 2ND SEMESTER (AY 2019-2020)
2. CONTINUOUS DATA
3. INTERVAL DATA
- Possible values forms and entire interval on the number - Indicate an actual amount and there is equal unit of
line.
measurement separating each score, specifically equal
- Observations involves making measurements, as intervals.
opposed to counting
- Does not only include “greater than” and “less than”
- Results from variable that yields decimals or fraction
relationships, but also has a limit of measurement that
permits us to describe how much more or less one
NUMBER LINE object possesses than another.
4. RATIO SCALES
- Similar to interval data, but has an absolute zero and
multiples are meaningful
DISCRETE - RATIO DATA include all the usual measurements of
length, height, weight, area, volume, density, velocity,
duration, etc.
EXAMPLES
CONTINUOUS
1. NOMINAL DATA
- Meralco may list 5 types of electric consumption. The company might
use “1” to identify residential, “2” for commercial, “3” for industrial, “4”
for government, and “5” for others.
- A report on the number of persons involved in different religions in a
certain school.
EXAMPLE
- Sample of college instruction classified according to subject thought.
- Classifying survey subjects as male or female (gender).
2. ORDINAL DATA
- The intelligence of children may be above average, average, or below
average.
- Social class or incomes
- Contest (1st, 2nd, 3rd places)
- Income category (high, average, low)
3. INTERVAL DATA
- The Fahrenheit temperature scale. Suppose the following temperature
ratings were recorded: 87⁰F. 91⁰F, and 100⁰F. We can say 91⁰F is colder
than 150⁰F. We can’t write 87⁰F < 100⁰F; 100⁰F - 91⁰F.
SCALES OF MEASUREMENT OF DATA
- Aptitude test scores
- Relates to the rules used to assign scores and is an 4. RATIO SCALES
- Speed of production line
indicator of the kind of information that the scores provide.
- If the weight is 0, then it represents a complete absence of weight.
- The scale to which measurement belong will be important in - If A earns 1000PhP a day and B earns 2000PhP a day, then B earns
twice as much as A.
determining appropriate methods for data description and - Horsepower of motors
analysis. - weights of soil samples
DEFINITION
MEASUREMENT
- Assigning a numerical value to a variable.
1. NOMINAL DATA
- Use numbers for the purpose of identifying name or
membership in a group or category.
- It consists of “naming” observations or classifying them
into various mutually exclusive and collective exhaustive
categories
- All qualitative variables are measured on a nominal scale.
- Observations can be classified and counted without
particular order or ranking imposed on the data.
2. ORDINAL DATA
- Connotes ranking or inequalities. One category is higher
than the other one.
- Numbers represent “greater than” or “less than”
measurements, such as preferences or rankings.
ENGR. L.V. YNGENTE 2
CE 006 - ENGINEERING DATA ANALYSIS 2ND SEMESTER (AY 2019-2020)
DATA ANALYSIS
- is a process that begins with careful planning followed by
data collection, data description using graphical and
numerical summaries, data analysis, and finally
interpretation of results.
DATA ANALYSIS PROCESS
TO BE AN INFORMED CONSUMER OF REPORTS, YOU MUST BE ABLE TO
DO THE FOLLOWING:
THROUGHOUT YOUR PERSONAL AND PROFESSIONAL LIFE, YOU ILL
NEED TO UNDERSTAND AND USE DATA TO MAKE DECISIONS. TO DO
THIS, YOU MUST BE ABLE TO:
ENGR. L.V. YNGENTE 3