Chapter 7 Statistical Intervals

Uploaded by

Captain Right Jung hyuk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views113 pages

Chapter 7 Statistical Intervals

Uploaded by

Captain Right Jung hyuk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 113

ES – 71

ENGINEERING
DATA ANALYSIS
ENGR. MARY CRIS L. AYING-TAMPOS
FACULTY, CET
CHAPTER 7

STATISTICAL
INTERVALS
7. Statistical Intervals
7.1 Single Sample: Estimating the Mean
7.2 Confidence Interval on the Mean of a Normal
Distribution, Variance Unknown
7.3 Confidence Interval on the Variance and Standard
Deviation of a Normal Distribution
7.4 Two Samples: Estimating the Difference between
Two Means
7.5 Large-Sample Confidence Interval for a Population
Proportion
7.6 Prediction Interval for Future Observation
7.7 Tolerance Interval
Course References

(1) Walpole, Ronald E., et. al., 2016, “Probability and Statistics
for Engineers and Scientists”. 9th Ed., Pearson Education
Inc.
(2) Montgomery, Douglas C., et al., 2018, “Applied Statistics
and Probability for Engineers”., 7th Ed., John Wiley & Sons
(Asia) Pte Ltd.
(3) Murray, Spiegel R., et al., 2013, “Probability and
Statistics”, 4th Ed., McGraw Hill Companies Inc.
Grading System
Attendance : 5%
Quizzes/Participation : 15%
Prelim Exam : 20%
Midterm Exam : 20%
Prefinal Exam : 20%
Final Exam : 20%
100%

Passing Rate : 50%

01 Construct confidence intervals on the
mean of a normal distribution, using either

Chapter 7: the normal distribution or the 𝑡 distribution

method
02 Construct confidence intervals on the
Intended Learning
variance and standard deviation of a
Outcomes
normal distribution and on a population
proportion
03 Construct prediction intervals for a future
observation

04 Construct a tolerance interval for a normal

distribution

05 Explain the three types of interval

estimates; confidence intervals, ,
prediction intervals and tolerance intervals
Introduction

Statistical intervals represent an uncertainty that exists in the data

because we work with samples that are obtained from a larger
population or process. Statistical intervals are staples of the quality
and validation practitioner’s statistical toolbox. Statistical intervals
can manifest as plus-or-minus limits on test data, represent a margin
of error in a scientific poll, or indicate the level of confidence
associated with a predicted value.
Introduction
Engineers are often involved in estimating parameters. For example, there is an ASTM
Standard E23 that defines a technique called the Charpy V-notch method for notched bar impact
testing of metallic materials. The impact energy is often used to determine whether the material
experiences a ductile-to-brittle transition as the temperature decreases. Suppose that we have
tested a sample of 10 specimens of a particular material with this procedure. We know hat we
can use the sample average 𝑋ത to estimate the true mean impact energy μ. However, we also
know that the true mean impact energy is unlikely to be exactly equal to your estimate.
Reporting the results of your test as a single number is unappealing because nothing inherent in
𝑋ത provides any information about how close it is to μ. Our estimate could be very close, or it
could be considerably far from the true mean. A way to avoid this is to report the estimate in
terms of a range of plausible values called a confidence interval.
7.1
Single Sample:
Estimating the Mean
7.1 Single Sample: Estimating the Mean
A confidence interval always specifies a confidence level, usually 90%,
95%, or 99%, which is a measure of the reliability of the procedure. So if a
95% confidence interval on the impact energy based on the data from our 10
specimens has a lower limit of 63.84 J and an upper limit of 65.08 J, then
we can say that at the 95% level of confidence any value of mean impact
energy between 63.84 J and 65.08 J is a plausible value. By reliability, we
mean that if we repeated this experiment over and over again, 95% of all
samples would produce a confidence interval that contains the true mean
impact energy, and only 5% of the time would the interval be in error.
7.1 Single Sample: Estimating the Mean

An interval estimate for a population parameter is called a confidence

interval. Information about the precision of estimation is conveyed by the
length of the interval. A short interval implies precise estimation. We cannot
be certain that the interval contains the true, unknown population parameter
– we use only a sample from the full population to compute the point
estimate and the interval. However, the confidence interval is constructed so
that we have high confidence that it does contain the unknown population
parameter. Confidence intervals are widely used in engineering and the
sciences.
7.1 Single Sample: Estimating the Mean
A tolerance interval is another important type of interval estimate. For example, the
chemical product viscosity data might be assumed to be normally distributed. We might like to
calculate limits that bound 95% of the viscosity values. For a normal distribution, we know
that 95% of the distribution is in the interval
μ - 1.96σ, μ + 1.96σ
However, this is not a useful tolerance interval because the parameters μ and σ are unknown.
Point estimates such as 𝑥ҧ and s can be used in the preceding equation for μ and σ. However, we
need to account for the potential error in each point estimate to form a tolerance interval for the
distribution. The result is an interval of the form
𝑥ҧ − 𝑘𝑠, 𝑥ҧ + 𝑘𝑠
where k is an appropriate constant (that is larger than 1.96 to account for the estimation error).
7.1 Single Sample: Estimating the Mean
As in the case of a confidence interval, it is not certain that the tolerance interval
bounds 95% of the distribution, but the interval is constructed so that we have high
confidence that it does. Tolerance intervals are widely used and, as we will subsequently
see, they are easy to calculate for normal distributions.
Confidence and tolerance intervals bound unknown elements of a distribution. In this
chapter, you will learn to appreciate the value of these intervals. A prediction interval
provides bounds on one (or more) future observations from the population. For example, a
prediction interval could be used to bound a single, new measurement of viscosity—another
useful interval. With a large sample size, the prediction interval for normally distributed
data tends to the tolerance interval, but for more modest sample sizes, the prediction and
tolerance intervals are different.
7.1 Single Sample: Estimating the Mean
Keep the purpose of the three types of interval estimates clear:
• A confidence interval bounds population or distribution parameters (such as the
mean viscosity).
• A tolerance interval bounds a selected proportion of a distribution.
• A prediction interval bounds future observations from the population or
distribution.

Our experience has been that it is easy to confuse the three types of intervals. For
example, a confidence interval is often reported when the problem situation calls for
a prediction interval.
❖ Confidence Interval on the Mean of a Normal
Distribution, Variance Known

If 𝑥ҧ is the sample mean of a random sample of size 𝑛 from a normal

population with known variance 𝜎 2 , a 100(1 − α)% CI (confidence interval)
on μ is given by

where 𝑍𝛼/2 is the upper 100α/2 percentage point of the standard normal
distribution.
For small samples selected from non-normal populations, we cannot
expect our degree of confidence to be accurate. However, for samples of size
n ≥ 30, with the shape of the distributions not too skewed, sampling theory
guarantees good results.
❖ Confidence Interval on the Mean of a Normal
Distribution, Variance Known

A Confidence Interval, constructed from sample data, is a range of values

that is likely to include the population parameter, at some specified
confidence level. The confidence interval for a population mean is
determined by taking the sample mean (the point estimate) and subtracting
and adding a margin of error to it.

ഥ±𝑬 𝝈 𝒛𝜶Τ : a single value, called the critical value

𝑿 𝑬 = 𝒛𝜶ൗ 𝟐
𝟐 𝒏 : can be found in the normal tables or by
using software
EXCEL: CONFIDENCE(𝛼, 𝜎, 𝑛)
𝑺𝒊𝒈𝒏𝒊𝒇𝒊𝒄𝒂𝒏𝒄𝒆 𝑳𝒆𝒗𝒆𝒍: 𝜶 = 𝟏 − 𝑪𝑳
❖ Confidence Interval on the Mean of a Normal
Distribution, Variance Known

Example 1: Scores on an exam are normally distributed with a population

standard deviation of 5.6. A random sample of 40 scores on the
exam has a mean of 32.

Estimate the population mean with

a) 80% confidence level
b) 90% confidence level
c) 98% confidence level
❖ Confidence Interval on the Mean of a Normal
Distribution, Variance Known