Lecture Slides 11 UN1201

Uploaded by

jindian0623


Stat UN1201 - Intro. to Statistics and Probability

Instructor: Dr. Banu Baydil
TAs: Audrey Yang, ay2658
Kian Saraf Poor, ks4291
Seonghun Lee, sl4836

In Stat UN1201, we will use the textbook:

Jay Devore, “Probability and Statistics for Engineering and the
Sciences,” 9th ed., Brooks/Cole
ISBN: 9781305251809
Some of the lecture slides are adapted/modified from textbook slides (Copyright ©
Cengage Learning. All rights reserved.) based on the needs of the course and might
cover additional material that is not in the course textbook.
You should make sure to read the corresponding chapters in the book as the topics are
covered in class, as well as the lecture slides, and take notes of any additional material
that might be covered during the lectures but might not be either in the course textbook
or in the lecture slides.
Basic Properties of Confidence Intervals
The basic concepts and properties of confidence intervals (CIs) are most
easily introduced by first focusing on a simple, albeit somewhat unrealistic,
problem situation.

Suppose that the parameter of interest is a population mean $\mu$ and that

1. The population distribution is normal
2. The value of the population standard deviation $\sigma$ is known

(Normality of the population distribution is often a reasonable assumption.
However, if the value of $\mu$ is unknown, it is typically implausible that the
value of $\sigma$ would be available.)

The actual sample observations $x_1, x_2, \ldots, x_n$ are assumed to be the result
of a random sample $X_1, \ldots, X_n$ from a normal distribution with mean value $\mu$
and standard deviation $\sigma$.

Irrespective of the sample size n, the sample mean $\bar{X}$ is normally distributed
with expected value $\mu$ and standard deviation $\sigma/\sqrt{n}$.

Standardizing $\bar{X}$ yields the standard normal variable

$$Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}}$$

Basic Properties of Confidence Intervals
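As the area under the standard normal curve between –1.96 and 1.96 is .95,

$$P\left(-1.96 < \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} < 1.96\right) = .95$$

Manipulating the inequalities inside the parentheses above by multiplying
through by $\sigma/\sqrt{n}$, subtracting $\bar{X}$ from each term, and multiplying through by –1
to eliminate the minus sign in front of $\mu$ (which reverses the direction of each
inequality) yields

$$P\left(\bar{X} - 1.96\frac{\sigma}{\sqrt{n}} < \mu < \bar{X} + 1.96\frac{\sigma}{\sqrt{n}}\right) = .95$$

The event inside the parentheses, that the unknown constant $\mu$ is in the
interval $\left(\bar{X} - 1.96\,\sigma/\sqrt{n},\ \bar{X} + 1.96\,\sigma/\sqrt{n}\right)$, involves a random interval.
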
Basic Properties of Confidence Intervals
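The interval’s width is $2 \cdot (1.96) \cdot \sigma/\sqrt{n}$, which is not random; however, the
location of the interval (its midpoint $\bar{X}$) is random.

The random interval

$$\left(\bar{X} - 1.96\frac{\sigma}{\sqrt{n}},\ \bar{X} + 1.96\frac{\sigma}{\sqrt{n}}\right) \qquad (7.4)$$

is centered at $\bar{X}$.

Therefore, the probability

$$P\left(\bar{X} - 1.96\frac{\sigma}{\sqrt{n}} < \mu < \bar{X} + 1.96\frac{\sigma}{\sqrt{n}}\right) = .95$$

can be paraphrased as “the probability that this random interval includes or
covers the true value of $\mu$ is .95.”

Definition
If, after observing $X_1 = x_1, X_2 = x_2, \ldots, X_n = x_n$, we compute the observed
sample mean $\bar{x}$ and then substitute $\bar{x}$ into the expression above in place of
$\bar{X}$, the resulting fixed interval is called a 95% confidence interval for $\mu$.
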
Basic Properties of Confidence Intervals

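1) This CI can be expressed either as

$$\left(\bar{x} - 1.96\frac{\sigma}{\sqrt{n}},\ \bar{x} + 1.96\frac{\sigma}{\sqrt{n}}\right) \text{ is a 95\% CI for } \mu$$

2) or as

$$\bar{x} - 1.96\frac{\sigma}{\sqrt{n}} < \mu < \bar{x} + 1.96\frac{\sigma}{\sqrt{n}} \quad \text{with 95\% confidence}$$

3) or a concise expression for the interval is $\bar{x} \pm 1.96 \cdot \sigma/\sqrt{n}$,
where – gives the left endpoint (lower limit) and + gives the
right endpoint (upper limit).
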
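As a minimal sketch of the concise form above, the following Python snippet evaluates the 95% interval for a known standard deviation. The function name and the inputs (sample mean 80.0, sigma 2.0, n = 31) are assumptions chosen for illustration; they are not taken from the slides.

```python
import math

def z_interval_95(xbar, sigma, n):
    """95% CI for mu when sigma is known: xbar +/- 1.96 * sigma / sqrt(n)."""
    half_width = 1.96 * sigma / math.sqrt(n)
    return xbar - half_width, xbar + half_width

# Illustrative (assumed) inputs: observed sample mean, known sigma, sample size.
lower, upper = z_interval_95(xbar=80.0, sigma=2.0, n=31)
print(f"95% CI for mu: ({lower:.2f}, {upper:.2f})")
```

Note that the endpoints are fixed numbers once the sample mean has been observed; only the procedure that produced them is random.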
Interpreting a Confidence Level
Consider 100 such constructed intervals. In the figure below the vertical
line cuts the measurement axis at the true (but unknown) value of $\mu$.
Notice that 7 of the 100 intervals shown fail to contain $\mu$.
*In the long run, only 5% of the intervals so constructed would fail to contain $\mu$.
According to this interpretation, the confidence
level 95% is not so much a statement about
any particular interval such as (79.3, 80.7).
Instead it pertains to what would happen if a
very large number of like intervals were to be
constructed using the same CI formula.
[Figure: One hundred 95% CIs (asterisks identify intervals that do not include $\mu$).]

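The long-run interpretation can be checked by simulation. The sketch below repeatedly draws normal samples, builds the 95% interval with known sigma, and counts how often the interval misses the true mean. The population values (mean 80, sigma 2), the sample size, and the seeds are assumptions made for illustration.

```python
import math
import random

def count_misses(mu, sigma, n, num_intervals, seed=0):
    """Count 95% intervals (sigma known) that fail to contain the true mean mu."""
    rng = random.Random(seed)
    half_width = 1.96 * sigma / math.sqrt(n)
    misses = 0
    for _ in range(num_intervals):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        xbar = sum(sample) / n
        # The interval misses when mu lies outside (xbar - hw, xbar + hw).
        if not (xbar - half_width < mu < xbar + half_width):
            misses += 1
    return misses

# A batch of 100 intervals, as in the figure, and a much larger batch.
misses_100 = count_misses(mu=80.0, sigma=2.0, n=31, num_intervals=100)
misses_many = count_misses(mu=80.0, sigma=2.0, n=31, num_intervals=20000, seed=1)
print("misses out of 100:", misses_100)
print("miss rate over 20000 intervals:", misses_many / 20000)
```

Any single batch of 100 intervals need not show exactly 5 misses; it is the miss rate over a very large number of intervals that settles near 5%.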
Other Levels of Confidence
Definition
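A $100(1 - \alpha)\%$ confidence interval for the mean $\mu$ of a normal population
when the value of $\sigma$ is known is given by

$$\left(\bar{x} - z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\ \bar{x} + z_{\alpha/2}\frac{\sigma}{\sqrt{n}}\right) \qquad (7.5)$$

(or, equivalently, by $\bar{x} \pm z_{\alpha/2} \cdot \sigma/\sqrt{n}$)

The figure below shows that a probability of $1 - \alpha$ is achieved by using $z_{\alpha/2}$ in
place of 1.96.

[Figure: $P(-z_{\alpha/2} \le Z < z_{\alpha/2}) = 1 - \alpha$]
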
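A short sketch of how the z critical value depends on the confidence level, using the standard normal inverse CDF from Python's standard library. The helper name `z_critical` is an assumption for illustration.

```python
from statistics import NormalDist

def z_critical(confidence):
    """Return z_{alpha/2}, the point with upper-tail area alpha/2 under the z curve."""
    alpha = 1.0 - confidence
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)

for level in (0.90, 0.95, 0.99):
    print(f"confidence {level:.2f}: z critical value = {z_critical(level):.3f}")
```

The 95% value agrees with 1.96, and the 99% value rounds to the 2.58 commonly used for that level.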
Confidence Level, Precision, and Sample Size
Why settle for a confidence level of 95% when a level of 99% is
achievable? The price paid for the higher confidence level is a wider
interval.
The width of the 95% interval is $2(1.96)\,\sigma/\sqrt{n} = 3.92\,\sigma/\sqrt{n}$.
The width of the 99% interval is $2(2.58)\,\sigma/\sqrt{n} = 5.16\,\sigma/\sqrt{n}$.
One has more confidence in the 99% interval precisely because it is wider.
The width of the interval can be thought of as specifying its precision or
accuracy. *Then the confidence level (or reliability) of the interval is
inversely related to its precision.
A highly reliable interval estimate may be imprecise, whereas a precise
interval may entail relatively low reliability.
Thus it cannot be said unequivocally that a 99% interval is to be
preferred to a 95% interval. *An appealing strategy is to specify both
the desired confidence level and interval width and then determine the
necessary sample size.
Confidence Level, Precision, and Sample Size

A general formula for the sample size n necessary to ensure an interval
width, w, is obtained from equating w to $2 \cdot z_{\alpha/2} \cdot \sigma/\sqrt{n}$ and solving for n.

The sample size necessary for the CI (7.5) to have a width w is

$$n = \left(2 z_{\alpha/2} \cdot \frac{\sigma}{w}\right)^2$$

*Note that:
1) The smaller the desired width w, the larger n must be.
2) n is an increasing function of $\sigma$ (more population variability
necessitates a larger sample size)
3) n is an increasing function of the confidence level $100(1 - \alpha)\%$ (as $\alpha$
decreases, $z_{\alpha/2}$ increases).

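The sample-size formula can be sketched in a few lines. Because n must be a whole number, the sketch rounds up so that the width is no larger than requested. The function name and the inputs (sigma = 25, desired width 10) are assumptions for illustration.

```python
import math
from statistics import NormalDist

def sample_size_for_width(sigma, width, confidence=0.95):
    """Smallest integer n for which the z interval (sigma known) has width <= `width`."""
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    n_exact = (2.0 * z * sigma / width) ** 2
    return math.ceil(n_exact)  # round up: n must be an integer

# Assumed illustrative inputs.
print(sample_size_for_width(sigma=25.0, width=10.0))                   # 95% level
print(sample_size_for_width(sigma=25.0, width=5.0))                    # narrower interval
print(sample_size_for_width(sigma=25.0, width=10.0, confidence=0.99))  # higher confidence
```

Halving the width roughly quadruples the required n, since w enters the formula through its square.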
Large-Sample Confidence Intervals for a Population Mean and Proportion

Earlier we came across the CI for $\mu$ which assumed that the population
distribution is normal with the value of $\sigma$ known.

We now present a large-sample CI for $\mu$ whose validity does not require
these assumptions.

Let $X_1, X_2, \ldots, X_n$ be a random sample from a population having a mean $\mu$
and standard deviation $\sigma$. Provided that n is large, the Central Limit Theorem
(CLT) implies that $\bar{X}$ has approximately a normal distribution whatever the
nature of the population distribution. Then it follows that

$$Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}}$$

has approximately a standard normal distribution, and

$$P\left(-z_{\alpha/2} < \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} < z_{\alpha/2}\right) \approx 1 - \alpha$$

As before, we can argue that a large-sample CI for $\mu$ with a
confidence level of approximately $100(1 - \alpha)\%$ is given by $\bar{x} \pm z_{\alpha/2} \cdot \sigma/\sqrt{n}$.
However, computation of this CI requires the unknown value of $\sigma$.
* For large n, one can then substitute the unknown $\sigma$ with its estimate S.

A Large-Sample Interval for $\mu$
Proposition
If n is sufficiently large, the standardized variable

$$Z = \frac{\bar{X} - \mu}{S/\sqrt{n}}$$

has approximately a standard normal distribution. This implies that a
large-sample confidence interval for $\mu$ with confidence level approximately
$100(1 - \alpha)\%$ is given by

$$\bar{x} \pm z_{\alpha/2} \cdot \frac{s}{\sqrt{n}}$$

This formula is valid regardless of the shape of the population distribution.

In words, the above CI is

(point estimate of $\mu$) $\pm$ (z critical value) × (estimated standard
error of the mean)

*Generally speaking, n > 40 will be sufficient to justify the use of this
interval. This is somewhat more conservative than the rule of thumb for the
CLT because of the additional variability introduced by using S in place of $\sigma$.

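A minimal sketch of the large-sample interval computed directly from data, with the sample standard deviation s standing in for sigma. The data here are simulated from a skewed (exponential) population with an assumed mean of 5, purely to illustrate that normality of the population is not required when n is large.

```python
import math
import random
import statistics
from statistics import NormalDist

def large_sample_ci(data, confidence=0.95):
    """Large-sample CI for mu: xbar +/- z_{alpha/2} * s / sqrt(n)."""
    n = len(data)
    xbar = statistics.mean(data)
    s = statistics.stdev(data)  # sample standard deviation (divisor n - 1)
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    half_width = z * s / math.sqrt(n)
    return xbar - half_width, xbar + half_width

# Assumed illustrative data: 50 draws (n > 40) from a skewed population.
rng = random.Random(42)
data = [rng.expovariate(1 / 5.0) for _ in range(50)]

print("95% CI:", large_sample_ci(data, 0.95))
print("99% CI:", large_sample_ci(data, 0.99))
```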
A General Large-Sample Confidence Interval
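Suppose that $\hat{\theta}$ is an estimator satisfying the following properties:
(1) It has approximately a normal distribution;
(2) it is (at least approximately) unbiased; and
(3) an expression for $\sigma_{\hat{\theta}}$, the standard deviation of $\hat{\theta}$, is available.
Standardizing $\hat{\theta}$ yields the rv $Z = (\hat{\theta} - \theta)/\sigma_{\hat{\theta}}$, which has approximately a
standard normal distribution.
(If $\sigma_{\hat{\theta}}$ involves unknown parameters, let $s_{\hat{\theta}}$ be the estimate of $\sigma_{\hat{\theta}}$ obtained by
using estimates of the unknown parameters, e.g., $s/\sqrt{n}$ estimates $\sigma/\sqrt{n}$.)
Under general conditions (essentially that $s_{\hat{\theta}}$ be close to $\sigma_{\hat{\theta}}$ for most samples),
a valid CI is

$$\hat{\theta} \pm z_{\alpha/2} \cdot s_{\hat{\theta}}$$

In words: (point estimate of $\theta$) $\pm$ (z critical value) × (estimated standard
error of the estimator)
(e.g., the large-sample interval $\bar{x} \pm z_{\alpha/2} \cdot s/\sqrt{n}$.)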
A Confidence Interval for a Population Proportion
Let p denote the proportion of “successes” in a population, where success
identifies an individual or object with a specified property (e.g., individuals
who graduated from college, computers that do not need warranty service).

A random sample of n individuals is to be selected. Let X be the number of

successes in the sample. Assume n is small compared to the population size.
Then, X can be viewed as a binomial rv with $E(X) = np$ and $\sigma_X = \sqrt{np(1 - p)}$.

Furthermore, when the sample size n is very large and both $np \ge 10$ and
$nq \ge 10$ ($q = 1 - p$), the natural estimator of p, $\hat{p} = X/n$ (the sample fraction
of successes), has approximately a normal distribution.

Recalling that $E(\hat{p}) = p$ (unbiasedness) and $\sigma_{\hat{p}} = \sqrt{p(1 - p)/n}$, one can

construct a large-sample confidence interval for p with confidence level
approximately $100(1 - \alpha)\%$ as

$$\hat{p} \pm z_{\alpha/2}\sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}$$

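A sketch of a large-sample interval for a proportion, obtained by applying the general recipe (point estimate ± z critical value × estimated standard error) with the sample proportion plugged into the standard error. This is the traditional form; treating it as the formula shown on the slide is an assumption, and the counts used (120 successes in 400 trials) are hypothetical.

```python
import math
from statistics import NormalDist

def proportion_ci(successes, n, confidence=0.95):
    """Traditional large-sample CI for p: p_hat +/- z * sqrt(p_hat * (1 - p_hat) / n)."""
    p_hat = successes / n
    # Rule-of-thumb check, applied to the estimates since p itself is unknown.
    if n * p_hat < 10 or n * (1 - p_hat) < 10:
        raise ValueError("normal approximation is doubtful: need n*p_hat >= 10 and n*q_hat >= 10")
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    std_err = math.sqrt(p_hat * (1 - p_hat) / n)  # estimated standard error of p_hat
    return p_hat - z * std_err, p_hat + z * std_err

# Hypothetical counts for illustration.
low, high = proportion_ci(successes=120, n=400)
print(f"95% CI for p: ({low:.4f}, {high:.4f})")
```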
Intervals Based on a Normal Population Distribution

The CLT cannot be invoked when n is small. In this case, one way to
proceed is to make a specific assumption about the form of the
population distribution and then derive a CI tailored to that assumption.

Assumption
The population of interest is normal, so that $X_1, \ldots, X_n$ constitutes a
random sample from a normal distribution with both $\mu$ and $\sigma$ unknown.

The key result in the earlier section was that for large n, S was close to $\sigma$,
and the rv $Z = (\bar{X} - \mu)/(S/\sqrt{n})$ had approximately a standard normal
distribution.

When n is small, S is likely not to be close to $\sigma$, so the variability in the
distribution of Z arises from randomness in both the numerator and the
denominator. As a result the probability distribution of $(\bar{X} - \mu)/(S/\sqrt{n})$
will be more spread out than the standard normal distribution.

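The extra spread can be seen in a small simulation: for normal samples, compute $(\bar{X} - \mu)/(S/\sqrt{n})$ many times and record how often its absolute value exceeds 1.96. For a standard normal variable that fraction would be about .05. The sample sizes, trial count, and seed below are assumptions chosen for illustration.

```python
import math
import random

def exceed_fraction(n, trials=10000, cutoff=1.96, seed=0):
    """Fraction of normal samples of size n with |(xbar - mu) / (s / sqrt(n))| > cutoff."""
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # true parameters of the simulated population
    exceed = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        xbar = sum(sample) / n
        s = math.sqrt(sum((x - xbar) ** 2 for x in sample) / (n - 1))
        t = (xbar - mu) / (s / math.sqrt(n))
        if abs(t) > cutoff:
            exceed += 1
    return exceed / trials

small_n = exceed_fraction(n=5)
large_n = exceed_fraction(n=50)
print("n = 5 :", small_n)
print("n = 50:", large_n)
```

With n = 5 the fraction is well above .05, while with n = 50 it is much closer to .05, which is why the z critical value is not appropriate for small samples.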
Intervals Based on a Normal Population Distribution
The result on which inferences are based introduces a new family of
probability distributions called t distributions.

Theorem
When $\bar{X}$ is the mean of a random sample of size n from a normal
distribution with mean $\mu$, the rv

$$T = \frac{\bar{X} - \mu}{S/\sqrt{n}} \qquad (7.13)$$

has a probability distribution called a t distribution with n – 1 degrees of
freedom (df).

* Although the variable of interest is still $(\bar{X} - \mu)/(S/\sqrt{n})$, we now
denote it by T to emphasize that it does not have a standard normal
distribution when n is small.

*Any particular t distribution results from specifying the value of a single
parameter, $\nu$ (taking positive integer values), called the number of degrees of
freedom, abbreviated df.
Properties of t Distributions
Let $t_\nu$ denote the t distribution with $\nu$ df.

1. Each $t_\nu$ curve is bell-shaped and centered at 0.

2. Each $t_\nu$ curve is more spread out than the standard normal (z) curve.

3. As $\nu$ increases, the spread of the corresponding $t_\nu$ curve decreases.

4. As $\nu \to \infty$, the sequence of $t_\nu$ curves approaches the standard normal
curve (so the z curve is often called the t curve with df = $\infty$).
Properties of t Distributions
The figure below illustrates several of these properties for
selected values of $\nu$.

[Figure: $t_\nu$ and z curves]

Properties of t Distributions
Notation
Let $t_{\alpha,\nu}$ = the number on the measurement axis for which the area under
the t curve with $\nu$ df to the right of $t_{\alpha,\nu}$ is $\alpha$; $t_{\alpha,\nu}$ is called a t critical
value.

[Figure: Illustration of a $t_{\alpha,\nu}$ critical value]

Because t curves are symmetric about zero, $-t_{\alpha,\nu}$ captures lower-tail
area $\alpha$. Appendix Table A.5 gives $t_{\alpha,\nu}$ for selected values of $\alpha$ and $\nu$.

The columns of the table correspond to different values of $\alpha$. To obtain
$t_{.05,15}$, go to the $\alpha = .05$ column, look down to the $\nu = 15$ row, and read
$t_{.05,15} = 1.753$.

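For readers without the table at hand, a t critical value can be approximated numerically. The sketch below uses only the Python standard library: it integrates the t density with Simpson's rule to get the upper-tail area and then bisects for the point whose upper-tail area equals the requested value. The function names, step count, and search bracket are assumptions of this sketch, not a reference implementation.

```python
import math

def t_pdf(t, df):
    """Density of the t distribution with df degrees of freedom."""
    log_c = math.lgamma((df + 1) / 2) - math.lgamma(df / 2) - 0.5 * math.log(df * math.pi)
    return math.exp(log_c - (df + 1) / 2 * math.log1p(t * t / df))

def t_upper_tail(t, df, steps=4000):
    """Area to the right of t >= 0: 0.5 minus the Simpson-rule integral from 0 to t."""
    h = t / steps  # steps is even, as Simpson's rule requires
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

def t_critical(alpha, df):
    """Bisection for the value with upper-tail area alpha (0 < alpha < 0.5)."""
    lo, hi = 0.0, 1000.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_upper_tail(mid, df) > alpha:
            lo = mid  # tail area still too large: move right
        else:
            hi = mid
    return (lo + hi) / 2

print(f"upper-tail area .05, 15 df: {t_critical(0.05, 15):.3f}")
print(f"upper-tail area .025, 15 df: {t_critical(0.025, 15):.3f}")
```

The first printed value can be compared against the table entry 1.753 quoted above.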
The One-Sample t Confidence Interval
Proposition
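Let $\bar{x}$ and s be the sample mean and sample standard deviation computed
from the results of a random sample from a normal population with mean
$\mu$. Then a $100(1 - \alpha)\%$ confidence interval for $\mu$ is

$$\left(\bar{x} - t_{\alpha/2,\,n-1} \cdot \frac{s}{\sqrt{n}},\ \bar{x} + t_{\alpha/2,\,n-1} \cdot \frac{s}{\sqrt{n}}\right)$$

or, more compactly

$$\bar{x} \pm t_{\alpha/2,\,n-1} \cdot \frac{s}{\sqrt{n}}$$
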
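A minimal sketch of the one-sample t interval, where the critical value is looked up in a t table and passed in by the caller. The 16 observations are hypothetical, and 2.131 is the standard table value for upper-tail area .025 with 15 df.

```python
import math
import statistics

def t_interval(data, t_crit):
    """One-sample t CI: xbar +/- t_crit * s / sqrt(n), with t_crit taken from a t table."""
    n = len(data)
    xbar = statistics.mean(data)
    s = statistics.stdev(data)  # sample standard deviation (divisor n - 1)
    half_width = t_crit * s / math.sqrt(n)
    return xbar - half_width, xbar + half_width

# Hypothetical sample of n = 16 measurements, so df = n - 1 = 15.
data = [10.2, 9.8, 10.5, 10.1, 9.9, 10.4, 10.0, 9.7,
        10.3, 10.6, 9.6, 10.2, 10.1, 9.9, 10.0, 10.3]
T_025_15 = 2.131  # table value for alpha/2 = .025 and 15 df (95% confidence)

low, high = t_interval(data, T_025_15)
print(f"95% t interval for mu: ({low:.3f}, {high:.3f})")
```

The interval is wider than the one a z critical value of 1.96 would give for the same data, reflecting the extra uncertainty from estimating the standard deviation with a small sample.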
Confidence Intervals for the Variance and Standard Deviation of a Normal Population

Although inferences concerning a population variance $\sigma^2$ or standard
deviation $\sigma$ are usually of less interest than those about a mean or
proportion, there are occasions when such procedures are needed.
In the case of a normal population distribution, inferences are based on
the following result concerning the sample variance $S^2$.

Theorem
Let $X_1, X_2, \ldots, X_n$ be a random sample from a normal distribution with
parameters $\mu$ and $\sigma^2$. Then the rv

$$\frac{(n - 1)S^2}{\sigma^2} = \frac{\sum (X_i - \bar{X})^2}{\sigma^2}$$

has a chi-squared ($\chi^2$) probability distribution with n – 1 df.

Confidence Intervals for the Variance and Standard Deviation of a Normal Population

The graphs of several $\chi^2$ probability density functions
(pdf’s) are illustrated in the figure below.

[Figure: Graphs of chi-squared density functions]

Confidence Intervals for the Variance and Standard Deviation of a Normal Population

Notation
Let $\chi^2_{\alpha,\nu}$, called a chi-squared critical value, denote the number on the
horizontal axis such that $\alpha$ of the area under the chi-squared curve with
$\nu$ df lies to the right of $\chi^2_{\alpha,\nu}$.
The chi-squared distribution is not symmetric, so Appendix Table A.7
contains values of $\chi^2_{\alpha,\nu}$ both for $\alpha$ near 0 and near 1.

[Figure: $\chi^2_{\alpha,\nu}$ notation illustrated, panels (a) and (b)]
Confidence Intervals for the Variance and Standard Deviation of a Normal Population

*A $100(1 - \alpha)\%$ confidence interval for the variance $\sigma^2$ of
a normal population has lower limit

$$\frac{(n - 1)s^2}{\chi^2_{\alpha/2,\,n-1}}$$

and upper limit

$$\frac{(n - 1)s^2}{\chi^2_{1-\alpha/2,\,n-1}}$$

*A confidence interval for the standard deviation $\sigma$ has
lower and upper limits that are the square roots of the
corresponding limits in the interval for $\sigma^2$.
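A sketch of the variance interval using only the Python standard library. The chi-squared CDF is computed from the series for the regularized lower incomplete gamma function, and critical values are found by bisection; both helpers are assumptions of this sketch, intended for moderate df, and the data are hypothetical.

```python
import math
import statistics

def chi2_cdf(x, df):
    """Chi-squared CDF via the series for the regularized lower incomplete gamma function."""
    if x <= 0:
        return 0.0
    a, z = df / 2.0, x / 2.0
    term = 1.0 / a
    total = term
    k = 1
    while term > 1e-16 * total:
        term *= z / (a + k)
        total += term
        k += 1
    return total * math.exp(a * math.log(z) - z - math.lgamma(a))

def chi2_critical(alpha, df):
    """Value with area alpha to its RIGHT under the chi-squared curve with df degrees of freedom."""
    lo, hi = 0.0, df + 20.0 * math.sqrt(2.0 * df) + 50.0
    for _ in range(100):
        mid = (lo + hi) / 2
        if 1.0 - chi2_cdf(mid, df) > alpha:
            lo = mid  # right-tail area still too large: move right
        else:
            hi = mid
    return (lo + hi) / 2

def variance_ci(data, confidence=0.95):
    """CI for sigma^2 of a normal population; take square roots for a CI for sigma."""
    n = len(data)
    s2 = statistics.variance(data)  # sample variance (divisor n - 1)
    alpha = 1.0 - confidence
    lower = (n - 1) * s2 / chi2_critical(alpha / 2, n - 1)
    upper = (n - 1) * s2 / chi2_critical(1 - alpha / 2, n - 1)
    return lower, upper

# Hypothetical sample of n = 16 measurements (df = 15).
data = [10.2, 9.8, 10.5, 10.1, 9.9, 10.4, 10.0, 9.7,
        10.3, 10.6, 9.6, 10.2, 10.1, 9.9, 10.0, 10.3]
var_low, var_high = variance_ci(data)
print("95% CI for variance:", (var_low, var_high))
print("95% CI for std dev: ", (math.sqrt(var_low), math.sqrt(var_high)))
```

Because the chi-squared curve is not symmetric, the interval is not centered at the sample variance: the upper limit lies farther from it than the lower limit does.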
