Chapter 7 Estimation
Chapter 7 Estimation
Estimation
6-2
Types of estimators
• Point Estimate
A single-valued estimate.
A single element chosen from a sampling
distribution.
Conveys little information about the actual value of
the population parameter, about the accuracy of
the estimate.
• Confidence Interval or Interval Estimate
An interval or range of values believed to include
the unknown population parameter.
Associated with the interval is a measure of the
confidence we have that the interval does indeed
contain the parameter of interest.
6-3
1. Point Estimation
Definition:
A parameter is a numerical descriptive measure
population parameter.
We use X , S2, S , p, etc. to estimate μ, σ2, σ, P (or
π), etc.
6-4
Definition:
A point estimate of some population parameter
O is a single value Ô of a sample statistic
Properties
1. The mean of the sampling distribution of
means is the same as the population mean, μ .
Confidence Intervals
Using Statistics
Confidence Interval for the Population Mean
When the Population Standard Deviation is
Known
Confidence Intervals for When is Unknown -
The t Distribution
Large-Sample Confidence Intervals for the
Population Proportion p
Sample Size Determination
6-8
AAconfidence
confidenceinterval
intervalor orinterval
intervalestimate
estimateisisaarange
rangeor
orinterval
intervalof
of
numbersbelieved
numbers believedtotoinclude
includeananunknown
unknownpopulation
populationparameter.
parameter.
Associatedwith
Associated withthe
theinterval
intervalisisaameasure
measureofofthe
theconfidence
confidencewe wehave
have
thatthe
that theinterval
intervaldoes
doesindeed
indeedcontain
containthe
theparameter
parameterof ofinterest.
interest.
If the population distribution is normal, the
sampling distribution of the mean is normal.
If the sample is sufficiently large, regardless of
the shape of the population distribution, the
sampling distribution is normal (Central Limit
Theorem).
f(z)
or 0.2
or
0.1
x 1.96 x 1.96 0.95
P
P x 1.96 n x 1.96 n 0.95
n n 0.0
-4 -3 -2 -1 0 1 2 3 4
z
6-11
Aftersampling,
After sampling,approximat
approximately
ely95%
95%of
of such
suchintervals
intervals
x 1.96
x 1.96
nn
willinclude
will includethe
thepopulation
populationmean
mean(and
(and5%
5%of
of them
themwill
willnot).
not).
Thatis,
That 1.96
is,xx 1.96 isisaa95%
95%confidence
confidenceinterval for..
intervalfor
nn
6-12
Approximately95%
Approximately 95%of ofthe
theintervals
intervals
Sampling
p n Distribution
u of the Mean aroundthe
x 1.96 around thesample
samplemean
meancan
canbebe
n
0.4
expectedtotoinclude
expected includethe
theactual
actualvalue
valueof
ofthe
the
0.3
95%
populationmean,
population mean,.. (When
(Whenthethesample
sample
meanfalls
mean fallswithin
withinthe
the95%
95%interval
intervalaround
around
f(x)
0.2
0.1
thepopulation
the populationmean.)
mean.)
2.5% 2.5%
x x x
**5%
0.0
196
.
n
196
.
n
x
5%of
ofsuch
suchintervals
intervalsaround
aroundthe
thesample
sample
meancan
mean canbebeexpected
expectednot
nottotoinclude
includethe
the
x
actualvalue
actual valueof
ofthe
thepopulation
populationmean.
mean.
x
(Whenthe
(When thesample
samplemean
meanfalls
fallsoutside
outsidethe
the
x
95%interval
intervalaround
aroundthe
thepopulation
population
* x
x
95%
mean.)
x mean.)
x
x
x
x
x
*
6-13
Interpretation:
a.Probabilistic: in repeated sampling, 100(1-α)% of all
intervals will include μ
b.Practical: we are 100(1-α)% confident that a interval
contains μ.
6-14
AA95%
95%confidence
confidenceinterval forwhen
intervalfor whenisisknown
knownandandsampling
samplingisis
donefrom
done fromaanormal
normalpopulation,
population,ororaalarge
largesample
sampleisisused:
used:
x 1.96
n
Thequantity
quantity 1.96
The isisoften
oftencalled
calledthe
themargin
marginof
oferror
erroror
orthe
the
n
samplingerror.
sampling error.
z
2 2
(1 )
2 2
z z
2
2
6-16
(1 )
z
2 2 (1 )
Whensampling
When samplingfrom
fromthe
thesame
samepopulation,
population,using
usingaafixed
fixedsample
samplesize,
size,the
the
higherthe
higher theconfidence
confidencelevel,
level,the
thewider
widerthe
theconfidence
confidenceinterval.
interval.
St an d ar d N or m al Dis tri b uti o n St an d ar d N or m al Di s tri b uti o n
0.4 0.4
0.3 0.3
f(z)
f(z)
0.2 0.2
0.1 0.1
0.0 0.0
-5 -4 -3 -2 -1 0 1 2 3 4 5 -5 -4 -3 -2 -1 0 1 2 3 4 5
Z Z
Whensampling
When samplingfrom
fromthe
thesame
samepopulation,
population,using
usingaafixed
fixedconfidence
confidence
level,the
level, thelarger
largerthe
thesample
samplesize,
size,n,
n,the
thenarrower
narrowerthetheconfidence
confidence
interval.
interval.
S am
S am pplin
lingg D
D is
istrib
tributio
utionn ooff th
thee M
Meean
an S am
S am pplin
lingg D
D is
istrib
trib utio
utio nn ooff th
thee M
Mee an
an
00.4
.4 00.9
.9
00.8
.8
00.3
.3 00.7
.7
00.6
.6
00.5
.5
f(x)
f(x)
00.2
.2
f(x)
f(x)
00.4
.4
00.3
.3
00.1
.1
00.2
.2
00.1
.1
00.0
.0
00.0
.0
xx xx
A
A physical
physical therapist
therapist wished
wished to
to estimate,
estimate, with
with
99% confidence,
99% confidence, the
the mean
mean maximal
maximal strength
strength
of aa particular
of particular muscle
muscle in in aa certain
certain group
group of
of
individuals. He
individuals. He assume
assume thatthat strength
strength scores
scores
are approximately
are approximately normally
normally distributed
distributed with
with
aa variance
variance of of 144.
144. AA sample
sample of of 15
15 subjects
subjects
who participated
who participated inin the
the experiment
experiment yielded
yielded aa
mean of
mean of 84.3.
84.3. What
What is is 90%
90% CI?CI?
6-20
Solution
α = 0.01⇒ Zα/2 = 2.58
Mean =84.3, n=15, σ =12
84.3 ± 2.58(12/ √15) ⇒ 84.3 ± 8.0 ⇒ (76.3, 92.3)
⇒ We are 99% confident that the population mean is
between 76.3 and 92.3.
6-21
asasthe
thenumber
numberofofdegrees
degreesofoffreedom
freedomincreases
increases
6-22
(1-)100%
AA(1- )100%confidence
confidenceinterval forwhen
intervalfor whenisisnot
notknown
known
(assumingaanormally
(assuming normallydistributed
distributedpopulation):
population):
s
x t
n
2
wheret isisthe
where thevalue
valueofofthe
thettdistribution
distributionwith
withn-1n-1degrees
degreesof
of
2
freedomthat
freedom thatcuts
cutsoff
offaatail
tailarea
areaof
of 2 totoits
itsright.
right.
6-23
}
f(t)
0 .2
9 1.383 1.833 2.262 2.821 3.250
10 1.372 1.812 2.228 2.764 3.169
11 1.363 1.796 2.201 2.718 3.106 0 .1
12 1.356 1.782 2.179 2.681 3.055
13 1.350 1.771 2.160 2.650 3.012
14 1.345 1.761 2.145 2.624 2.977 0 .0
15 1.341 1.753 2.131 2.602 2.947 -1.372 0 1.372
-2.228 2.228
16 1.337 1.746 2.120 2.583 2.921
}
t
17 1.333 1.740 2.110 2.567 2.898
18 1.330 1.734 2.101 2.552 2.878
19 1.328 1.729 2.093 2.539 2.861 Area = 0.025 Area = 0.025
20 1.325 1.725 2.086 2.528 2.845
21 1.323 1.721 2.080 2.518 2.831
22
23
1.321
1.319
1.717
1.714
2.074
2.069
2.508
2.500
2.819
2.807 Wheneverisisnot
Whenever notknown
known(and
(andthe
thepopulation
populationisis
24
25
1.318
1.316
1.711
1.708
2.064
2.060
2.492
2.485
2.797
2.787 assumednormal),
assumed normal),thethecorrect
correctdistribution
distributiontotouse
useisis
26
27
1.315
1.314
1.706
1.703
2.056
2.052
2.479
2.473
2.779
2.771 thet tdistribution
the distributionwith
withn-1
n-1degrees
degreesofoffreedom.
freedom.
28
29
1.313
1.311
1.701
1.699
2.048
2.045
2.467
2.462
2.763
2.756 Note,however,
Note, however,that
thatfor
forlarge
largedegrees
degreesofoffreedom,
freedom,
30
40
1.310
1.303
1.697
1.684
2.042
2.021
2.457
2.423
2.750
2.704 thet tdistribution
the distributionisisapproximated
approximatedwellwellbybythe
theZZ
60
120
1.296
1.289
1.671
1.658
2.000
1.980
2.390
2.358
2.660
2.617 distribution.
distribution.
1.282 1.645 1.960 2.326 2.576
6-24
Solution
• tα/2, n-1 / = t 0.025,10 = 2.2281
1 . 51 ± 2 . 2281(0 . 33/11)
1 . 51± 0 . 221
(1 . 289 ,1 . 731 )
• We are 95% sure that the μ (1 . 289 ,1 . 731 ) population mean lies
between 1.289 and 1.731
6-25
Example 6-3: An economist wants to estimate the average amount in checking accounts
at banks in a given region. A random sample of 100 accounts gives x-bar = $357.60
and s = $140.00. Give a 95% confidence interval for , the average amount in any
checking account at a bank in the given region.
s 140.00
x z0.025 357.60 196
. 357.60 27.44 33016,38504
. .
n 100
6-27
For estimating p , a sample is considered large enough when both n p an n q are greater
than 5.
6-28
AAmarketing
marketingresearch
researchfirm
firmwants
wantstotoestimate
estimatethe
theshare
sharethat
thatforeign
foreigncompanies
companies
haveininthe
have theAmerican
Americanmarket
marketfor forcertain
certainproducts.
products. AArandom
randomsample
sampleof
of100
100
consumersisisobtained,
consumers obtained,and
andititisisfound
foundthat
that34
34people
peopleininthe
thesample
sampleare
areusers
users
offoreign-made
of foreign-madeproducts;
products;the
therestrestare
areusers
usersof
ofdomestic
domesticproducts.
products. Give
Giveaa
95%confidence
95% confidenceinterval
intervalfor
forthe
theshare
shareof
offoreign
foreignproducts
productsininthis
thismarket.
market.
pq ( 0.34 )( 0.66)
p z 0.34 1.96
2
n 100
0.34 (1.96)( 0.04737 )
0.34 0.0928
0.2472 ,0.4328
Thus,the
Thus, thefirm
firmmay
maybebe95%
95%confident
confidentthat
thatforeign
foreignmanufacturers
manufacturerscontrol
control
anywherefrom
anywhere from24.72%
24.72%toto43.28%
43.28%ofofthe
themarket.
market.
6-30
LowerLevel
Lower Levelof
ofConfidence
Confidence LargerSample
Larger SampleSize
Size
Beforedetermining
Before determiningthe
thenecessary
necessarysample
samplesize,
size,three
threequestions
questionsmust
must
beanswered:
be answered:
•• How
Howclose
closedo
doyou
youwant
wantyour
yoursample
sampleestimate
estimateto
tobe
beto
tothe
the
unknownparameter?
unknown parameter? (What
(Whatisisthe
thedesired bound, B?)
desired bound, B?)
•• What
Whatdo doyou
youwant
wantthe
thedesired
desiredconfidence
confidencelevel (1-)to
level (1-) to
beso
be sothat
thatthe
thedistance
distancebetween
betweenyour
yourestimate
estimateand
andthe
the
parameteris
parameter isless
lessthan
thanor
orequal
equalto
toB?
B?
•• What
Whatis isyour
yourestimate
estimateof
ofthe
thevariance
variance(or
(orstandard
standard
deviation)of
deviation) ofthe
thepopulation
populationin
inquestion?
question?
Thesample
The samplesize
sizedetermines
determinesthe
thebound
boundofofaastatistic,
statistic,since
sincethe
thestandard
standard
errorof
error ofaastatistic
statisticshrinks
shrinksas
asthe
thesample
samplesize
sizeincreases:
increases:
Sample size = 2n
Standard error
of statistic
Sample size = n
Standard error
of statistic
6-33
6-34
AAmarketing
marketingresearch
researchfirm
firmwants
wantstotoconduct
conductaasurvey
surveytotoestimate
estimatethe
theaverage
average
amountspent
amount spentononentertainment
entertainmentby byeach
eachperson
personvisiting
visitingaapopular
popularresort.
resort. The
The
peoplewho
people whoplan
planthe
thesurvey
surveywould
wouldlike
liketotodetermine
determinethe
theaverage
averageamount
amountspent
spentby
by
allpeople
all peoplevisiting
visitingthe
theresort
resorttotowithin
within$120,
$120,with
with95%
95%confidence.
confidence. From
Frompast
past
operationof
operation ofthe
theresort,
resort,ananestimate
estimateofofthe
thepopulation
populationstandard
standarddeviation
deviationisis
ss==$400.
$400. What
Whatisisthe
theminimum
minimumrequired
requiredsample
samplesize?
size?
zz 2
2
2
2
n
n 2
2
BB 2
2
((1
1..96
96)) ((400
400))
2 2
2 2
120
120 2
2
42
42..684 43
684 43
6-35
Themanufacturers
The manufacturersof ofaasports
sportscar
carwant
wanttotoestimate
estimatethe
theproportion
proportionof
ofpeople
peopleininaa
givenincome
given incomebracket
bracketwho
whoare
areinterested
interestedininthe
themodel.
model. The
Thecompany
companywants
wantstoto
knowthe
know thepopulation
populationproportion,
proportion,p,p,totowithin
within0.01
0.01with
with99%
99%confidence.
confidence. Current
Current
companyrecords
company recordsindicate
indicatethat
thatthe
theproportion
proportionppmaymaybebearound
around0.25.
0.25. What
Whatisisthe
the
minimumrequired
minimum requiredsample
samplesize
sizefor
forthis
thissurvey?
survey?
2
z2 pq
zpq
n
n B22
2
2
B
2
2 .576
2 ( 0.25)( 0.75)
2.576 (0.25)(0.75)
2
010
010
. . 2
124.42
125
124.42 125