Introduction to sample size and power calculations
How much chance do we have to reject the null hypothesis when the alternative is in fact true? (What's the probability of detecting a real effect?) Can we quantify how much power we have for given sample sizes?
Study 1: 263 cases, 1241 controls

[Figure: null distribution and clinically relevant alternative for the difference]

Null distribution: difference = 0. Rejection region: any value ≥ 6.5 (0 + 3.3×1.96). For a 5% significance level, the one-tail area = 2.5% (Z_α/2 = 1.96).
Clinically relevant alternative: difference = 10%.
Power = chance of being in the rejection region if the alternative is true = area to the right of this line (shown in yellow).
Study 1: 263 cases, 1241 controls

Rejection region: any value ≥ 6.5 (0 + 3.3×1.96).

Power here:
$$P\left(Z > \frac{6.5 - 10}{3.3}\right) = P(Z > -1.06) = 85\%$$

Power = chance of being in the rejection region if the alternative is true = area to the right of this line (shown in yellow).
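A quick numerical check of this slide's arithmetic, sketched in Python (assuming scipy is available; the 3.3 standard error is taken from the slide):

```python
from scipy.stats import norm

se = 3.3                 # standard error of the difference (from the slide)
crit = 0 + 1.96 * se     # critical value under the null: ~6.5
alt = 10                 # clinically relevant alternative difference
power = norm.sf((crit - alt) / se)  # area to the right of the critical value
print(round(crit, 1), round(power, 2))  # 6.5 0.86 (the slide rounds to ~85%)
```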
Study 1: 50 cases, 50 controls

With only 50 per group, the standard error of the difference grows to 10, so the critical value = 0 + 10×1.96 = 20 (Z_α/2 = 1.96; one-tail area = 2.5%). Power is closer to 15% now.
Study 2: 18 treated, 72 controls, SD = 2

Critical value = 0 + 0.52×1.96 ≈ 1.
Clinically relevant alternative: difference = 4 points.
Power is nearly 100%!
Study 2: 18 treated, 72 controls, SD = 10

Critical value = 0 + 2.58×1.96 ≈ 5.
Power is about 40%.
Study 2: 18 treated, 72 controls, effect size = 1.0

Critical value = 0 + 0.52×1.96 ≈ 1.
Clinically relevant alternative: difference = 1 point.
Power is about 50%.
Factors Affecting Power
1. Size of the effect
2. Standard deviation of the characteristic
3. Sample size
4. Significance level desired
1. Bigger difference from the null mean
[Figure: null vs. clinically relevant alternative distributions of average weight from samples of 100]

2. Bigger standard deviation
[Figure: average weight from samples of 100]

3. Bigger sample size
[Figure: average weight from samples of 100]

4. Higher significance level
[Figure: rejection region for average weight from samples of 100]
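To see all four factors at work, here is a small Python sketch (the numbers are illustrative, not the slides' weight data) that recomputes power while varying one factor at a time:

```python
from math import sqrt
from scipy.stats import norm

def power_two_means(diff, sd, n_per_group, alpha=0.05):
    """Power to detect `diff` between two equal-sized groups,
    using the slides' one-tail rejection region at alpha/2."""
    se = sqrt(2 * sd**2 / n_per_group)    # standard error of the difference
    crit = norm.ppf(1 - alpha / 2) * se   # critical value under the null
    return norm.sf((crit - diff) / se)    # area beyond it under the alternative

base = dict(diff=10, sd=25, n_per_group=100)
print(power_two_means(**base))                          # baseline: ~0.81
print(power_two_means(**{**base, "diff": 15}))          # 1. bigger effect: ~0.99
print(power_two_means(**{**base, "sd": 40}))            # 2. bigger SD: ~0.42
print(power_two_means(**{**base, "n_per_group": 200}))  # 3. bigger n: ~0.98
print(power_two_means(**base, alpha=0.10))              # 4. higher alpha: ~0.88
```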
Sample size calculations

Based on these elements, you can write a formal mathematical equation that relates power, sample size, effect size, standard deviation, and significance level.

**We will derive these formulas formally shortly.**
Simple formula for difference in means

$$n = \frac{2\sigma^2(Z_{power} + Z_{\alpha/2})^2}{\text{difference}^2}$$

where:
n = sample size in each group (assumes equal-sized groups)
σ = standard deviation of the outcome variable
difference = effect size (the difference in means)
Z_power represents the desired power (typically .84 for 80% power)
Z_α/2 represents the desired level of statistical significance (typically 1.96)
Simple formula for difference in proportions

$$n = \frac{2\bar{p}(1-\bar{p})(Z_{power} + Z_{\alpha/2})^2}{(p_1 - p_2)^2}$$

where:
n = sample size in each group (assumes equal-sized groups)
p̄(1−p̄) = a measure of variability (similar to standard deviation)
(p1 − p2) = effect size (the difference in proportions)
Z_power represents the desired power (typically .84 for 80% power)
Z_α/2 represents the desired level of statistical significance (typically 1.96)
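The two formulas translate directly into code. A minimal sketch (the function names are mine; scipy assumed), rounding up since sample sizes must be whole people:

```python
from math import ceil
from scipy.stats import norm

def n_per_group_means(sd, difference, power=0.80, alpha=0.05):
    """n per group for a difference in means: 2*sd^2*(Zpower+Za/2)^2/diff^2."""
    z_power = norm.ppf(power)          # ~.84 for 80% power
    z_alpha = norm.ppf(1 - alpha / 2)  # ~1.96 for alpha = .05
    return ceil(2 * sd**2 * (z_power + z_alpha)**2 / difference**2)

def n_per_group_props(p_bar, difference, power=0.80, alpha=0.05):
    """n per group for a difference in proportions:
    2*p(1-p)*(Zpower+Za/2)^2/(p1-p2)^2."""
    z_power = norm.ppf(power)
    z_alpha = norm.ppf(1 - alpha / 2)
    return ceil(2 * p_bar * (1 - p_bar) * (z_power + z_alpha)**2
                / difference**2)

# exact z-values round up slightly higher than the slides' .84/1.96:
print(n_per_group_means(sd=10, difference=3))         # 175 (slides: 174)
print(n_per_group_props(p_bar=0.5, difference=0.10))  # 393 (slides: 392)
```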
Derivation of sample size formula

Recall Study 2: 18 treated, 72 controls, effect size = 1.0; critical value = 0 + .52×1.96 ≈ 1; power close to 50%.
SAMPLE SIZE AND POWER FORMULAS

Critical value = 0 + standard error(diff) × Z_α/2

Power = area to the right of
$$Z = \frac{\text{critical value} - \text{alternative difference}}{\text{standard error(diff)}}$$

e.g., here:
$$Z = \frac{1 - 1}{\text{standard error(diff)}} = 0; \quad \text{power} \approx 50\%$$

Substituting for the critical value:
$$Z = \frac{Z_{\alpha/2} \times \text{standard error(diff)} - \text{difference}}{\text{standard error(diff)}} = Z_{\alpha/2} - \frac{\text{difference}}{\text{standard error(diff)}}$$

Power is the area to the right of this Z. Equivalently, power is the area to the left of
$$\frac{\text{difference}}{\text{standard error(diff)}} - Z_{\alpha/2}$$
Since normal tables give us the area to the left by convention, we use this second form to get the correct value.
$$Z_{power} = \frac{\text{difference}}{\text{standard error(diff)}} - Z_{\alpha/2} = -Z_\beta$$

Most textbooks just call this Z_β; I'll use the term Z_power to avoid confusion: the area to the left of Z_power equals the area to the right of Z_β.
All-purpose power formula

$$Z_{power} = \frac{\text{difference}}{\text{standard error(difference)}} - Z_{\alpha/2}$$
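A sketch of this all-purpose formula as a Python function (naming is mine; scipy assumed), checked against the Study 1 numbers above:

```python
from scipy.stats import norm

def power_from_se(difference, se_diff, alpha=0.05):
    """All-purpose power formula from the slides:
    Z_power = difference/s.e.(diff) - Z_alpha/2; power = area left of Z_power."""
    z_power = difference / se_diff - norm.ppf(1 - alpha / 2)
    return norm.cdf(z_power)

# Study 1: s.e.(diff) = 3.3, alternative difference = 10
print(power_from_se(10, 3.3))  # ~0.86, matching the slide's ~85%
# Study 1 with 50 cases / 50 controls: s.e.(diff) = 10
print(power_from_se(10, 10))   # ~0.17, the slide's "power closer to 15%"
```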
Derivation of a sample size formula

Sample size is embedded in the standard error:
$$s.e.(\text{diff}) = \sqrt{\frac{\sigma^2}{n_1} + \frac{\sigma^2}{n_2}}$$

If group 2 is r times the size of group 1:
$$s.e.(\text{diff}) = \sqrt{\frac{\sigma^2}{n_1} + \frac{\sigma^2}{rn_1}}$$
Algebra

$$Z_{power} = \frac{\text{difference}}{\sqrt{\dfrac{\sigma^2}{n_1} + \dfrac{\sigma^2}{rn_1}}} - Z_{\alpha/2}$$

$$Z_{power} + Z_{\alpha/2} = \frac{\text{difference}}{\sqrt{\dfrac{(r+1)\sigma^2}{rn_1}}}$$

$$(Z_{power} + Z_{\alpha/2})^2 = \frac{\text{difference}^2}{\dfrac{(r+1)\sigma^2}{rn_1}}$$

$$(r+1)\,\sigma^2(Z_{power} + Z_{\alpha/2})^2 = rn_1\,\text{difference}^2$$

$$n_1 = \frac{(r+1)}{r}\cdot\frac{\sigma^2(Z_{power} + Z_{\alpha/2})^2}{\text{difference}^2}$$

If r = 1 (equal groups), then
$$n_1 = \frac{2\sigma^2(Z_{power} + Z_{\alpha/2})^2}{\text{difference}^2}$$
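As a sanity check on the algebra, plugging the derived n1 back into the power formula should return exactly the requested power; a short sketch with illustrative numbers:

```python
from math import sqrt
from scipy.stats import norm

# illustrative inputs (not from the slides)
sd, diff, r, alpha, power = 10, 3, 1, 0.05, 0.80
z_p, z_a = norm.ppf(power), norm.ppf(1 - alpha / 2)

n1 = (r + 1) / r * sd**2 * (z_p + z_a)**2 / diff**2  # derived formula
se = sqrt(sd**2 / n1 + sd**2 / (r * n1))             # s.e. at that n1
print(norm.cdf(diff / se - z_a))                     # ~0.80, as requested
```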
Sample size formula for difference in means

$$n_1 = \frac{(r+1)}{r}\cdot\frac{\sigma^2(Z_{power} + Z_{\alpha/2})^2}{\text{difference}^2}$$

where:
n1 = size of smaller group
r = ratio of larger group to smaller group
σ = standard deviation of the characteristic
difference = clinically meaningful difference in means of the outcome
Z_power corresponds to power (.84 for 80% power)
Z_α/2 corresponds to two-tailed significance level (1.96 for α = .05)
Examples

Example 1: You want to calculate how much power you will have to see a difference of 3.0 IQ points between two groups: 30 male doctors and 30 female doctors. If you expect the standard deviation to be about 10 on an IQ test for both groups, then the standard error for the difference will be about:

$$\sqrt{\frac{10^2}{30} + \frac{10^2}{30}} \approx 2.57$$
Power formula

$$Z_{power} = \frac{d^*}{s.e.(d^*)} - Z_{\alpha/2} = \frac{d^*}{\sqrt{\dfrac{2\sigma^2}{n}}} - Z_{\alpha/2}$$

e.g., here:
$$Z_{power} = \frac{3}{\sqrt{\dfrac{2(10^2)}{30}}} - 1.96 = \frac{3}{2.57} - 1.96 \approx -.79$$

P(Z ≤ −.79) = .21; only 21% power to see a difference of 3 IQ points.
Example 2: How many people would you need to sample in each group to achieve power of 80% (corresponds to Z_power = .84)?

$$n = \frac{2\sigma^2(Z_{power} + Z_{\alpha/2})^2}{(d^*)^2} = \frac{2(10^2)(.84 + 1.96)^2}{(3)^2} \approx 174$$

174/group; 348 altogether.
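Both examples can be reproduced in a few lines (scipy assumed; numbers from the slides):

```python
from scipy.stats import norm

# Example 1: power for d* = 3 IQ points, sd = 10, 30 per group
se = (10**2 / 30 + 10**2 / 30) ** 0.5   # ~2.58
print(norm.cdf(3 / se - 1.96))          # ~0.21, i.e., 21% power

# Example 2: n per group for 80% power (Z_power = .84)
print(2 * 10**2 * (0.84 + 1.96) ** 2 / 3**2)  # ~174.2 -> 174/group
```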
Sample size needed for comparing two proportions:

Example: I am going to run a case-control study to determine if pancreatic cancer is linked to drinking coffee. If I want 80% power to detect a 10% difference in the proportion of coffee drinkers among cases vs. controls (if coffee drinking and pancreatic cancer are linked, we would expect that a higher proportion of cases would be coffee drinkers than controls), how many cases and controls should I sample? About half the population drinks coffee.
Derivation of a sample size formula:

The standard error of the difference of two proportions:
$$\sqrt{\frac{\hat{p}(1-\hat{p})}{n_1} + \frac{\hat{p}(1-\hat{p})}{n_2}}$$

Here, if we assume equal sample sizes and that, under the null hypothesis, the proportion of coffee drinkers is .5 in both cases and controls, then:
$$s.e.(\text{diff}) = \sqrt{\frac{.5(1-.5)}{n} + \frac{.5(1-.5)}{n}} = \sqrt{.5/n}$$
$$Z_{power} = \frac{\text{test statistic}}{s.e.(\text{test statistic})} - Z_{\alpha/2}$$

Here:
$$Z_{power} = \frac{.10}{\sqrt{.5/n}} - 1.96$$

For 80% power:
$$.84 = \frac{.10}{\sqrt{.5/n}} - 1.96 \quad\Rightarrow\quad .84 + 1.96 = \frac{.10}{\sqrt{.5/n}}$$

Squaring both sides and solving for n:
$$n = \frac{.5(.84 + 1.96)^2}{(.10)^2} = 392$$
(There is 80% area to the left of a Z-score of .84 on a standard normal curve; therefore, there is 80% area to the right of −.84, which is why Z_power = .84 corresponds to 80% power.)
Would take 392 cases and 392 controls to have 80% power!
Total=784
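A quick check that n = 392 per group does deliver 80% power under these assumptions (p = .5 in both groups, 10% difference):

```python
from math import sqrt
from scipy.stats import norm

n = 392
se = sqrt(0.5 * 0.5 / n + 0.5 * 0.5 / n)  # = sqrt(.5/n)
print(norm.cdf(0.10 / se - 1.96))         # ~0.80
```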
Question 2:
How many total cases and controls would I
have to sample to get 80% power for the
same study, if I sample 2 controls for
every case?
Ask yourself, what changes here?
$$Z_{power} = \frac{\text{test statistic}}{s.e.(\text{test statistic})} - Z_{\alpha/2}$$

What changes is the standard error. With different-size groups (2 controls per case):
$$s.e.(\text{diff}) = \sqrt{\frac{\hat{p}(1-\hat{p})}{n} + \frac{\hat{p}(1-\hat{p})}{2n}} = \sqrt{\frac{.25}{n} + \frac{.25}{2n}} = \sqrt{\frac{.5}{2n} + \frac{.25}{2n}} = \sqrt{.75/2n}$$
$$.84 = \frac{.10}{\sqrt{.75/2n}} - 1.96 \quad\Rightarrow\quad .84 + 1.96 = \frac{.10}{\sqrt{.75/2n}}$$

$$(.10)^2(2n) = (.84 + 1.96)^2(.75) \quad\Rightarrow\quad n = \frac{.75(.84 + 1.96)^2}{2(.10)^2} = 294$$

Need: 294 cases and 2×294 = 588 controls. 882 total.
Note: you get the best power for the lowest sample size if you keep both groups equal (882 > 784).
You would only want to make groups unequal if there was an obvious difference in the cost or ease of
collecting data on one group. E.g., cases of pancreatic cancer are rare and take time to find.
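A sketch comparing the two designs (the function name is mine; p = .5 and the 10% difference come from the example), showing that equal groups reach 80% power with fewer total subjects:

```python
from math import sqrt
from scipy.stats import norm

def power_case_control(n_cases, ratio, p=0.5, diff=0.10, alpha=0.05):
    """Power with `ratio` controls per case, proportion ~p in both
    groups under the null (the slides' coffee example)."""
    se = sqrt(p * (1 - p) / n_cases + p * (1 - p) / (ratio * n_cases))
    return norm.cdf(diff / se - norm.ppf(1 - alpha / 2))

print(power_case_control(392, ratio=1))  # ~0.80 with 392+392 = 784 total
print(power_case_control(294, ratio=2))  # ~0.80 with 294+588 = 882 total
```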
General sample size formula

$$s.e.(\text{diff}) = \sqrt{\frac{\hat{p}(1-\hat{p})}{rn} + \frac{\hat{p}(1-\hat{p})}{n}} = \sqrt{\frac{\hat{p}(1-\hat{p}) + r\hat{p}(1-\hat{p})}{rn}} = \sqrt{\frac{(r+1)\,\hat{p}(1-\hat{p})}{rn}}$$

$$n = \frac{r+1}{r}\cdot\frac{\hat{p}(1-\hat{p})(Z_{power} + Z_{\alpha/2})^2}{(p_1 - p_2)^2}$$
General sample size needs when outcome is binary:

$$n = \frac{r+1}{r}\cdot\frac{\bar{p}(1-\bar{p})(Z_{power} + Z_{\alpha/2})^2}{(p_1 - p_2)^2}$$

where:
n = size of smaller group
r = ratio of larger group to smaller group
p̄(1−p̄) = a measure of variability (similar to standard deviation)
p1 − p2 = clinically meaningful difference in proportions of the outcome
Z_power corresponds to power (.84 for 80% power)
Z_α/2 corresponds to two-tailed significance level (1.96 for α = .05)
Compare with when outcome is continuous:

$$n_1 = \frac{(r+1)}{r}\cdot\frac{\sigma^2(Z_{power} + Z_{\alpha/2})^2}{\text{difference}^2}$$

where:
n1 = size of smaller group
r = ratio of larger group to smaller group
σ = standard deviation of the characteristic
difference = clinically meaningful difference in means of the outcome
Z_power corresponds to power (.84 for 80% power)
Z_α/2 corresponds to two-tailed significance level (1.96 for α = .05)
Question
How many subjects would we need to
sample to have 80% power to detect an
average increase in MCAT biology score
of 1 point, if the average change without
instruction (just due to chance) is plus or
minus 3 points (=standard deviation of
change)?
Standard error here:
$$\frac{\sigma_{change}}{\sqrt{n}} = \frac{3}{\sqrt{n}}$$
$$Z_{power} = \frac{\text{test statistic}}{s.e.(\text{test statistic})} - Z_{\alpha/2} = \frac{D}{\sigma/\sqrt{n}} - Z_{\alpha/2}$$

where D = change from test 1 to test 2 (the difference).

Solving for n:
$$\sqrt{n}\,D = \sigma(Z_{power} + Z_{\alpha/2}) \quad\Rightarrow\quad n = \frac{\sigma^2(Z_{power} + Z_{\alpha/2})^2}{D^2}$$

Therefore, need: (9)(1.96 + .84)²/1 ≈ 70 people total.
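The MCAT arithmetic in code (numbers from the slide; note the exact value rounds up to 71 in practice):

```python
# paired design: n = sd_change^2 * (Z_power + Z_alpha/2)^2 / D^2
sd_change, D = 3, 1
print(sd_change**2 * (1.96 + 0.84) ** 2 / D**2)  # 70.56 -> the slide's ~70
```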
Sample size for paired data:

$$n = \frac{\sigma_d^2(Z_{power} + Z_{\alpha/2})^2}{\text{difference}^2}$$

where:
n = sample size
σ_d = standard deviation of the within-pair difference
difference = clinically meaningful difference
Z_power corresponds to power (.84 for 80% power)
Z_α/2 corresponds to two-tailed significance level (1.96 for α = .05)
Paired data, difference in proportions: sample size:

$$n = \frac{\bar{p}(1-\bar{p})(Z_{power} + Z_{\alpha/2})^2}{(p_1 - p_2)^2}$$

where:
n = sample size for one group
p1 − p2 = clinically meaningful difference in dependent proportions
Z_power corresponds to power (.84 for 80% power)
Z_α/2 corresponds to two-tailed significance level (1.96 for α = .05)
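A sketch of this paired-proportion formula as a function (the example values below are hypothetical, not from the slides):

```python
from math import ceil
from scipy.stats import norm

def n_paired_props(p_bar, diff, power=0.80, alpha=0.05):
    """n = p(1-p)(Z_power + Z_alpha/2)^2 / (p1 - p2)^2, rounded up."""
    z = norm.ppf(power) + norm.ppf(1 - alpha / 2)
    return ceil(p_bar * (1 - p_bar) * z**2 / diff**2)

# hypothetical example: dependent proportions p1 = .60, p2 = .50 (p_bar = .55)
print(n_paired_props(p_bar=0.55, diff=0.10))  # ~195
```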