
Multiple Comparisons Testing

Introduction
In the previous section we discussed the steps to perform an ANOVA and the procedures to test the assumptions of an ANOVA in R. Suppose now that we reject H0 and conclude that at least one of the population group means differs from the others. Today we consider the second step, which is finding the specific groups that differ from the rest in terms of their population means.

Multiple comparisons testing


Before the multiple comparisons procedure is discussed, we consider a problem that arises when multiple
independent tests are performed. It will be shown that as the number of independent hypothesis tests
increases, the probability of a Type I error increases. We will consider one method to limit the risk of
rejecting a true null hypothesis. Thereafter, the procedure to perform multiple comparisons testing is
discussed.

Inflation of the type I error


Generally, the t-test is used to test the hypothesis that two populations’ means are equal. If an ANOVA
indicates that at least one group’s population mean differs from the others, we would like to be able to
determine which groups differ significantly from each other. It may seem reasonable to perform all possible
t-tests to achieve this. However, in the section on Multiple Regression we outlined a problem called alpha
spending. This is an inflation of the Type I error rate, thereby making it more likely to falsely reject the
null hypothesis. If we would like the overall error rate to remain fixed at α, then we have to make some kind
of adjustment to the significance level we use for each pairwise comparison.
Consider testing, for i, j = 1, 2, ..., k with i ≠ j, the following hypothesis:

H0 : µi = µj
H1 : µi ≠ µj.

If there are k groups, there are m = k(k − 1)/2 (i.e., k choose 2) pairwise tests. Suppose each of these tests is performed at a level of significance α, and denote the null hypothesis of test i by H0^(i), i = 1, 2, ..., m. Furthermore, assume that none of the groups differ, i.e., H0^(i) is true for all i = 1, 2, ..., m. We might expect the probability of falsely concluding that at least one group differs from the rest to be α. However, assuming the tests are independent,
$$
\begin{aligned}
P\left(\text{Reject at least one } H_0^{(i)} \,\middle|\, \text{all } H_0^{(i)} \text{ true}\right)
&= 1 - P\left(\text{Do not reject any } H_0^{(i)} \,\middle|\, \text{all } H_0^{(i)} \text{ true}\right) \\
&= 1 - P\left(\bigcap_{i=1}^{m} \text{Do not reject } H_0^{(i)} \,\middle|\, H_0^{(i)} \text{ is true}\right) \\
&= 1 - \prod_{i=1}^{m} P\left(\text{Do not reject } H_0^{(i)} \,\middle|\, H_0^{(i)} \text{ is true}\right) \\
&= 1 - (1 - \alpha)^m,
\end{aligned}
$$

where the product in the third line follows from the assumed independence of the tests. Thus, performing m independent hypothesis tests leads to an overall level of significance of 1 − (1 − α)^m. The figure below shows this function for a varying number of tests, m.

library(ggplot2)
library(ggpubr)

M = 50
overall_significance = function(m, alpha){
  adjusted_alpha = 1 - (1 - alpha)^m
  return(adjusted_alpha)
}
alpha = 0.05
m = 1:M
alpha_adj = sapply(m, overall_significance, alpha = alpha)
df_plot = data.frame('m' = m, 'AdjustedAlpha' = alpha_adj)

continuous_m = seq(1, M, by = 0.1)
adj_alpha_continuous = sapply(continuous_m, overall_significance, alpha = alpha)
df_plot_c = data.frame('m' = continuous_m, 'AdjustedAlpha' = adj_alpha_continuous)

ggplot(data = df_plot, aes(x = m, y = AdjustedAlpha)) +
  geom_point(size = 1, alpha = 0.6) +
  geom_line(data = df_plot_c, aes(x = m, y = AdjustedAlpha)) +
  xlab('Number of tests') +
  ylab('Adjusted level of significance') +
  theme_pubr() +
  scale_x_continuous(breaks = c(1, seq(5, M, by = 5))) +
  scale_y_continuous(breaks = seq(0, 1, by = 0.1))

[Figure: the overall level of significance, 1 − (1 − α)^m, plotted against the number of tests m = 1, ..., 50 for α = 0.05; the curve rises steeply from 0.05 towards 1.]

It is clear that some form of adjustment must be made to maintain the required level of significance. One such method is the Bonferroni method, which is discussed next.

The Bonferroni adjustment

If we perform each of m tests at a level of significance α*, then the overall probability of a type I error is α = 1 − (1 − α*)^m. It can be shown that (1 − α*)^m ≈ 1 − mα* for small α*, and therefore α ≈ mα*. Hence, if we want an overall level of significance of α_specified, we should perform each individual test at

$$
\alpha_{\text{adjusted}} = \frac{\alpha_{\text{specified}}}{m}.
$$

Each test is then performed using the decision rule: reject H0 if p-value < α_adjusted. Equivalently, we can keep α_specified and adjust the p-values instead, i.e., reject H0 if m × p-value < α_specified. This adjustment is known as the Bonferroni adjustment.
Recall that, keeping all other parameters equal, decreasing the level of significance decreases the power of a test. These adjustments therefore keep the overall significance level fixed, but at the cost of reduced power for each individual test. Consequently, since the power is decreased, the probability of a type II error increases.

Recap: Two sample tests

Notice that if we have two groups, the hypothesis test of an ANOVA is

H0 : µ1 = µ2
H1 : µ1 ≠ µ2.

Furthermore, we assume that the two samples are independent and that the population variances are equal. For such a test, the test statistic is

$$
t_{\text{calc}} = \frac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{s_p^2\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}} \sim t(n_1 + n_2 - 2).
$$

Notice that

$$
s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2} = \frac{\sum_{j=1}^{2}(n_j - 1)s_j^2}{n - k} = MSE.
$$
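As a quick sanity check of the pooled-variance formula, one can compare a manual computation against R's t.test() with var.equal = TRUE. The data below are simulated and purely illustrative.

```r
# Simulated data for two groups (illustrative only)
set.seed(1)
x1 = rnorm(20, mean = 600, sd = 90)
x2 = rnorm(20, mean = 650, sd = 90)

# Manual pooled variance and test statistic
n1 = length(x1); n2 = length(x2)
sp2 = ((n1 - 1) * var(x1) + (n2 - 1) * var(x2)) / (n1 + n2 - 2)
t_manual = (mean(x1) - mean(x2)) / sqrt(sp2 * (1/n1 + 1/n2))

# Built-in equal-variance t-test gives the same statistic and degrees of freedom
fit = t.test(x1, x2, var.equal = TRUE)
all.equal(unname(fit$statistic), t_manual)   # TRUE
unname(fit$parameter) == n1 + n2 - 2         # TRUE
```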

Pairwise two-sample tests

Consider k groups for which we perform the pairwise tests

H0 : µi = µj
H1 : µi ≠ µj

for i, j = 1, 2, ..., k with i ≠ j. Assume each test must be performed at a level of significance α. Note that there are m = k(k − 1)/2 such tests.

Step 1: Adjust the level of significance with α_new = α/m.

Step 2: Calculate the MSE with

$$
MSE = \frac{\sum_{j=1}^{k}(n_j - 1)s_j^2}{n - k}.
$$

Step 3: Compute the test statistic as

$$
t_{\text{calc}} = \frac{(\bar{x}_i - \bar{x}_j) - (\mu_i - \mu_j)}{\sqrt{MSE\left(\frac{1}{n_i} + \frac{1}{n_j}\right)}}.
$$

Step 4: Compute a critical value as t_{α_new/2}(n − k), or a p-value as 2 × P(T_{n−k} > |t_calc|).
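The four steps above can be sketched as a small helper that works directly from group summary statistics. The function name and the summary values passed to it are made up for illustration.

```r
# Bonferroni-adjusted pairwise test from group summaries (illustrative sketch)
pairwise_from_summaries = function(n_vec, mean_vec, var_vec, i, j, alpha = 0.05) {
  k = length(n_vec)
  n = sum(n_vec)
  m = choose(k, 2)                              # number of pairwise tests
  alpha_new = alpha / m                         # Step 1: adjusted level
  mse = sum((n_vec - 1) * var_vec) / (n - k)    # Step 2: pooled MSE
  t_calc = (mean_vec[i] - mean_vec[j]) /        # Step 3: test statistic
    sqrt(mse * (1/n_vec[i] + 1/n_vec[j]))
  p_value = 2 * pt(abs(t_calc), df = n - k, lower.tail = FALSE)  # Step 4
  list(t_calc = t_calc, p_value = p_value, reject = p_value < alpha_new)
}

# Example call with made-up summaries for k = 3 groups of size 20
res = pairwise_from_summaries(n_vec = c(20, 20, 20),
                              mean_vec = c(577.55, 608.65, 653),
                              var_vec = c(10000, 8000, 8700),
                              i = 1, j = 3)
```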

Confidence interval for the difference between two means

Note that since

$$
t = \frac{(\bar{x}_i - \bar{x}_j) - (\mu_i - \mu_j)}{\sqrt{MSE\left(\frac{1}{n_i} + \frac{1}{n_j}\right)}} \sim t(n - k),
$$

it follows that

$$
P\left(-t_{\alpha/2,\,n-k} < \frac{(\bar{x}_i - \bar{x}_j) - (\mu_i - \mu_j)}{\sqrt{MSE\left(\frac{1}{n_i} + \frac{1}{n_j}\right)}} < t_{\alpha/2,\,n-k}\right) = 1 - \alpha.
$$

Hence, a 100(1 − α)% confidence interval for (µi − µj), with the Bonferroni adjustment applied across the m pairwise comparisons, is

$$
\text{conf}_{1-\alpha}(\mu_i - \mu_j) = (\bar{x}_i - \bar{x}_j) \pm t_{\frac{\alpha}{2m},\,n-k}\sqrt{MSE\left(\frac{1}{n_i} + \frac{1}{n_j}\right)}.
$$

If this confidence interval includes zero, we conclude at the α level of significance that there is insufficient evidence that µi and µj differ. This is the relationship between two-sided hypothesis tests and confidence intervals.
Consider the data of the advertisements and the number of juices sold. Suppose we want to compare the
average number of juices sold for the convenience and quality groups. Let C denote the convenience group
and let Q denote the quality group. We have

nC = nQ = 20 ; x̄C = 577.55 ; x̄Q = 653 ; MSE = 8894.447

Suppose we test the following hypothesis at a 5% level of significance:

H0 : µC − µQ = 0
H1 : µC − µQ ≠ 0.

The test statistic is given by

$$
\begin{aligned}
t_{\text{calc}} &= \frac{(\bar{x}_C - \bar{x}_Q) - (\mu_C - \mu_Q)}{\sqrt{MSE\left(\frac{1}{n_C} + \frac{1}{n_Q}\right)}} \\
&= \frac{(577.55 - 653) - 0}{\sqrt{8894.447\left(\frac{1}{20} + \frac{1}{20}\right)}} \\
&= -2.5299.
\end{aligned}
$$

Since there are in fact three tests, we must use the Bonferroni-adjusted level of significance, αadj = 0.05/3 = 0.0167. Therefore, the critical value is ±tcrit = ±2.1808 such that P(T57 > tcrit) = 0.0167. Using the critical value approach, we reject H0 at the 5% significance level since |tcalc| = 2.5299 > 2.1808 = |tcrit|.

The p-value of the test is 2 × P(T57 > |tcalc|) = 2 × P(T57 > 2.5299) = 0.0142. The p-value is less than αadj = 0.0167, and therefore we reject the null hypothesis at the 5% level of significance. Note that we can also compute the Bonferroni-adjusted p-value, 3 × 0.0142 = 0.043, which can be compared to the original level of significance, α = 0.05.
Finally, the 95% confidence interval for the difference between the population means is given by

$$
\begin{aligned}
\text{conf}_{0.95}(\mu_C - \mu_Q) &= (\bar{x}_C - \bar{x}_Q) \pm t_{\text{crit}}\sqrt{MSE\left(\frac{1}{n_C} + \frac{1}{n_Q}\right)} \\
&= (577.55 - 653) \pm 2.1808\sqrt{8894.447\left(\frac{1}{20} + \frac{1}{20}\right)} \\
&= [-140.4917;\ -10.4083].
\end{aligned}
$$

Since the confidence interval does not include zero, we conclude again that there is a significant difference between the population average number of juices sold for the convenience and quality advertisements.
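The arithmetic in this example is easy to reproduce in R from the summary values quoted above; the critical value 2.1808 is taken from the text.

```r
# Summary values from the example
n_C = 20; n_Q = 20
xbar_C = 577.55; xbar_Q = 653
mse = 8894.447
se = sqrt(mse * (1/n_C + 1/n_Q))

# Test statistic (about -2.5299)
t_calc = (xbar_C - xbar_Q - 0) / se

# Two-sided p-value on n - k = 57 degrees of freedom
p_value = 2 * pt(abs(t_calc), df = 57, lower.tail = FALSE)

# Confidence interval using the critical value quoted in the text
# (about (-140.49, -10.41))
t_crit = 2.1808
ci = (xbar_C - xbar_Q) + c(-1, 1) * t_crit * se
```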

Performing multiple comparisons in R


We consider the same dataset as in the previous examples: ExampleDataNarrow.txt.
Import the data and print the structure.

# Read in the data (the arguments may need adjusting for your copy of the file)
dat = read.table('ExampleDataNarrow.txt', header = TRUE, stringsAsFactors = TRUE)
str(dat)

## ’data.frame’: 60 obs. of 2 variables:


## $ Population: Factor w/ 3 levels "Convenience",..: 1 1 1 1 1 1 1 1 1 1 ...
## $ Sales : int 529 658 793 514 663 719 711 606 461 529 ...

Pairwise hypothesis tests

# No adjustment to p-values
pairwise.t.test(x = dat$Sales, g = dat$Population,
p.adjust.method = 'none')

##
## Pairwise comparisons using t tests with pooled SD
##
## data: dat$Sales and dat$Population
##
## Convenience Price
## Price 0.301 -
## Quality 0.014 0.143
##
## P value adjustment method: none

# Bonferroni adjustment
pairwise.t.test(x = dat$Sales, g = dat$Population,
p.adjust.method = 'bonferroni')

##
## Pairwise comparisons using t tests with pooled SD
##
## data: dat$Sales and dat$Population
##
## Convenience Price
## Price 0.904 -
## Quality 0.043 0.428
##
## P value adjustment method: bonferroni

Note that one way to visualise the different groups would be a side-by-side box plot. Below is an example
of another plot where we plot the confidence intervals of each group.

# Get means of each group
convMean = mean(dat$Sales[dat$Population == 'Convenience'])
qualMean = mean(dat$Sales[dat$Population == 'Quality'])
priceMean = mean(dat$Sales[dat$Population == 'Price'])
means = c(convMean, qualMean, priceMean)

# Get MSE
convVar = var(dat$Sales[dat$Population == 'Convenience'])
qualVar = var(dat$Sales[dat$Population == 'Quality'])
priceVar = var(dat$Sales[dat$Population == 'Price'])
vars = c(convVar, qualVar, priceVar)
n_vec = rep(20, 3)
mse = sum((n_vec - 1)*vars)/(sum(n_vec) - length(n_vec))

# Get CI for each group (no adjustment)
alpha = 0.05
t_crit = qt(1 - alpha/2, df = 57)
error = t_crit*sqrt(mse*(1/20 + 1/20))

# Make plot
df_plot = data.frame('Population' = as.factor(c('Convenience', 'Quality', 'Price')))
df_plot$Mean = means
df_plot$error = rep(error, length.out = nrow(df_plot))

ggplot(data = df_plot, aes(x = Population, y = Mean)) +
  geom_errorbar(aes(ymin = Mean - error,
                    ymax = Mean + error),
                width = 0.1) +
  geom_point(shape = 1, size = 3, fill = 'white') +
  xlab('Population') +
  ylab('Sales') +
  theme_pubr()

[Figure: mean Sales for each Population group (Convenience, Price, Quality), with error bars showing the confidence interval around each mean.]