
Design and Analysis of Experiments

11 - Blocking

Version 2.11

Felipe Campelo
http://orcslab.cpdee.ufmg.br/

Graduate Program in Electrical Engineering

Belo Horizonte
May 2015
“Science doesn’t always go forwards. It’s a bit
like doing a Rubik’s cube. You sometimes have
to make more of a mess with a Rubik’s cube
before you can get it to go right.”

Jocelyn Bell Burnell (1943 – )
Irish astrophysicist

Image: http://www.topbritishinnovations.org/PastInnovations/DiscoveryofPulsars.aspx
Blocking
Nuisance factors

In the completely randomized design (CRD) introduced in the last chapter, the observations are assigned - as the name implies - in a completely random manner.

Implicitly, we assume that the experimental conditions are homogeneous, i.e., that no other important effects are present besides those of the experimental factor.

In some situations, however, the experiment may present other factors that are not of immediate interest, but that can influence the response variable. This was already discussed when we introduced the concept of pairing in Chapter 7.
Blocking
Nuisance factors

The generalization of the paired design to an arbitrary number of factor levels is called blocking, which is an elegant way to configure the data collection in order to enable the modeling and exclusion of the effects of nuisance factors.

In fact, it can be argued that the systematic use of blocking is beneficial even in the absence of known nuisance factors, as it helps boost statistical power by excluding the between-replicates variability from the residual term.

"Block what you can, randomize what you cannot."
George E.P. Box (1919–2013)
British statistician

Image: https://www.engr.wisc.edu/eday/eday1990.html
Blocking
Blocking versus Randomizing

The idea behind randomization is to prevent unknown factors from biasing our observations.

Blocking comes into play whenever we know from the beginning that certain factors can influence our response variable, but for some reason we are not interested in their effects^a. For instance:

the effect of different batches of raw material when comparing the performance of chemical reactions;

the effect of different benchmark problems when comparing the performance of computer algorithms.

^a If we're interested in their effects, they are no longer nuisance factors, but additional experimental factors. We'll deal with factorial designs in the next chapter.
Example: algorithm comparison
Problem definition

A Ph.D. student decides to compare a standard optimization algorithm with six modified versions for the solution of a certain family of Vehicle Routing Problems (VRPs). His intention is to verify whether any of the modified versions is systematically better than the standard one.

The algorithms are applied to the solution of 180 problem instances, divided into 36 groups of five homogeneous instances. The algorithms are run 30 times on each instance, and the cost value found after each run is recorded.

The student wants an experiment with α = 0.05 and power π = 1 − β = 0.8 for a minimally interesting effect δ* = 50.

Image: PhD Comics by Jorge Cham: http://www.phdcomics.com/comics/archive.php?comicid=1139
Example: algorithm comparison
Nuisance factor

Besides the effect of the experimental factor (Algorithm), we have a possible nuisance effect due to the variability between experimental units (instance groups).

If we employ a CRD, the residual term would contain all the between-instances variability, which would almost certainly mask any between-algorithms effect of interest.

Since we can control the allocation of algorithm runs within each instance, one possibility is to do it in a systematic way, so that we can model the instance effects (in addition to the factor effect).
Example: algorithm comparison
Randomized complete block design

Within each block, the order of execution should in principle be randomized, so as to prevent other unknown factors from affecting our analysis^b.

This experimental setup is known as a randomized complete block design (RCBD) with one blocking factor.

^b For this kind of computer experiment the within-block randomization is unnecessary. It is very important, however, when dealing with systems that are less controllable.
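Purely as an illustration (not part of the original example), within-block randomization of the execution order could be generated along the following lines; the algorithm labels and the number of blocks are taken from the VRP example:

# Sketch: one random permutation of the 7 algorithms per block (36 blocks)
> set.seed(2015)
> algorithms <- c("Original", paste0("Mod", 1:6))
> run.order  <- lapply(1:36, function(block) sample(algorithms))
> run.order[[1]]   # randomized execution order within the first block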
Randomized complete block design
Assumptions

The RCBD assumes:

one replicate per block^c;

independent blocks;

independent within-block randomization.

Failure to account for dependency structures (i.e., using blocks that are not independent) can result in pseudoreplication, i.e., the use of the incorrect number of degrees-of-freedom in the test statistics, which usually results in an inflation of the type-I error rate.

^c For multiple within-block replicates the appropriate design is known as the Generalized Randomized Block Design (GRBD), which can be analyzed using a different model.
Example: algorithm comparison
RCBD

To comply with these requirements, we consider as our response variable the average performance of an algorithm on a particular instance group.

The performance of each algorithm will be averaged within each individual instance (mean of 30 runs), and then within each group (mean of 5 instances), so that each algorithm has 36 independent observations.
Example: algorithm comparison
Data loading and preprocessing

# Load data
> data <- read.table("../data files/algo.csv", header = TRUE)

# Aggregate data (algorithm means by instance group)
> aggdata <- with(data, aggregate(x   = Result,
                                  by  = list(Algorithm, Group),
                                  FUN = mean))

# Rename columns and coerce factor variables
> names(aggdata) <- c("Algorithm", "Instance_Group", "Y")
> for (i in 1:2) aggdata[, i] <- as.factor(aggdata[, i])

> levels(aggdata$Algorithm) <- c("Original",
                                 unlist(lapply(X   = "Mod",
                                               FUN = paste0, 1:6)))
> summary(aggdata)
    Algorithm   Instance_Group        Y
 Original:36     1      :  7    Min.   : 227.3
 Mod1    :36     2      :  7    1st Qu.: 440.0
 Mod2    :36     3      :  7    Median : 756.8
 Mod3    :36     4      :  7    Mean   : 821.0
 Mod4    :36     5      :  7    3rd Qu.:1020.5
 Mod5    :36     6      :  7    Max.   :2743.6
 Mod6    :36     (Other):210
Example: algorithm comparison
Exploratory data analysis

> library(ggplot2)
> p <- ggplot(aggdata, aes(x = Instance_Group, y = Y,
                           group = Algorithm, colour = Algorithm))
> p + geom_line(linetype = 2) + geom_point(size = 5)


Analysis of the RCBD
Statistical model

In the general case, we have a levels of the experimental factor and b levels of the blocking factor. The statistical model is given by:

$$y_{ij} = \mu + \tau_i + \beta_j + \epsilon_{ij}, \quad \begin{cases} i = 1,\dots,a \\ j = 1,\dots,b \end{cases} \qquad \text{assuming } \epsilon_{ij} \sim \mathcal{N}\left(0, \sigma^2\right)$$

Similarly to the CRD we have, by construction:

$$\sum_{i=1}^{a} \tau_i = 0 \qquad\qquad \sum_{j=1}^{b} \beta_j = 0$$
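To make the notation concrete, a minimal simulation sketch (not from the original slides; all parameter values are hypothetical) generates one data set from this model and fits the corresponding ANOVA:

# Sketch: simulate y_ij = mu + tau_i + beta_j + eps_ij with hypothetical values
> set.seed(1234)
> a <- 7; b <- 36; mu <- 800
> tau  <- rnorm(a, sd = 20);  tau  <- tau  - mean(tau)    # enforce sum(tau)  = 0
> beta <- rnorm(b, sd = 300); beta <- beta - mean(beta)   # enforce sum(beta) = 0
> sim <- expand.grid(Algorithm = factor(1:a), Block = factor(1:b))
> sim$Y <- mu + tau[as.integer(sim$Algorithm)] +
           beta[as.integer(sim$Block)] + rnorm(a * b, sd = 40)
> summary(aov(Y ~ Algorithm + Block, data = sim))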
Analysis of the RCBD
Statistical model

In the RCBD, we are interested only in the effect of the experimental factor. Consequently, the test hypotheses refer only to its coefficients:

$$\begin{cases} H_0: \tau_i = 0, \;\; \forall i = 1,\dots,a \\ H_1: \exists\, \tau_i \neq 0 \end{cases}$$

The partition of the total sample variability is given by:

$$SS_T = \sum_{i=1}^{a}\sum_{j=1}^{b} \left(y_{ij} - \bar{y}_{\cdot\cdot}\right)^2 = \underbrace{b\sum_{i=1}^{a} \left(\bar{y}_{i\cdot} - \bar{y}_{\cdot\cdot}\right)^2}_{SS_{levels}} + \underbrace{a\sum_{j=1}^{b} \left(\bar{y}_{\cdot j} - \bar{y}_{\cdot\cdot}\right)^2}_{SS_{blocks}} + \underbrace{\sum_{i=1}^{a}\sum_{j=1}^{b} \left(y_{ij} - \bar{y}_{i\cdot} - \bar{y}_{\cdot j} + \bar{y}_{\cdot\cdot}\right)^2}_{SS_E}$$
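As a sketch (assuming the aggdata frame built earlier in this chapter), the partition can be verified numerically:

# Sketch: compute the sums of squares by hand and check the partition
> a <- nlevels(aggdata$Algorithm)
> b <- nlevels(aggdata$Instance_Group)
> ybar   <- mean(aggdata$Y)
> ybar.i <- tapply(aggdata$Y, aggdata$Algorithm, mean)        # level means
> ybar.j <- tapply(aggdata$Y, aggdata$Instance_Group, mean)   # block means
> SST      <- sum((aggdata$Y - ybar)^2)
> SSlevels <- b * sum((ybar.i - ybar)^2)
> SSblocks <- a * sum((ybar.j - ybar)^2)
> SSE <- sum((aggdata$Y - ybar.i[aggdata$Algorithm] -
              ybar.j[aggdata$Instance_Group] + ybar)^2)
> all.equal(SST, SSlevels + SSblocks + SSE)   # the partition should hold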
Analysis of the RCBD
Statistical model

The mean squares are given by:

$$MS_{levels} = \frac{SS_{levels}}{a-1} \qquad MS_{blocks} = \frac{SS_{blocks}}{b-1} \qquad MS_E = \frac{SS_E}{(a-1)(b-1)}$$

and their expected values are:

$$E\left[MS_{levels}\right] = \sigma^2 + \frac{b\sum_{i=1}^{a}\tau_i^2}{a-1} \qquad E\left[MS_{blocks}\right] = \sigma^2 + \frac{a\sum_{j=1}^{b}\beta_j^2}{b-1} \qquad E\left[MS_E\right] = \sigma^2$$
Analysis of the RCBD
Statistical model

$$E\left[MS_{levels}\right] = \sigma^2 + \frac{b\sum_{i=1}^{a}\tau_i^2}{a-1} \qquad\qquad E\left[MS_E\right] = \sigma^2$$

Under the null hypothesis we have that the test statistic

$$F_0 = \frac{MS_{levels}}{MS_E}$$

is distributed according to an F distribution with (a − 1) numerator degrees-of-freedom and (a − 1)(b − 1) denominator degrees-of-freedom.

The critical region for the test is therefore given by

$$f_0 > F_{\alpha;\,(a-1),\,(a-1)(b-1)}$$
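Continuing the sketch above (same assumed objects a, b, SSlevels and SSE), the test statistic and critical value can be obtained directly:

# Sketch: F statistic, critical value and p-value for the RCBD
> alpha    <- 0.05
> MSlevels <- SSlevels / (a - 1)
> MSE      <- SSE / ((a - 1) * (b - 1))
> F0       <- MSlevels / MSE
> Fcrit    <- qf(1 - alpha, df1 = a - 1, df2 = (a - 1) * (b - 1))
> pval     <- pf(F0, df1 = a - 1, df2 = (a - 1) * (b - 1), lower.tail = FALSE)
> c(F0 = F0, Fcrit = Fcrit, p.value = pval)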
Analysis of the RCBD
Statistical model

# First model
> model <- aov(Y ~ Algorithm + Instance_Group, data = aggdata)

> summary(model)
                Df   Sum Sq Mean Sq F value  Pr(>F)
Algorithm        6   359949   59991   41.75  <2e-16 ***
Instance_Group  35 60438639 1726818 1201.86  <2e-16 ***
Residuals      210   301725    1437
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

> summary.lm(model)$r.squared
[1] 0.9950618
Analysis of the RCBD
Statistical model - residuals

> par(mfrow = c(2, 2))
> plot(model, pch = 20, las = 1)
Analysis of the RCBD
Statistical model - data transformation

# Trying with log-transformed response variable
> model2 <- aov(log(Y) ~ Algorithm + Instance_Group, data = aggdata)

> summary(model2)
                Df Sum Sq Mean Sq F value  Pr(>F)
Algorithm        6   0.60  0.1002   120.4  <2e-16 ***
Instance_Group  35  89.92  2.5690  3089.0  <2e-16 ***
Residuals      210   0.17  0.0008
---

> summary.lm(model2)$r.squared
[1] 0.9980742
Analysis of the RCBD
Statistical model - residuals

> par(mfrow = c(2, 2))
> plot(model2, pch = 20, las = 1)
Analysis of the RCBD
Multiple blocking factors

If there is more than one factor that needs to be blocked out of the analysis, it is possible to simply add new terms to the statistical model; the sums of squares for these new terms are analogous to the one already shown for the single-block case (see the sketch below).
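A minimal sketch on simulated data (not the VRP example; factor names and values are made up) shows that a second blocking factor is handled simply by adding another term to the model formula:

# Sketch: RCBD-style model with two blocking factors (simulated data)
> set.seed(42)
> d <- expand.grid(Algorithm = factor(1:3),
                   Block1    = factor(1:5),
                   Block2    = factor(1:4))
> d$Y <- rnorm(nrow(d),
               mean = 10 * as.integer(d$Block1) + 5 * as.integer(d$Block2))
> summary(aov(Y ~ Algorithm + Block1 + Block2, data = d))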
Analysis of the RCBD
Blocking efficiency

After an experiment is performed and the ANOVA model is fit to the data, it is possible to calculate the relative blocking efficiency (E), which quantifies how much larger a CRD would have to be in order to attain the same power as the corresponding RCBD.

The relative efficiency can be calculated from the mean squares returned in the ANOVA table of the RCBD as:

$$E = \frac{(b-1)\,MS_{blocks} + b(a-1)\,MS_E}{(ab-1)\,MS_E}$$

A value of 1.3, for instance, would indicate that the CRD would have required 30% more observations to achieve the same power. For the running example, this value can be calculated from the summary table: E = 431.6.
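As a sketch (assuming a fitted RCBD model such as the model2 object from earlier, together with the aggdata frame), E can be computed directly from the ANOVA mean squares:

# Sketch: relative blocking efficiency from a fitted RCBD model
> ms <- summary(model2)[[1]][["Mean Sq"]]   # Algorithm, Instance_Group, Residuals
> a  <- nlevels(aggdata$Algorithm)
> b  <- nlevels(aggdata$Instance_Group)
> E  <- ((b - 1) * ms[2] + b * (a - 1) * ms[3]) / ((a * b - 1) * ms[3])
> E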
Analysis of the RCBD
Sample size calculations

The determination of the required sample size for an RCBD follows the same procedures as for the CRD: the number of replicates calculated using the CRD formulas represents the number of blocks required (since we have one replicate per block).

The within-group variability term necessary for calculating the required sample size corresponds to the estimated residual variance, i.e., the unexplained variability after accounting for the experimental and blocking factors, which can be somewhat challenging to estimate without information from previous studies (or a pilot experiment).
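A rough sketch using power.anova.test from the stats package (all numbers below are illustrative placeholders, not values from the example); the returned n is then read as the required number of blocks:

# Sketch: CRD-style power calculation; interpret n as the number of blocks.
# between.var and within.var are hypothetical (e.g., from a pilot study).
> power.anova.test(groups      = 7,       # a = 7 algorithms
                   between.var = 180,     # hypothetical variance of the level means
                   within.var  = 1437,    # hypothetical residual variance
                   sig.level   = 0.05,
                   power       = 0.8)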
Analysis of the RCBD
Multiple comparisons

For the post-hoc multiple comparison procedures, we also follow the same general guidelines used for the CRD, with some minor differences.

Paired t-tests are used for the pairwise comparisons, with dfE = (a − 1)(b − 1) as the number of degrees of freedom for the reference distributions.

As with the CRD, it may be convenient to use the post-hoc pairwise comparison procedures as the basis for sample size calculations, particularly in cases where the cost of observations is relatively low.
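As a sketch of a single unadjusted pairwise comparison done by hand (assuming the model and aggdata objects from before; in practice one would adjust for multiplicity, e.g. with multcomp as on the next slide):

# Sketch: one pairwise comparison using the RCBD error term and dfE
> tab <- summary(model)[[1]]
> MSE <- tab[nrow(tab), "Mean Sq"]                     # residual mean square
> a   <- nlevels(aggdata$Algorithm)
> b   <- nlevels(aggdata$Instance_Group)
> dfE <- (a - 1) * (b - 1)
> means <- tapply(aggdata$Y, aggdata$Algorithm, mean)
> delta <- unname(means["Mod1"] - means["Original"])   # example contrast
> se    <- sqrt(2 * MSE / b)                           # SE of a difference of means
> tcrit <- qt(0.975, dfE)
> c(estimate = delta, lower = delta - tcrit * se, upper = delta + tcrit * se)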
Analysis of the RCBD
Multiple comparisons

> library(multcomp)
> duntest <- glht(model2, linfct = mcp(Algorithm = "Dunnett"))
> duntestCI <- confint(duntest)
> par(mar = c(5, 8, 4, 2), las = 1)
> plot(duntestCI, xlab = "Mean difference (log scale)")
Randomized complete block designs
Final considerations

The RCBD is a generalization of the pairing concept used with t-tests, and provides an elegant way to remove nuisance effects from the analysis.

The consequence of using a blocking design is an increased sensitivity of the test, which means smaller sample sizes.

In practice, running an experiment as an RCBD results in little experimental overhead, and it should be preferred over the CRD whenever possible.

The design and analysis of incomplete block designs (e.g., with missing observations) are also relatively simple^e.

^e See Montgomery (2005) for details.
Bibliography

Required reading
1. D.C. Montgomery, Design and Analysis of Experiments, Ch. 4-5, 5th ed., Wiley, 2005.
2. M. Christman, R. Littell, STA166 - Statistical Methods Research I: RCBD ANOVA Notes II. http://goo.gl/KrpbKL

Recommended reading
1. R. Dawkins, Unweaving the Rainbow: Science, Delusion and the Appetite for Wonder, Mariner Books, 2000.
2. J. Morgan and C. Albon, Partially Derivative (podcast). http://www.partiallyderivative.com
About this material
Conditions of use and referencing

This work is licensed under the Creative Commons CC BY-NC-SA 4.0 license
(Attribution Non-Commercial Share Alike International License version 4.0).
http://creativecommons.org/licenses/by-nc-sa/4.0/

Please reference this work as:

Felipe Campelo (2015), Lecture Notes on Design and Analysis of Experiments.
Online: https://github.com/fcampelo/Design-and-Analysis-of-Experiments
Version 2.11, Chapter 11; Creative Commons BY-NC-SA 4.0.
@Misc{Campelo2015-01,
title={Lecture Notes on Design and Analysis of Experiments},
author={Felipe Campelo},
howPublished={\url{https://github.com/fcampelo/Design-and-Analysis-of-Experiments}},
year={2015},
note={Version 2.11, Chapter 11; Creative Commons BY-NC-SA 4.0.},
}
