Designs to reduce the error

# Designs to reduce the error

**Session 7**

.light[MATH 80667A: Experimental Design and Statistical Methods <br>for Quantitative Research in Management <br>
HEC Montréal
]

]
---
class: title title-1

# Outline

.box-3.large.sp-after-half[Blocking]

.box-7.large.sp-after-half[Analysis of covariance]

---

layout: false
name: blocking
class: center middle section-title section-title-3 animated fadeIn

# Blocking

---

---

# Terminology for *nuisance*

.pull-left-3.center[

.box-inv-3.sp-after-half[
**Block**
]

Source of variation, but of no interest
<br> known and controllable

.box-inv-3.sp-after-half[Example]
timing <br> lab technician <br>  machine

]

.pull-middle-3.center[

.box-inv-3.sp-after-half[
**Covariates**
]

Explanatory measured **before** the experiment

Cannot be acted upon

.box-inv-3.sp-after-half[Example]
socioeconomic variables <br> environmental conditions
]

.pull-right-3.center[

.box-inv-3.sp-after-half[
**Noise factor**
]

Under which setting is response least affected?

.box-inv-3.sp-after-half[Example]
temperature <br> processing
]

???

Example 15.2.3 in Dean & Voss of Lipton soup mixture

A: number of mixer ports through which vegetable oil was added (two levels, 1 and 3);
B: temperature of mixer jacket (two levels; ambient temperature, presence of cooling water);
C: mixing time (two levels; 60 and 80 sec);
D: batch weight (two levels; 1500 and 2000 lb);
E: delay between mixing and packaging (two levels; 1 day and 7 days).

---

# Why blocking? 
.pull-left.sp-after-half[
.box-inv-3.sp-after-half[ 
Design experiment to reduce the effect of uncontrolled variations 
]
]
.pull-right.sp-after-half[
.box-inv-3.sp-after-half[
In general, increases the power of the `$F$`
test for treatment effects.
]
]

.pull-left.sp-after-half[
.box-inv-3.sp-after-half[
Group units in sets as alike as possible.
]
]

.pull-right.sp-after-half[
.box-inv-3.sp-after-half[
(Often) compare only treatments, so interactions are not included.
]
]

---

# Assignment to treatment

.box-inv-3.medium.sp-after-half[
Divide subjects within each block
]

.box-inv-3.medium.sp-after-half[
Randomly allocate to treatment within block
]

.box-3.sp-after-half[
(stratified sampling)
]

---

# Block-treatment design

Without interaction,

`$$\underset{\text{response}\vphantom{b}}{Y_{ij}} =  \underset{\text{global mean}}{\mu\vphantom{\beta_j}} + \underset{\text{treatment}\vphantom{b}}{\alpha_i\vphantom{\beta_j}} + \underset{\text{blocking}}{\beta_j}+ \underset{\text{error}\vphantom{b}}{\varepsilon_{ij}\vphantom{\beta_j}}$$`

Compromise between

- reduced variability for residuals,
- loss of degrees of freedom due to estimation of `$\beta$`'s.

---

# Example: Resting metabolic rate
From Dean, Voss and Draguljić (2017), Example 10.4.1 (p. 311)

> experiment that was run to compare the effects of inpatient and outpatient protocols on the in-laboratory measurement of resting metabolic rate (RMR) in humans. A previous study had indicated measurements of RMR on elderly individuals to be 8% higher using an outpatient protocol than with an inpatient protocol. If the measurements depend on the protocol, then comparison of the results of studies conducted by different laboratories using different protocols would be difficult. The experimenters hoped to conclude that the effect on RMR of different protocols was negligible.

???

The experimental treatments consisted of three protocols: (1) an inpatient protocol in which meals
were controlled—the patient was fed the evening meal and spent the night in the laboratory, then RMR
was measured in the morning; (2) an outpatient protocol in which meals were controlled—the patient
was fed the same evening meal at the laboratory but spent the night at home, then RMR was measured
in the morning; and (3) an outpatient protocol in which meals were not strictly controlled—the patient
was instructed to fast for 12 hours prior to measurement of RMR in the morning.

---
# Fitting the complete block design

```r
url <- "https://edsm.rbind.io/files/data/resting_metabolic_rate.txt"
# transform integers to factors (categorical)
resting <- read.table(url, header = TRUE) |>
  dplyr::mutate(
    subject = factor(subject), #blocking factor
    protocol = factor(protocol), #experimental factor
    rate = rate/1000)
```

]

```r
# Force sum-to-zero parametrization for unordered factors
options(contrasts = c("contr.sum", "contr.poly"))
# Fit model with blocking
model_block <- aov(rate ~ subject + protocol, data = resting)
# One-way ANOVA (no blocking)
model_raw <- aov(rate ~ protocol, data = resting)
# anova(model_block)
# anova(model_raw)
```
]

```r
ggplot(data = resting,
       aes(x = subject,
           y = rate,
           group = protocol,
           color = protocol)) +
     geom_line() + 
     labs(y = "mean resting\n metabolic rate") + 
     theme_classic() +
     theme(legend.position = "bottom")
```
]
]
---

# Interaction plot

---

# Impact of blocking
.panelset[
.panel[.panel-name[ANOVA table (with blocking)]
<table class="table" style="margin-left: auto; margin-right: auto;">
<caption>Analysis of variance table - with blocking</caption>
 <thead>
  <tr>
   <th style="text-align:left;">   </th>
   <th style="text-align:right;"> Degrees of freedom </th>
   <th style="text-align:right;"> Sum of squares </th>
   <th style="text-align:right;"> Mean square </th>
   <th style="text-align:right;"> F statistic </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> subject </td>
   <td style="text-align:right;"> 8 </td>
   <td style="text-align:right;"> 23.12 </td>
   <td style="text-align:right;"> 2.89 </td>
   <td style="text-align:right;"> 37.42 </td>
   <td style="text-align:right;"> 0.000 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> protocol </td>
   <td style="text-align:right;"> 2 </td>
   <td style="text-align:right;"> 0.04 </td>
   <td style="text-align:right;"> 0.02 </td>
   <td style="text-align:right;"> 0.23 </td>
   <td style="text-align:right;"> 0.795 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Residuals </td>
   <td style="text-align:right;"> 16 </td>
   <td style="text-align:right;"> 1.24 </td>
   <td style="text-align:right;"> 0.08 </td>
   <td style="text-align:right;">  </td>
   <td style="text-align:right;">  </td>
  </tr>
</tbody>
</table>
]

.panel[.panel-name[ANOVA table (without blocking)]
<table class="table" style="margin-left: auto; margin-right: auto;">
<caption>Analysis of variance table - without blocking</caption>
 <thead>
  <tr>
   <th style="text-align:left;">   </th>
   <th style="text-align:right;"> Degrees of freedom </th>
   <th style="text-align:right;"> Sum of squares </th>
   <th style="text-align:right;"> Mean square </th>
   <th style="text-align:right;"> F statistic </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> protocol </td>
   <td style="text-align:right;"> 2 </td>
   <td style="text-align:right;"> 0.04 </td>
   <td style="text-align:right;"> 0.02 </td>
   <td style="text-align:right;"> 0.02 </td>
   <td style="text-align:right;"> 0.982 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Residuals </td>
   <td style="text-align:right;"> 24 </td>
   <td style="text-align:right;"> 24.35 </td>
   <td style="text-align:right;"> 1.01 </td>
   <td style="text-align:right;">  </td>
   <td style="text-align:right;">  </td>
  </tr>
</tbody>
</table>
]

]

---
layout: false
name: ancova
class: center middle section-title section-title-7

# Analysis of covariance

---

# IJLR: It's Just a Linear Regression...

All ANOVA models covered so far are linear regression model.

The latter says that

`$$\underset{\text{average response}\vphantom{l}}{\textsf{E}(Y_{i}\vphantom{\beta_p})}  = \underset{\text{linear (i.e., additive) combination of explanatories}}{\beta_0 + \beta_1 \mathrm{X}_{1i} + \cdots + \beta_p \mathrm{X}_{pi}}$$`

In an ANOVA, the model matrix `$\mathbf{X}$` simply includes columns with `$-1$`, `$0$` and `$1$` for group indicators that enforce sum-to-zero constraints.

---

# What's in a model?

In experimental designs, the explanatories are

- experimental factors (categorical)
- continuous (dose-response)

.box-inv-7.medium[
Random assignment implies<br> no systematic difference between groups.
]

---

# ANCOVA = Analysis of covariance

- Analysis of variance with added continuous covariate(s) to reduce experimental error (similar to blocking).
- These continuous covariates are typically concomitant variables (measured alongside response).
- Including them in the mean response (as slopes) can help reduce the experimental error (residual error).

---

# Control to gain power!

.box-inv-7.sp-after-half[
Identify external sources of variations
]

- enhance balance of design (randomization)
- reduce mean squared error of residuals to increase power

These steps should in principle increase power **if** the variables used as control are correlated with the response.

---

# Example

Abstract of [van Stekelenburg et al. (2021)](https://doi.org/10.1177/09567976211007788)

> In three experiments with more than 1,500 U.S. adults who held false beliefs, participants first learned the value of scientific consensus and how to identify it. Subsequently, they read a news article with information about a scientific consensus opposing their beliefs. We found strong evidence that in the domain of genetically engineered food, this two-step communication strategy was more successful in correcting misperceptions than merely communicating scientific consensus.

???

Aart van Stekelenburg, Gabi Schaap, Harm Veling and Moniek Buijzen (2021), Boosting Understanding and Identification of Scientific Consensus Can Help to Correct False Beliefs, Psychological Science
https://doi.org/10.1177/09567976211007788
---

# Experiment 2: Genetically Engineered Food

We focus on a single experiment; preregistered exclusion criteria led to `$n=442$` total sample size (unbalanced design).

Three experimental conditions:

.float-left.center[.box-7[`Boost`] .box-7[`Boost Plus`] .box-7[Consensus only (`consensus`)] ]

---

# Model formulation

Use `post` as response variable and `prior` beliefs as a control variable in the analysis of covariance.

> their response was measured on a visual analogue scale ranging from –⁠100 (I am 100% certain this is false) to 100 (I am 100% certain this is true) with 0 (I don’t know) in the middle.

---
# Plot of post vs prior response

---

# Model formulation

Average for the `$r$`th replication of the `$i$`th experimental group is
`$$\begin{align*}\mathsf{E}(\texttt{post}_{ir}) &= \mu + \alpha_i\texttt{condition}_i + \beta \texttt{prior}_{ir}.\\
\mathsf{Va}(\texttt{post}_{ir}) &= \sigma^2\end{align*}$$`

We assume that there is no interaction between `condition` and `prior`
- the slopes for `prior` are the same for each `condition` group.
- the effect of prior is linear

---

# Contrasts of interest

1. Difference between average boosts (`Boost` and `BoostPlus`) and control (`consensus`)
2. Difference between `Boost` and `BoostPlus` (pairwise)

Inclusion of the `prior` score leads to increased precision for the mean (reduces variability).

---

# Contrasts with ANCOVA

- The estimated marginal means will be based on detrended values `$\neq$` group averages
- In the `emmeans` package, the average of the covariate is used as value.
- the difference between levels of `condition` are the same for any value of `prior` (parallel lines), but the uncertainty changes.

Multiple testing adjustments:

- Methods of Bonferroni (prespecified number of tests) and Scheffé (arbitrary contrasts) still apply
- Can't use Tukey anymore (adjusted means are not independent anymore).

---

# Data analysis

```r
library(emmeans)
options(contrasts = c("contr.sum", "contr.poly"))
data(SSVB21_S2, package = "hecedsm")
# Check balance
with(SSVB21_S2, table(condition))
```
] 
.panel[.panel-name[Scatterplot]
.pull-left.small[

```r
library(ggplot2)
ggplot(data = SSVB21_S2,
       aes(x = prior, y = post)) + 
       geom_point() + 
       geom_smooth(method = "lm",
                   se = FALSE)
```
.small[Strong correlation; note responses that achieve max of scale.]
]
.pull-right[
<img src="img/07/Experiment2_scatter.png" width="70%" style="display: block; margin: auto;" />

]
]
.panel[.panel-name[Model]

```r
# Check that the data are well randomized
car::Anova(lm(prior ~ condition, data = SSVB21_S2), type = 3)
# Fit linear model with continuous covariate
model1 <- lm(post ~ condition + prior, data = SSVB21_S2)
# Fit model without for comparison
model2 <- lm(post ~ condition, data = SSVB21_S2)
# Global test for differences - of NO INTEREST
car::Anova(model1, type = 3)
car::Anova(model2, type = 3)
```
]
.panel[.panel-name[ANOVA]
.pull-left.small[
<table>
 <thead>
  <tr>
   <th style="text-align:left;"> term </th>
   <th style="text-align:right;"> sum of squares </th>
   <th style="text-align:right;"> df </th>
   <th style="text-align:right;"> statistic </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> (Intercept) </td>
   <td style="text-align:right;"> 166341 </td>
   <td style="text-align:right;"> 1 </td>
   <td style="text-align:right;"> 71.7 </td>
   <td style="text-align:right;"> 0.00 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: black !important;background-color: yellow !important;"> condition </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 14107 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 2 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 3.0 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 0.05 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> prior </td>
   <td style="text-align:right;"> 385385 </td>
   <td style="text-align:right;"> 1 </td>
   <td style="text-align:right;"> 166.1 </td>
   <td style="text-align:right;"> 0.00 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Residuals </td>
   <td style="text-align:right;"> 1016461 </td>
   <td style="text-align:right;"> 438 </td>
   <td style="text-align:right;">  </td>
   <td style="text-align:right;">  </td>
  </tr>
</tbody>
</table>
]
.pull-right.small[
<table>
 <thead>
  <tr>
   <th style="text-align:left;"> term </th>
   <th style="text-align:right;"> sum of squares </th>
   <th style="text-align:right;"> df </th>
   <th style="text-align:right;"> statistic </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> (Intercept) </td>
   <td style="text-align:right;"> 123377 </td>
   <td style="text-align:right;"> 1 </td>
   <td style="text-align:right;"> 38.64 </td>
   <td style="text-align:right;"> 0.000 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: black !important;background-color: yellow !important;"> condition </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 11680 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 2 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 1.83 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 0.162 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Residuals </td>
   <td style="text-align:right;"> 1401846 </td>
   <td style="text-align:right;"> 439 </td>
   <td style="text-align:right;">  </td>
   <td style="text-align:right;">  </td>
  </tr>
</tbody>
</table>
]
]
]
---
class: title title-7
# Data analysis

```r
emm1 <- emmeans(model1, specs = "condition")
# Note order: Boost, BoostPlus, consensus
emm2 <- emmeans(model2, specs = "condition")
# Not comparable: since one is detrended and the other isn't
contrast_list <- list(
   "boost vs control" = c(0.5,  0.5, -1), 
   #av. boosts vs consensus
   "Boost vs BoostPlus" = c(1, -1,  0))
contrast(emm1, 
         method = contrast_list, 
         p.adjust = "holm")
```
]
.panel[.panel-name[_t_-tests]
.pull-left.small[
<table>
 <thead>
  <tr>
   <th style="text-align:left;"> contrast </th>
   <th style="text-align:right;"> estimate </th>
   <th style="text-align:right;"> se </th>
   <th style="text-align:right;"> df </th>
   <th style="text-align:right;"> t stat </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> boost vs control </td>
   <td style="text-align:right;"> -8.37 </td>
   <td style="text-align:right;"> 4.88 </td>
   <td style="text-align:right;"> 438 </td>
   <td style="text-align:right;"> -1.72 </td>
   <td style="text-align:right;"> 0.09 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Boost vs BoostPlus </td>
   <td style="text-align:right;"> 9.95 </td>
   <td style="text-align:right;"> 5.60 </td>
   <td style="text-align:right;"> 438 </td>
   <td style="text-align:right;"> 1.78 </td>
   <td style="text-align:right;"> 0.08 </td>
  </tr>
</tbody>
</table>
Contrasts with ANCOVA with `prior` (Holm-Bonferroni adjustment with `$k=2$` tests)
]
.pull-right.small[
<table>
 <thead>
  <tr>
   <th style="text-align:left;"> contrast </th>
   <th style="text-align:right;"> estimate </th>
   <th style="text-align:right;"> se </th>
   <th style="text-align:right;"> df </th>
   <th style="text-align:right;"> t stat </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> boost vs control </td>
   <td style="text-align:right;"> -5.71 </td>
   <td style="text-align:right;"> 5.71 </td>
   <td style="text-align:right;"> 439 </td>
   <td style="text-align:right;"> -1.00 </td>
   <td style="text-align:right;"> 0.32 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Boost vs BoostPlus </td>
   <td style="text-align:right;"> 10.74 </td>
   <td style="text-align:right;"> 6.57 </td>
   <td style="text-align:right;"> 439 </td>
   <td style="text-align:right;"> 1.63 </td>
   <td style="text-align:right;"> 0.10 </td>
  </tr>
</tbody>
</table>
Contrasts for ANOVA (Holm-Bonferroni adjustment with `$k=2$` tests)
]
]
.panel[.panel-name[Assumptions]

.pull-left.small[

```r
# Test equality of variance
levene <- car::leveneTest(
   resid(model1) ~ condition, 
   data = SSVB21_S2,
   center = 'mean')
# Equality of slopes (interaction)
car::Anova(lm(post ~ condition * prior, 
           data = SSVB21_S2),
           model1, type = 3)
```

Levene's test of equality of variance: _F_ (2, 439) = 2.05 with a `$p$`-value of 0.13.
]
.pull-right.small[

<table>
 <thead>
  <tr>
   <th style="text-align:left;"> term </th>
   <th style="text-align:right;"> sum of squares </th>
   <th style="text-align:right;"> df </th>
   <th style="text-align:right;"> statistic </th>
   <th style="text-align:right;"> p-value </th>
  </tr>
 </thead>
<tbody>
  <tr>
   <td style="text-align:left;"> (Intercept) </td>
   <td style="text-align:right;"> 165573 </td>
   <td style="text-align:right;"> 1 </td>
   <td style="text-align:right;"> 71.3 </td>
   <td style="text-align:right;"> 0.0 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> condition </td>
   <td style="text-align:right;"> 4245 </td>
   <td style="text-align:right;"> 2 </td>
   <td style="text-align:right;"> 0.9 </td>
   <td style="text-align:right;"> 0.4 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> prior </td>
   <td style="text-align:right;"> 382596 </td>
   <td style="text-align:right;"> 1 </td>
   <td style="text-align:right;"> 164.9 </td>
   <td style="text-align:right;"> 0.0 </td>
  </tr>
  <tr>
   <td style="text-align:left;font-weight: bold;color: black !important;background-color: yellow !important;"> condition:prior </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 3257 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 2 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 0.7 </td>
   <td style="text-align:right;font-weight: bold;color: black !important;background-color: yellow !important;"> 0.5 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Residuals </td>
   <td style="text-align:right;"> 1016461 </td>
   <td style="text-align:right;"> 438 </td>
   <td style="text-align:right;">  </td>
   <td style="text-align:right;">  </td>
  </tr>
</tbody>
</table>

Model with interaction `condition*prior`. Slopes don't differ between condition.
]
]
]
---

# The kitchen sink approach

.box-inv-7.medium.sp-after[Should we control for more stuff?]

.box-7.medium.sp-after[NO! ANCOVA is a design to reduce error]

- Randomization should ensure that there is no confounding
- No difference (on average) between group given a value of the covariate.
- If it isn't the case, adjustment matters

---

# Equal trends
.small[
- If trends are different, meaningful comparisons (?)
- Differences between groups depend on value of the covariate
]
<img src="07-slides_files/figure-html/ancovadifftrend-1.png" width="50%" style="display: block; margin: auto;" />
.small[Due to lack of overlap, comparisons hazardous as they entail extrapolation one way or another.]

---

# Testing equal slope

.box-7.sp-after[Compare two nested models]

- Null `$\mathscr{H}_0$`: model with covariate
- Alternative `$\mathscr{H}_a$`: model with interaction covariate * experimental factor