
Say I have a full factorial design with 30 participants (ppt_id), each of whom was tested in 4 conditions:

(left side + blue color), (left side + red color), (right side + blue color), and (right side + red color)

And my dependent variable is reaction time (RT).

## build the example data frame (equivalent to growing it column by column)
df = data.frame(
  ppt_id = rep(1:30, 4),
  color  = rep(c("red", "blue"), each = 60),
  side   = rep(rep(c("left", "right"), each = 30), 2),
  RT     = c(rnorm(30, 600, 50),
             rnorm(30, 650, 50),
             rnorm(30, 700, 50),
             rnorm(30, 600, 50))
)

Now I run a repeated-measures ANOVA (anova_test() from the rstatix package) and find that there is a significant interaction between the factors Side and Color:

library(rstatix)

anova_test(
  data = df, 
  dv = RT, 
  wid = ppt_id,
  within = c(color, side),
  effect.size = "pes")

How can I perform multiple comparisons of all conditions against each other (a post hoc test) in R? All I have found are functions for a single variable with more than two levels (one-way ANOVA) or for mixed designs (i.e., one between- and one within-subject variable).

Alex M
  • Are you asking a programming or a statistical question? For the latter, please post on [CrossValidated](https://stats.stackexchange.com/) – Parfait Jul 14 '22 at 00:01
  • Programming, i.e., which package/function is needed to calculate this test in R. – Alex M Jul 14 '22 at 09:23

2 Answers


One option may be to use the afex package to fit the ANOVA and the marginaleffects package to compute contrasts. (Disclaimer: I am the author of marginaleffects.)

First, fit the model:

library(marginaleffects)
library(afex)

## build the example data frame (equivalent to growing it column by column)
df = data.frame(
  ppt_id = rep(1:30, 4),
  color  = rep(c("red", "blue"), each = 60),
  side   = rep(rep(c("left", "right"), each = 30), 2),
  RT     = c(rnorm(30, 600, 50),
             rnorm(30, 650, 50),
             rnorm(30, 700, 50),
             rnorm(30, 600, 50))
)

mod <- aov_ez(
    id = "ppt_id",
    dv = "RT",
    within = c("side", "color"),
    data = df)

Then, post-process the model to get the contrasts. This will tell you how the outcome values predicted by the model change when we manipulate the predictors (and their pairwise combinations):

cmp <- comparisons(
    # the model
    mod, 
    # hold other regressors at their means if there are any
    newdata = "mean",
    # vector of variables we want to "manipulate" in the contrast
    variables = c("side", "color"),
    # we care about interactions
    interactions = TRUE)

summary(cmp)
#> Average contrasts 
#>           side      color  Effect Std. Error z value   Pr(>|z|)
#> 1  left - left blue - red 110.489      10.71 10.3148 < 2.22e-16
#> 2 right - left  red - red  51.055      13.84  3.6896 0.00022457
#> 3 right - left blue - red   6.274      12.19  0.5147 0.60674367
#>    2.5 % 97.5 %
#> 1  89.49 131.48
#> 2  23.93  78.18
#> 3 -17.62  30.16
#> 
#> Model type:  afex_aov 
#> Prediction type:  response

There is a detailed vignette on contrasts here: https://vincentarelbundock.github.io/marginaleffects/articles/contrasts.html

Edit: A few notes on interpretation

Start by using the afex package’s default predict() method to compute the predicted outcome for an individual with color blue and each value of the side variable:

nd <- data.frame(side = c("left", "right"), color = "blue", ppt_id = 1)
nd
#>    side color ppt_id
#> 1  left  blue      1
#> 2 right  blue      1

predict(mod, newdata = nd)
#>        1        2 
#> 678.7812 609.2096

The difference between left+blue and right+blue is:

diff(predict(mod, newdata = nd))
#>         2 
#> -69.57154

We can obtain the same result with standard errors using the predictions() and comparisons() functions from the marginaleffects package:

predictions(mod, newdata = nd)
#>   rowid     type predicted std.error statistic p.value conf.low conf.high  side color ppt_id
#> 1     1 response  678.7812  11.02046  61.59284       0 657.1815  700.3808  left  blue      1
#> 2     2 response  609.2096   9.27690  65.66953       0 591.0272  627.3920 right  blue      1

comparisons(
    mod,
    variables = "side",
    newdata = datagrid(color = "blue", ppt_id = 1))
#>   rowid     type        term contrast_side comparison std.error statistic      p.value  conf.low conf.high side color ppt_id
#> 1     1 response interaction  right - left  -69.57154  14.12988 -4.923717 8.491553e-07 -97.26559 -41.87748 left  blue      1

This is a simple “contrast”: What happens to the predicted outcome when side changes from left to right, and color is equal to blue for individual 1?

Now we can compute the same contrast between “left” and “right”, but for individuals in either color condition:

comparisons(
    mod,
    variables = "side",
    newdata = datagrid(color = c("red", "blue"), ppt_id = 1))
#>   rowid     type        term contrast_side comparison std.error statistic      p.value  conf.low conf.high side color ppt_id
#> 1     1 response interaction  right - left   52.41222  13.74332  3.813650 1.369293e-04  25.47581  79.34864 left   red      1
#> 2     2 response interaction  right - left  -69.57154  14.12988 -4.923717 8.491553e-07 -97.26559 -41.87748 left  blue      1

In the first example I gave you, you see “interacted” contrasts: What happens when both the side and color variables change simultaneously?

I’m not an expert in the afex package, so you will have to refer to their documentation to answer the “within” question.
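If what you ultimately want is a table with all pairwise comparisons of the four side × color cells, one further option (a sketch, assuming the emmeans package is installed) is that afex_aov objects are supported by emmeans, so you can compute the cell means from mod and then request all pairwise contrasts:

```r
library(afex)
library(emmeans)

# estimated marginal means for the four side x color cells of the model above
emm <- emmeans(mod, ~ side * color)

# all six pairwise comparisons, here with a Holm adjustment
pairs(emm, adjust = "holm")
```

pairs() accepts other p-value adjustments via the adjust argument (e.g. "tukey", "bonferroni", "none"), so you are not tied to any single correction method.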

Vincent
  • Usually, [recommending libraries/software](https://meta.stackoverflow.com/a/251135/1422451) is discouraged on any SE site. Answers should relate to whatever implementation OP uses. OP should do the necessary [research](https://meta.stackoverflow.com/questions/261592/how-much-research-effort-is-expected-of-stack-overflow-users) picking a solution with an earnest attempt before asking for help. – Parfait Jul 14 '22 at 00:10
  • Do you mean that *asking for software recommendations* is discouraged (which is what your link says) or that *recommending software* is discouraged? I always understood that the former is discouraged (for good reasons), but I see that latter being upvoted all the time. – Vincent Jul 14 '22 at 02:28
  • Sorry, I don't quite understand how to read the output of comparisons. E.g., I would like to know whether the (right + blue) condition is different from the (left + blue) condition and get stats for this comparison. Where should I look? Also, does it account for these being within-factors, i.e., not independent observations? – Alex M Jul 14 '22 at 12:03
  • I added a few notes on interpretation. Hopefully this helps. – Vincent Jul 14 '22 at 12:47

Perhaps I was not able to understand the answer by @Vincent, or maybe I did not explain clearly what I needed.

Ideally, I would like to get a table with all possible comparisons for this dataset, i.e., there should be six comparisons (because there are 4 possible conditions), something like what this function produces:

library(rstatix)
library(dplyr)

## combine both independent variables into one factor
df$condition = as.factor(paste0(df$side, "_", df$color))

## calculate pairwise paired t-tests with a Holm correction
mult_comp = df %>%
  pairwise_t_test(
    RT ~ condition, paired = TRUE, 
    p.adjust.method = "holm") %>%
  select(-statistic, -df)

Now, the problem is that this function can only work with one independent variable (the same is true for the base R function pairwise.t.test), while I actually have two independent variables.

One possible solution is to fit the ANOVA with the aov function and then use the TukeyHSD function to calculate the pairwise comparisons:

anova_df = aov(RT ~ side*color, data = df)

TukeyHSD(anova_df)

The downside is that the calculation is then limited to the Tukey method, which is not always appropriate. E.g., in this case it would not be statistically correct to use it, since I have a repeated-measures design, and it is not possible to use other corrections with this function.
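For what it's worth, a possible workaround for the correction-method limitation (a sketch, assuming the emmeans package is installed) is to compute estimated marginal means from the aov fit and then request the pairwise contrasts with whatever adjustment is appropriate:

```r
library(emmeans)

# estimated marginal means for all four side x color cells of anova_df
emm <- emmeans(anova_df, ~ side * color)

# all six pairwise comparisons; adjust can be "holm", "bonferroni", "tukey", etc.
pairs(emm, adjust = "holm")
```

This still inherits the limitation that aov here ignores the repeated-measures structure, so a proper within-subject fit (e.g. via the afex package) would be more appropriate for the final analysis.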

Alex M