MaAsLin2 for Differential Abundance Analysis in a Repeated Measures Microbiome Study with Multiple Covariates

I’m working on a microbiome study aiming to identify microbial taxa significantly associated with clinical outcomes (success vs. failure). My dataset includes: 3 timepoints, three treatment methods, other covariates, and repeated measures across individual patients. Given the repeated measures and covariates in my dataset, is MaAsLin2 appropriate for identifying differentially abundant microbes across timepoints and treatment methods?
So, how should I define the formula in MaAsLin2 to account for:

  1. Fixed effects: outcome (success/failure), timepoint, treatment method, and other covariates
  2. Random effect: repeated measures per PatientID

I would appreciate it if anyone could share an example formula setup or best practices for configuring MaAsLin2 in this kind of scenario.

Hi,

There’s a fair amount of discussion about this type of design (here, here, and a lot of old ones), but in short:

  • If you actually care about differences over time, the model you’ve specified will give one effect per time point, which assumes the effect of time is the same for all treatment groups. If you want different time effects per treatment group, add an interaction term (though you might need a lot of data for this to fit well).
  • If you don’t care about differences over time because nothing’s changing consistently for these patients with the time point, don’t include time point.

Also, MaAsLin 3 exists now in addition to 2, and it helps distinguish between prevalence and abundance effects in addition to many other ease-of-use improvements.

Will