Reference level changes lead to different pairwise differential abundance results in Maaslin3

xingbo · May 11, 2026, 7:00pm

I have a question regarding the effect of changing the reference group in a categorical variable.

I am analyzing three groups: right, left, and rectum, using Maaslin3 with:

fixed effect of interest: tissue location (right, left, rectum)
covariates: sex, BMI, and age
model: abundance
significance threshold: qval_individual < 0.1

As I understand it, Maaslin3 uses the first factor level as the reference group.

When I set right as the reference level, Maaslin3 compares:

left vs right
rectum vs right

When I set rectum as the reference level, Maaslin3 compares:

right vs rectum
left vs rectum

However, the number of significant differential features detected for the right vs rectum comparison is different between these two runs, even though biologically this should represent the same pairwise comparison.

For example:

using right as reference gives results for rectum vs right
using rectum as reference gives results for right vs rectum

but the significant feature counts are not identical.

Is this expected behavior in Maaslin3?

If so, could you explain why changing the reference level changes the number of detected features for what appears to be the same pairwise comparison?

I am wondering whether this could be related to:

model parameterization or contrast coding,
how qval_individual is calculated,
multiple testing correction being applied separately to different coefficient sets,
interaction with covariates (sex, BMI, age),
prevalence/filtering procedures,
or some other aspect of the Maaslin3 implementation.
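
To make the multiple-testing possibility in the list above concrete, here is a minimal sketch in plain Python. It assumes, without confirmation from the thread or the MaAsLin3 source, that q-values are Benjamini-Hochberg corrected over the pooled p-values of all coefficients in one run. The p-values are made-up illustrative numbers, and `bh_qvalues` is a hypothetical helper, not a MaAsLin3 function.

```python
def bh_qvalues(pvals):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i])
    qvals = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down so the adjusted values stay monotone.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvals[idx] * n / rank)
        qvals[idx] = running_min
    return qvals


# Hypothetical p-values for the shared right/rectum contrast (4 features).
# These are assumed identical in both runs.
shared = [0.004, 0.03, 0.06, 0.20]

# Hypothetical p-values for the OTHER coefficient in each run.
left_vs_right = [0.001, 0.002, 0.01, 0.02]   # run with right as reference
left_vs_rectum = [0.5, 0.7, 0.8, 0.9]        # run with rectum as reference

# Pool both coefficients of a run before correcting (the assumption above).
q_ref_right = bh_qvalues(shared + left_vs_right)[: len(shared)]
q_ref_rectum = bh_qvalues(shared + left_vs_rectum)[: len(shared)]

sig_ref_right = sum(q < 0.1 for q in q_ref_right)
sig_ref_rectum = sum(q < 0.1 for q in q_ref_rectum)
print(sig_ref_right, sig_ref_rectum)
```

With these toy numbers the shared contrast has three features below 0.1 in one run and one in the other, although its p-values are identical. This only shows that such a mechanism could change the counts; it does not show that this is what happens in my data.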

Thank you very much for your help.

WillNickols · May 11, 2026, 7:30pm

Hi,

A few questions:

Are the differences you’re observing only in the significance tests, or in the coefficients as well?
How different are the results in the two cases?

If you’d prefer to post a chunk of the two results here or email me at willnickols@g.harvard.edu, I can take a closer look. I would think the two models should give equivalent results, but there might be quirks in the median comparison that produce differences.
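
As a rough illustration of why the two models would be expected to agree, the sketch below fits an ordinary least-squares model twice on toy data, once with right and once with rectum as the reference. It is plain OLS written from scratch, not MaAsLin3's pipeline, and it leaves out the median comparison entirely. The data values and the `solve` and `fit` helpers are hypothetical.

```python
def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit(rows, reference):
    """OLS with dummy coding; returns {level: (coefficient, t statistic)}."""
    levels = [lv for lv in ("right", "left", "rectum") if lv != reference]
    # Design matrix: intercept, one dummy per non-reference level, age.
    X = [[1.0] + [1.0 if site == lv else 0.0 for lv in levels] + [age]
         for site, age, _ in rows]
    y = [value for _, _, value in rows]
    p = len(X[0])
    xtx = [[sum(r[i] * r[j] for r in X) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(X, y)) for i in range(p)]
    beta = solve(xtx, xty)
    resid = [yi - sum(b * x for b, x in zip(beta, r)) for r, yi in zip(X, y)]
    sigma2 = sum(e * e for e in resid) / (len(y) - p)
    out = {}
    for k, lv in enumerate(levels, start=1):
        unit = [1.0 if i == k else 0.0 for i in range(p)]
        var_k = sigma2 * solve(xtx, unit)[k]  # k-th diagonal of (X'X)^-1
        out[lv] = (beta[k], beta[k] / var_k ** 0.5)
    return out


# Toy data: (tissue location, age, log abundance of one feature).
rows = [
    ("right", 50, 2.1), ("right", 61, 2.9), ("right", 45, 2.4),
    ("left", 58, 3.3), ("left", 49, 3.0), ("left", 66, 3.9),
    ("rectum", 53, 4.2), ("rectum", 70, 4.8), ("rectum", 47, 3.7),
]

ref_right = fit(rows, "right")    # contains rectum vs right
ref_rectum = fit(rows, "rectum")  # contains right vs rectum
print(ref_right["rectum"], ref_rectum["right"])
```

In this setting the rectum vs right coefficient and its t statistic are the exact negatives of the right vs rectum ones, so the raw p-values match. If your two runs differ in the coefficients themselves, that would point to something beyond the basic linear model.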

Will
