Hi, I’m using Maaslin2 to perform association analysis on microbial functional abundance data (KEGG Orthologs), and I have some questions about the gFC (geometric Fold Change) output and its interpretation.
fitdata = Maaslin2(
input_data = ko_otu_df, # KO functional abundance, relative abundance (RA)
input_metadata = pd, # Clinical metadata
output = "maaslin2_results",
fixed_effects = c("Group", "AGE", "SEX"),
reference = c("Group,HC"),
min_abundance = 0.0005,
min_prevalence = 0.1,
normalization = "CLR", # Using CLR normalization
transform = "NONE", # No additional transformation
analysis_method = "LM",
cores = 10
)
My questions:
- With
normalization="CLR"
andtransform="NONE"
, can thegFC
output by Maaslin2 be interpreted as the geometric fold change of abundance between groups? - If
gFC = exp(coef)
does not represent the traditional fold change under this setting, would you recommend switching tonormalization="TSS"
+transform="LOG"
to obtain more interpretable fold change values? - Alternatively, is there a recommended approach to calculate the fold change between groups separately (e.g., using mean or median of raw relative abundances) while keeping CLR normalization for the regression model?
Thank you very much for your help!