Metaphlan3 analysis with Maaslin2

Hi @plicht - we usually don’t recommend one model over the others and leave it to the user’s best judgment. All these included models have been carefully validated (as described in our preprint) so that they together represent a multi-model system appropriate for many different microbial community data types (taxonomy or functional profiles), environments (human or otherwise), and measurements (counts or relative counts) along with the implementation of alternative normalization/transformation schemes and statistical models as we strongly believe that the best model for a given dataset is highly context-dependent.

In your case, the total number of detected features is only one way to assess this performance. I recommend deep-diving into the detected features if they are meaningful biologically with respect to effect size, overall distribution, or prior knowledge. An intersection of a few plausible results is a good starting point if you want to start from a reduced set of features.

One minor point: for relative abundances, count models such as negative binomial and ZINB are not appropriate, which might explain why you are not seeing any significant results from running those models. Other than that, CPLM is also an appropriate model for a high number of zero counts in the data. I hope this helps in your decision-making to some extent :slight_smile:

Best,
Himel