Subsetting 16S rRNA Data

Jacob_Nearing · May 11, 2020, 9:17pm

Hello Maaslin2 authors/users,

Does anyone have any recommendations on whether we should subset our 16S rRNA amplicon data to the same read depth before running Maaslin2? I noticed by default it runs its own normalization, however, I couldn’t find a clear answer on the usage of this feature for 16S rRNA data.

Thanks, Jacob Nearing

Kelsey_Thompson · May 14, 2020, 10:56am

Hi Jacob,

Thanks for the question! Rarefying the data is not standard practice for me in my analysis. Plenty of researchers do use it, so if you wanted to it is not wrong. I really like this manuscript by McMurdie and Holmes on the practice: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003531

How I typically approach sequencing depth in my analysis is to first remove any samples below a given threshold, for 16S I normally choose 5,000 reads. Then I look at the distribution of reads across all of my samples, if that varies a lot and you are concerned with it potentially impacting your analysis you can include sequencing depth as a covariate in your MaAsLin model to correct for these differences. MaAsLin does not incorporate a rarefying step on its own. The two things it employs are a normalization (TSS for MaAsLin2 as default) and transformation of the data (LOG for MaAsLin2).

I hope that helped! Let us know if you have any additional questions.

Best,
Kelsey

Jacob_Nearing · May 14, 2020, 6:37pm

Hi Kelsey,

Thanks for clearing that up and taking the time to answer my question. That makes a lot of sense and is generally how I go about my own analysis. I was mostly curious to see what others were doing as some tools are built explicitly to deal with data in a compositional aware manner and so rarefying the data is not recommended while other tools highly suggest this practice. It was not clear to me whether this was the case for Maaslin2.

Thanks,
Jacob Nearing

Topic		Replies	Views
Filter-normalize order and comments on tutorial MaAsLin	1	599	May 20, 2022
Metagenomic and min_abundance filtering MaAsLin	2	568	February 14, 2023
Normalization methods MaAsLin	2	101	March 6, 2024
MaAsLin2 normalization methods MaAsLin	1	2119	October 2, 2020
How to define the transformation / the normalization to use in Maaslin2 MaAsLin	4	2437	May 26, 2021

Subsetting 16S rRNA Data

Related Topics