Statistical Approach for Comparing Functional Potential of Selected SGBs vs. Community Background

microbiome1 · July 7, 2025, 3:19pm

I have a set of metagenomic samples that have been processed using BioBakery tools.

Let’s say I’ve identified several SGBs of interest. I’d like to compare their genetic potential (pathways/EC’s) to that of the rest of the microbial community (i.e., all other SGBs). What would be an appropriate statistical approach to test this?

I assume it’s important to account for differences in SGB’s relative abundances - if the SGBs of interest are less abundant overall, I would naturally expect them to contribute less to a given functional feature.

To complicate things a bit more, the groups are unbalanced: I’m comparing ~40 SGBs of interest against >1000 other SGBs. What would be a good way to address both the abundance weighting and the imbalance in group size?

Thanks!

Topic		Replies	Views
About the Downstream analysis and statistics category Downstream analysis and statistics	0	861	November 12, 2019
Quantifying uncultured bacteria MetaPhlAn	0	199	September 27, 2021
Total Bacteria species relative abundance no longer sums to 1 after converting the SGB profile to GTDB MetaPhlAn	11	1240	April 6, 2023
Comprehensive names/aliases of taxonomy units in the gut microbiome Downstream analysis and statistics	1	283	November 17, 2020
Help with Understanding Microbial Community Analysis Tools HUMAnN	1	51	October 11, 2024

Statistical Approach for Comparing Functional Potential of Selected SGBs vs. Community Background

Related topics