Rarefaction analysis from metatranscriptome data

omkarkul · June 22, 2021, 1:33am

Hi,
I am a beginner in bioinformatics. I have analyzed a metatranscriptome dataset with HUMAnN2 pipeline and it has generated “bugs_list.txt” as expected. However, this file only shows the relative abundance of the detected taxa. Is there a way I could access the number of raw reads for the marker gene of every taxon?
My motivation for that is to perform rarefaction analysis to see whether sequencing depth was sufficient to detect all functionally active taxa. Based on that, I want to determine sequencing depth for another experiment where I am only interested in getting the list of active taxa.

franzosa · July 6, 2021, 5:39pm

Sorry for my slow reply - the only way to get to read counts would be to look at the raw mapping files that HUMAnN parses to build its various abundance profiles and directly count the number of reads hitting each sequence (or taxon). All of these files are available under your sample’s temp/ directory.

omkarkul · July 7, 2021, 4:11am

Thanks, Eric! Are you referring to “sample_diamond_aligned.tsv”? I see the following files in the temp folder.
tempfolder

franzosa · July 7, 2021, 9:29pm

Correct - that file stores the raw mapping of reads to protein sequences (of unclassified taxonomy) whereas the equivalent “_bowtie2_aligned.tsv” file stores the mapping of reads to pangenome sequences. You could count up the number of times each target sequence occurs in that file to get something more count-like (noting that it won’t directly maps HUMAnN’s abundances due to lack of filtering/normalization).

Topic		Replies	Views
Count of individual genes from ChocoPhLan database rather than UniRef gene family based RPK HUMAnN	2	493	January 8, 2021
Unmapped reads - relative abundance and absolute counts in gene families output HUMAnN	1	372	September 21, 2023
How should I compare HUMAnN 4 “Counts” from concatenated paired-end reads to known species read abundances? HUMAnN	3	126	November 6, 2025
How can I get taxa relative abundances from HUMAnN? HUMAnN	0	283	April 8, 2023
Get OTU table in humann output HUMAnN	14	1263	July 29, 2022

Rarefaction analysis from metatranscriptome data

Related topics