Percent mapped reads

jjsimz · June 30, 2020, 1:00am

Hi, I am new to HUMAnN3 (and HUMAnN in general). I had no problem installing it using conda. I compared the mapping of the demo fastq file before and after installing the complete ChocoPhlAn and UniRef90 databases. Surprisingly, the percentage of mapped reads improved only marginally, from

Unaligned reads after nucleotide alignment: 88.3095238095 %
to
Unaligned reads after nucleotide alignment: 87.3714285714 %, for ChocoPhlAn, and

Unaligned reads after translated alignment: 83.6190476190 %
to
Unaligned reads after translated alignment: 80.3904761905 %, for UniRef90.

Is there a mistake? Or is it common that only 20% of reads map for a typical stool sample? (I understand that this may be a limitation of the NCBI database/annotation rather than the pipeline per se.)

Thanks,
Choon

franzosa · June 30, 2020, 2:16am

This is expected since we are still using the HUMAnN 2.0 demo dataset in the 3.0 alpha release, and it’s very shallow. While many of the reads hit genes in the demo and full databases, they don’t do so with sufficient coverage to believe that those genes are actually present (hence the reads are reported as unexplained). If you turn down/off the subject coverage filters you should see the majority of reads explained.
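As a rough sketch of what "turning off" the subject coverage filters could look like, the snippet below just assembles a command line without running it. The flag names are as I recall them from `humann --help` in HUMAnN 3, so treat them as an assumption and check them against your install; `demo.fastq` and `demo_out` are placeholder paths.

```python
def humann_command(input_fastq, output_dir, nucleotide_cov=0.0, translated_cov=0.0):
    """Build (but do not run) a humann call with relaxed subject coverage filters.

    Assumption: the two threshold flags below match your HUMAnN version;
    confirm with `humann --help` before using them.
    """
    return [
        "humann",
        "--input", input_fastq,
        "--output", output_dir,
        # A threshold of 0 means a gene is kept regardless of how much of it is covered.
        "--nucleotide-subject-coverage-threshold", str(nucleotide_cov),
        "--translated-subject-coverage-threshold", str(translated_cov),
    ]


# Placeholder file names, for illustration only.
cmd = humann_command("demo.fastq", "demo_out")
print(" ".join(cmd))
```

Relaxing these filters is mainly useful for checking that the demo behaves as described; for real samples the defaults are probably the safer choice.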

We’ll be making an improved demo for the official v3.0 release.

For a real stool sample I’d expect 50-80% of reads to be mapped by HUMAnN.
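To compare a run against that range, the "Unaligned reads" log lines can be converted into mapped percentages. This is a minimal sketch that assumes the log line format quoted earlier in this thread and that the translated figure is cumulative (reads still unaligned after both steps); the function name is just for illustration.

```python
import re

# Matches lines such as:
#   Unaligned reads after nucleotide alignment: 87.3714285714 %
LOG_PATTERN = re.compile(
    r"Unaligned reads after (nucleotide|translated) alignment: ([0-9.]+) %"
)


def mapped_percentages(log_text):
    """Return {stage: percent of reads aligned by the end of that stage}."""
    result = {}
    for stage, unaligned in LOG_PATTERN.findall(log_text):
        result[stage] = 100.0 - float(unaligned)
    return result


# Values from the full-database run reported above.
log = """Unaligned reads after nucleotide alignment: 87.3714285714 %
Unaligned reads after translated alignment: 80.3904761905 %"""

for stage, pct in mapped_percentages(log).items():
    print(f"{stage}: {pct:.1f} % of reads aligned so far")
```

For the demo this gives roughly 12.6 % after the nucleotide step and 19.6 % after the translated step, which matches the "only 20%" figure in the original question.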

jjsimz · June 30, 2020, 1:12pm

Thanks for the quick response!
