MetaPhlAn 4.x Support for Long Reads?

Hi! :blush:

I’m trying to use MetaPhlAn 4.1.1 (as part of HUMAnN 3.9) on some very long Oxford Nanopore reads (10,000 to 100,000 bp and longer).

Unfortunately, it is consuming an enormous amount of RAM. A 28 MB file containing reads no longer than 100,000 bp (the smallest fastq.gz in the dataset) consumed 70 GB of RAM to get through the Bowtie2 step. Other files in my dataset (<100 GB) hit out-of-memory (OOM) errors at this stage.

I have seen reports that Bowtie2 performs poorly on long reads. Is there any interest in integrating a long-read aligner such as Minimap2 to circumvent this issue?

Also: is there any other way I can use HUMAnN for my dataset? I am willing to bypass MetaPhlAn entirely, if you have any ideas… :grinning_face_with_smiling_eyes:

Hi @PileOfAmoebas! We are testing MetaPhlAn on long reads with Minimap2, and this will be available in a few weeks with the next MetaPhlAn release. The code we are testing is in the `code_refactor` branch of the MetaPhlAn repo if you are interested.