Is long computation time in HUMAnN 3 related to not using KneadData?

Greetings,
I am running 150 bp PE metagenomic reads from soil samples on a university HPC cluster, and the DIAMOND step does not finish even after running for 7 days. I am using the UniRef50 database, and it is the only one in my data/uniref folder.

I previously did QC with Trimmomatic, so I did not use KneadData, and I concatenated my QC'd forward and reverse reads into a single input file. I had the impression from reading the docs that KneadData was not necessary, but after looking into KneadData more I see that it can also filter rRNA. So I'm wondering whether filtering rRNA with KneadData prior to running HUMAnN is recommended to reduce run time.
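In case it helps, my commands were roughly of this shape (sample names and paths are placeholders for my actual ones, and I'm pointing HUMAnN at the database folder via --protein-database):

```
# Concatenate QC'd forward and reverse reads into one file,
# since HUMAnN takes a single FASTQ input
cat sample_R1.trimmed.fastq sample_R2.trimmed.fastq > sample_cat.fastq

# Run HUMAnN with the UniRef50 translated-search database
humann \
    --input sample_cat.fastq \
    --output humann_out \
    --protein-database data/uniref
```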

  1. I would greatly appreciate some clarification as to whether removing rRNA with KneadData would reduce HUMAnN's run time.
  2. If filtering rRNA with KneadData is indeed recommended, could I filter the bowtie2_unaligned.fa file from HUMAnN and proceed with --resume? Or would I need to delete all the intermediate temp output files and start over?

Thanks!
Grace

Hi Grace,

Thanks for reaching out! How many cores are you using for your HUMAnN run? And how large is your input file? Unless you are running with a single core on a very large input file, that run time seems very long. Do you see any errors in any of the logs?

If you expect contamination in your samples, I would suggest running KneadData in addition to the QC you ran. The only way KneadData would reduce the HUMAnN run time would be if it filtered out a significant number of your reads (so that HUMAnN had significantly fewer reads to align).
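For example, a multi-threaded run with an optional KneadData rRNA-filtering pass might look something like the sketch below. Treat it as a rough outline rather than a prescription: the database path, thread count, and output file name are placeholders, and KneadData's input flag spellings have changed across versions (e.g., --input vs. --input1/--input2), so check kneaddata --help for your install.

```
# Download KneadData's ribosomal RNA reference database
# (destination folder is a placeholder)
kneaddata_database --download ribosomal_RNA bowtie2 kneaddata_db/

# Filter rRNA from the already-QC'd reads; --bypass-trim skips
# Trimmomatic since QC was already done upstream
kneaddata --input sample_cat.fastq \
    --reference-db kneaddata_db/ \
    --output kneaddata_out \
    --bypass-trim \
    --threads 8

# Re-run HUMAnN on the filtered reads with multiple cores
# (the _kneaddata output file name is a placeholder for the default)
humann --input kneaddata_out/sample_cat_kneaddata.fastq \
    --output humann_out \
    --threads 8
```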

Thank you,
Lauren