Do I have to remove overlapping reads from paire_end data before Metahplan?

Michela_Francesconi · November 27, 2024, 3:55pm

I have paired_end files from shoutgun metagenomics analysis (251 bp). Before starting with Metaphan, I run fastqc and fastq_screen to check how my files are.
I used KneadData to delete the human genome, and now it is ok. (I also notice that all my files do not pass the “Per Base Sequence Content.” Is this a problem? All the other control is OK.)

Should I also have to delete overlapping reads between R1 and R2? How can I do it? I try your preprocessing.py file in Python, but I do not understand the difference with KneadData. Can you help me?

Thanks
Michela

Topic		Replies	Views
What to do with unmatched paired-end reads from kneadata outputs? KneadData	1	475	August 3, 2023
MetaPhlAn preprocessing of reads MetaPhlAn	1	547	December 3, 2021
Dealing with paired-end shotgun metagenomics sequencing ran on two lanes KneadData	0	381	May 13, 2022
Kneaddata output as input for metaphlan KneadData	1	21	September 30, 2025
Massive difference between paired reads' counts KneadData	1	642	May 1, 2021

Do I have to remove overlapping reads from paire_end data before Metahplan?

Related topics