All paired-end reads unmatched

Zoexfq · July 15, 2024, 5:29am

Hi, I just came across the same issue.

When using KneadData, it appears that the software first outputs a SAM file and then processes this file to determine the mapping results. My understanding is that during this post-processing step, the software identifies paired reads by examining the suffix of each read’s name, looking for either ‘/1’ or ‘/2’ to differentiate between the two ends of a pair.

While this approach works seamlessly with raw sequencing data, I have found that when working with data obtained from public databases, the read names are often sanitized, and the distinguishing ‘/1’ or ‘/2’ suffixes are removed. Without those suffixes, the post-processing step may fail to recognize the two mates as a pair and report them as unmatched.
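As a stopgap on the user side, the suffixes can be put back before running KneadData. Below is a minimal Python sketch of that idea. It assumes plain 4-line FASTQ records, and the function names `add_mate_suffix` and `suffix_fastq_lines` are my own, not part of KneadData. Note that it drops anything after the first whitespace in the header, which is a choice I made to keep the ID clean, not something KneadData requires as far as I know.

```python
import re

# Matches a read ID that already ends in /1 or /2.
_MATE_SUFFIX = re.compile(r"/[12]$")


def add_mate_suffix(header: str, mate: int) -> str:
    """Return the read ID from a FASTQ header, ending in /1 or /2.

    Hypothetical helper: keeps only the ID (text before the first
    whitespace) and appends /<mate> if no mate suffix is present.
    """
    if mate not in (1, 2):
        raise ValueError("mate must be 1 or 2")
    if not header.startswith("@"):
        raise ValueError("FASTQ header lines must start with '@'")
    read_id = header.split(None, 1)[0]
    if _MATE_SUFFIX.search(read_id):
        return read_id
    return f"{read_id}/{mate}"


def suffix_fastq_lines(lines, mate: int):
    """Yield FASTQ lines with every header (each 4th line) suffixed.

    Assumes strict 4-line records: header, sequence, '+', qualities.
    """
    for i, line in enumerate(lines):
        if i % 4 == 0:
            yield add_mate_suffix(line, mate) + "\n"
        else:
            yield line
```

For example, `add_mate_suffix("@SRR000001.1 1 length=100", 1)` gives `@SRR000001.1/1`, while a header that already ends in `/2` is returned unchanged.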

Bowtie2 actually offers built-in options to handle such cases. With --un-conc, paired reads that fail to align concordantly are written to separate per-mate files, and --un does the same for unpaired reads. Because bowtie2 knows the pairing from the -1/-2 inputs rather than from the read names, the paired-end information is retained even when the names have been altered or lack these suffixes.
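To illustrate what I mean, here is a rough Python sketch that only assembles such a bowtie2 command line. The flags (-x, -1, -2, --un-conc, -S, -p) are standard bowtie2 options, but the function name `bowtie2_unaligned_cmd` and all file names are placeholders of mine, and this is not how KneadData currently calls bowtie2.

```python
import shlex


def bowtie2_unaligned_cmd(index, fq1, fq2, un_conc, sam="/dev/null", threads=1):
    """Build a bowtie2 argument list that keeps unaligned pairs together.

    Hypothetical helper. bowtie2 replaces a '%' in the --un-conc path
    with 1 or 2 to name the per-mate output files.
    """
    return [
        "bowtie2",
        "-x", index,           # reference (e.g. host genome) index prefix
        "-1", fq1,             # mate 1 reads
        "-2", fq2,             # mate 2 reads
        "--un-conc", un_conc,  # pairs that did not align concordantly
        "-p", str(threads),
        "-S", sam,             # SAM output (discarded by default here)
    ]


if __name__ == "__main__":
    cmd = bowtie2_unaligned_cmd(
        "host_index", "sample_R1.fastq", "sample_R2.fastq", "clean_%.fastq"
    )
    # Print the command for inspection; pass `cmd` to subprocess.run to execute.
    print(shlex.join(cmd))
```

One caveat: --un-conc keeps every pair that does not align concordantly, which can include pairs where only one mate hits the reference, so it may be less strict than the current filtering.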

Could you maybe include an option for users to enable bowtie2’s --un-conc and --un parameters during the mapping process? This would allow for better handling of paired-end reads with modified names.
