Hi,
I am trying to understand the output files generated by kneaddata for a paired-end sample.
Input files:
Sample_1 (Forward) (11G)
Sample_2 (Reverse) (9.7G)
I obtain the following output files:
Sample_kneaddata.log
Sample_1.kneaddata.repeats.removed.1.fastq (8.6G)
Sample_2.kneaddata.repeats.removed.2.fastq (7.2G)
Sample_1.kneaddata.repeats.removed.unmatched.1.fastq (1.2G)
Sample_2.kneaddata.repeats.removed.unmatched.2.fastq (85.9MB)
Sample_1.kneaddata.trimmed.1.fastq (8.7G)
Sample_2.kneaddata.trimmed.2.fastq (7.3G)
Sample_1.kneaddata.trimmed.single.1.fastq (1.2G)
Sample_2.kneaddata.trimmed.single.2.fastq (86.3MB)
I assume that Sample_1.kneaddata.repeats.removed.1.fastq (and the corresponding _2 file) are the files I need for the next step of the analysis, because they are smaller than the trimmed files. Is that right?
Also, I don't understand what "single" means in "Sample_X.kneaddata.trimmed.single.X.fastq", or why there is such a large difference in file size between _1 (1.2G) and _2 (86.3MB).
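Since file size is only a rough proxy, I could also compare the number of reads in each file. Below is a minimal sketch of how I would do that, assuming the files are uncompressed FASTQ with exactly four lines per record. The function names are my own and are not part of kneaddata.

```python
from pathlib import Path


def count_fastq_reads(path):
    """Count reads in an uncompressed FASTQ file.

    Assumption: every record occupies exactly 4 lines
    (header, sequence, '+', qualities).
    """
    with open(path, "rb") as handle:
        n_lines = sum(1 for _ in handle)
    return n_lines // 4


def summarize(paths):
    """Return {file name: read count} for the files that exist on disk."""
    return {
        Path(p).name: count_fastq_reads(p)
        for p in paths
        if Path(p).is_file()
    }


if __name__ == "__main__":
    # File names taken from the listing above; adjust the paths as needed.
    files = [
        "Sample_1.kneaddata.trimmed.1.fastq",
        "Sample_2.kneaddata.trimmed.2.fastq",
        "Sample_1.kneaddata.trimmed.single.1.fastq",
        "Sample_2.kneaddata.trimmed.single.2.fastq",
        "Sample_1.kneaddata.repeats.removed.1.fastq",
        "Sample_2.kneaddata.repeats.removed.2.fastq",
        "Sample_1.kneaddata.repeats.removed.unmatched.1.fastq",
        "Sample_2.kneaddata.repeats.removed.unmatched.2.fastq",
    ]
    for name, n_reads in summarize(files).items():
        print(f"{name}\t{n_reads}")
```

If the two paired files report the same read count while the "single" and "unmatched" files differ, that would at least tell me the size difference comes from the number of reads and not from read length.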
I would appreciate your help.
All the best,
Joao