The bioBakery help forum

Kneaddata log file error

Dear developers,

I found an error in kneaddata log file. it seems that it wrongly matches the reference database and the decontaminated file. Below is a demo of the issue:

the database is marked as hg37dec_v0.1 but from the file name (SILVA_128fastq), rRNA database is used at this step. This minor error leads to the merged log file wrongly report the number of reads get filtered at each step. It would be very helpful if you could help me to look into this issue.

Best wishes,
Shen

Hi @auas ,

Apologies for the late reply. Can you provide us the kneaddata command that you were using (reference database parameter)?

Regards,
Sagun

Hi Sagun,

Nice to hear from you. Below is the command I am using:

"kneaddata --reorder --serial --input {input.re_trimmed_p1} --input {input.re_trimmed_p2} -o {params.kneaddata_root} -db {params.rRNA} -db {params.hDNA} -db {params.hRNA} \
	 --run-fastqc-start --run-fastqc-end  --remove-intermediate-output --trimmomatic-options 'HEADCROP:15 SLIDINGWINDOW:4:15 MINLEN:50' \
	 -t 4 --log {output.log_fp}  --output-prefix {params.sample_id}"

And the reference databases are:

hg37_bt2="/data/TOOLS/kneaddata-0.7.10/reference_DB/hg37_bt2/hg37dec_v0.1"
hg38_mRNA_bt2="/data/TOOLS/kneaddata-0.7.10/reference_DB/hg38_mRNA_bt2/human_hg38_refMrna"
SILVA_128_rRNA_bt2="/data/TOOLS/kneaddata-0.7.10/reference_DB/SILVA_128_rRNA_bt2/SILVA_128_LSUParc_SSUParc_ribosomal_RNA"

Looking forward to hearing from you.
Best wishes,
Shen