Hello everyone,
I met a very serious problem about the contaminated reads removing using bowtie2 in the kneaddata pipeline (v0.12.0).
I want to perform quality control on my giant panda (Ailuropoda_melanoleuca) fecal metagenomic data. I have downloaded the reference genome (GCF_002007445.2) in NCBI and build the bowtie2 index.
#I downloaded the ref genome using aspera and unzipped it. It is located in ~/Desktop/kneaddata_tutorial/
#Then I build the bowtie index
bowtie2-build -f GCF_002007445.2_ASM200744v3_genomic.fna Ailuropoda_melanoleuca_db
#Finally, I run the kneaddata pipeline
kneaddata --input1 ①-11.9-4.R1.fq.gz --input2 ①-11.9-4.R2.fq.gz \
> --reference-db ~/Desktop/kneaddata_tutorial/Ailuropoda_melanoleuca_db \
> --output KneaddataOutputPairedEnd_2 \
> --threads 24 --sequencer-source none --quality-scores phred33
#check the read count of each stage
kneaddata_read_count_table --input KneaddataOutputPairedEnd_2/ --output KneaddataOutputPairedEnd_2.tsv
KneaddataOutputPairedEnd_2.tsv (460 Bytes)
The reads number of decontaminated file is unbelievably zero! I have no idea about this result. Does it mean that there aren’t any microbe reads in my data?