Kneaddata decontamination algorithm

sagunmaharjann · May 5, 2020, 2:40pm

My team has started doing some microbiome analysis recently, and came across several useful tools developed by your lab, including Kneaddata.

We are particularly curious about the decontamination step, and wonder if you could share more details with us about how exactly it is done?

Many thanks in advance for your kind advice!

Cheers,
Marie

franzosa · May 7, 2020, 8:15pm

Hi Marie - You can read a bit more about KneadData here:

https://bitbucket.org/biobakery/kneaddata/wiki/Home

(Note that we’re in the process of migrating all of this material to Github; KneadData hasn’t moved yet.) The QC in KneadData is currently a two-step process: 1) general read-level QC and 2) contaminant read depletion.

Step 1 uses Trimmomatic and follows general best practices for shotgun sequencing reads: i.e. trim away low-quality bases and then discard the read if there aren’t enough high-quality bases remaining.

Step 2 maps the high-quality reads against one or more databases of contaminant sequences. In the case of human-associated metagenomes, we use a modified version of the human genome as the database for this step (modified = containing additional “decoy” sequences that are also believed to represent human contamination).

Topic		Replies	Views
Kneaddata not decontaminating fully KneadData	0	533	January 8, 2021
About the Kneaddata category KneadData	0	782	October 28, 2019
What happends with bowtie? KneadData	1	675	December 14, 2021
Why I got more reads after the quality control of KneadData? KneadData	1	1225	May 15, 2020
Should I use kneaddata in the following condition? KneadData	1	561	August 28, 2020

Kneaddata decontamination algorithm

Related topics