The bioBakery help forum

Marker gene length

Hi @fbeghini
What is the length, in base pairs, of the clade-specific marker genes in your database? I want to apply the same minimum read length to all my samples, so I need to make sure the cutoff I choose does not affect my results.


The markers in the database have different lengths. From the bioBakery 3 paper:

"we use the pan-proteome built using the UniRef90 clusters considering all proteins with a length between 150 and 1500 amino acids."
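
Since the question asks for base pairs and the quoted range is in amino acids, here is a rough conversion sketch. It assumes each marker spans the full coding sequence of its protein at three nucleotides per amino acid, which the quoted passage does not state; the function name and the optional stop-codon flag are illustrative only. Treat the result as an approximate range, not a figure taken from the database.

```python
# Rough conversion from protein length (amino acids) to coding-sequence length (bp).
# Assumption: a marker covers the whole coding sequence of its protein.

CODON_LENGTH_BP = 3  # one amino acid is encoded by three nucleotides


def protein_length_to_bp(aa_length: int, include_stop_codon: bool = False) -> int:
    """Return the approximate coding-sequence length in base pairs."""
    bp = aa_length * CODON_LENGTH_BP
    if include_stop_codon:
        bp += CODON_LENGTH_BP  # the stop codon adds three more nucleotides
    return bp


# Protein length bounds quoted from the bioBakery 3 paper.
min_bp = protein_length_to_bp(150)
max_bp = protein_length_to_bp(1500)

print(f"Approximate marker length range: {min_bp}-{max_bp} bp")
```

Under that assumption the markers would fall roughly between 450 and 4500 bp, which is longer than typical short reads.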