I have 17 MAGs/bins and I have annotated them so they belong to different classes. I want to pull genomes from the Genbank that are closely related to my MAGs. Here is my command for Actinomycetia class:
[e] no reference genomes found for “c__Actinomycetia”, please check the taxonomic label provided
However, if I manually search in Genbank I do find many Actinomycetia genomes.
Could you please help me how can I pull genomes for a specific class, family or genus for the purpose of drawing phylogenitc tree to see how similar/dissimilar my MAGs are from the genomes already present in the Genbank?
Hi and thanks for reporting this.
So, basically, the problem is with the label. The mapping of the taxonomy in PhyloPhlAn is from a couple of years ago and the class name “Actinomycetia” was after that. I think you will find all genomes you need by changing the taxonomic label to the old “c__Actinobacteria”.
The taxa2genomes_cpa201901_up201901.txt.bz2 is what PhyloPhlAn uses to download genomes. In your specific case, you might want to try specifying a different label with the -g param. In particular, for the two labels you are not finding, you can now try with s__Patescibacteria_group_bacterium and then with p__Candidatus_Andersenbacteria.