I’ve searched NCBI extensively, but I can’t work out where these labels come from or what they mean (for example: s__GGB3008_SGB3999). I’m having trouble justifying them in my research.
Hi @Anderson
The taxonomies you are showing correspond to several unknown SGBs (uSGBs), i.e. SGBs defined purely by metagenome-assembled genomes (MAGs). As uSGBs by definition do not contain any reference genome in NCBI, some parts of their taxonomy are totally unknown, and we therefore assign them a numeric identifier. The 6 cases you are showing are uSGBs that are unknown at every rank below the phylum; that is, we did not find any reference genome that shared at least 70% identity with them, so we are only confident in assigning a taxonomy up to the phylum level.
If you are interested in more details about the SGBs and the MetaPhlAn 4 database, please have a look at the following work: https://doi.org/10.1016/j.cell.2019.01.001
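To make the structure of a label such as s__GGB3008_SGB3999 easier to see, here is a minimal Python sketch that splits a label into its rank prefix and its numeric placeholder identifiers. It is not part of MetaPhlAn; the function name `parse_label`, the rank-prefix table, and the assumption that placeholder tokens are uppercase letters ending in "GB" followed by digits are all illustrative choices inferred from the example label in this thread.

```python
import re

# Rank prefixes as they commonly appear in lineage strings (assumed mapping).
RANKS = {
    "k": "kingdom", "p": "phylum", "c": "class", "o": "order",
    "f": "family", "g": "genus", "s": "species", "t": "SGB",
}

# Assumption: a placeholder token looks like GGB3008 or SGB3999,
# i.e. uppercase letters ending in "GB" followed by a number.
PLACEHOLDER = re.compile(r"^[A-Z]*GB\d+$")


def parse_label(label):
    """Split a label like 's__GGB3008_SGB3999' into rank and identifiers."""
    prefix, _, name = label.partition("__")
    tokens = name.split("_")
    # The name counts as a placeholder only if every token is a numeric ID.
    is_placeholder = all(PLACEHOLDER.match(t) for t in tokens)
    return {
        "rank": RANKS.get(prefix, "unknown"),
        "name": name,
        "placeholder": is_placeholder,
        "ids": tokens if is_placeholder else [],
    }


if __name__ == "__main__":
    print(parse_label("s__GGB3008_SGB3999"))
    print(parse_label("s__Escherichia_coli"))
```

A label flagged as a placeholder here has no named reference genome behind it at that rank, so searching NCBI for the name itself will not return anything.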
Hello,
Thanks for the answer, which explains my case as well :).
But how can I rely on the genomes that have been used? I would like to take a look at the genomes the genes come from, because I have interesting results.