DB version:
mpa_vJun23_CHOCOPhlAnSGB_202403
Downloaded from:
http://cmprod1.cibio.unitn.it/biobakery4/metaphlan_databases/mpa_vJun23_CHOCOPhlAnSGB_202403.tar
Several species in the genus Segatella are assigned the same taxonomy ID, 165179.
In the NCBI Taxonomy database, however, they are distinct species with different taxon IDs. 165179 is specifically the taxon ID of Segatella copri.
import bz2
import pickle

# Load the MetaPhlAn database (a bz2-compressed pickle)
with bz2.open("mpa_vJun23_CHOCOPhlAnSGB_202403.pkl", "rb") as f:
    a = pickle.load(f)

# Both entries carry 165179 as the species taxonomy ID
print(a["taxonomy"]["k__Bacteria|p__Bacteroidota|c__Bacteroidia|o__Bacteroidales|f__Prevotellaceae|g__Segatella|s__Segatella_sinensis|t__SGB1638"])
print(a["taxonomy"]["k__Bacteria|p__Bacteroidota|c__Bacteroidia|o__Bacteroidales|f__Prevotellaceae|g__Segatella|s__Segatella_copri|t__SGB1626"])
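To check how many Segatella species are affected, something like the sketch below could group species names by their species-level taxonomy ID. It rests on assumptions I have not confirmed against the database documentation: that each value in a["taxonomy"] is either a "|"-separated taxid string or a tuple whose first element is such a string, and that the seventh field is the species taxid. The function names are my own, not part of MetaPhlAn.

```python
from collections import defaultdict


def species_taxid(value):
    """Return the species-level taxid from one taxonomy entry.

    Assumption: the entry is a '|'-separated taxid string, or a tuple whose
    first element is such a string, ordered k|p|c|o|f|g|s[|t].
    """
    taxid_str = value[0] if isinstance(value, tuple) else value
    fields = taxid_str.split("|")
    # Index 6 is the species level under the assumed ordering
    return fields[6] if len(fields) > 6 else ""


def shared_species_taxids(taxonomy, genus="g__Segatella"):
    """Map each species taxid used by more than one species name in `genus`."""
    groups = defaultdict(set)
    for clade, value in taxonomy.items():
        levels = clade.split("|")
        if genus not in levels:
            continue
        species = next((lvl for lvl in levels if lvl.startswith("s__")), None)
        taxid = species_taxid(value)
        if species is None or not taxid:
            continue  # skip entries without a species name or species taxid
        groups[taxid].add(species)
    # Keep only taxids that are shared by two or more distinct species names
    return {t: sorted(names) for t, names in groups.items() if len(names) > 1}
```

With the database loaded as `a`, calling `shared_species_taxids(a["taxonomy"])` would list every Segatella species name that shares a taxid with another one, if the assumed value layout holds.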
Is this an issue with the database, or is it intended?
Thanks for your excellent tool.