I noticed this when my regex to pull out species names broke - there’s at least one species name that always seems to have brackets around the genus name:
k__Bacteria|p__Actinobacteria|c__Coriobacteriia|o__Coriobacteriales|f__Coriobacteriaceae|g__Enorma|s__[Collinsella]_massiliensis
In my dataset, its the only one that seems to be formatted this way. It also seems to be in the wrong genus?
$ conda list
# packages in environment at /home/vklepacc/miniconda3/envs/biobakery3:
#
# Name Version Build Channel
# ...
metaphlan 3.0.0.alpha pyh5ca1d4c_1 bioconda
# ...