Hi,
I’m running HUMAnN v3.9 using the mpa_vJun23_CHOCOPhlAnSGB_202403 database, and I have generated output files such as:
Gene families file (genefamilies.tsv)
Reactions file (reactions.tsv)
Pathway abundance file (pathabundance.tsv)
I understand that HUMAnN relies on MetaPhlAn’s taxonomic profiling, and the taxonomy used in the stratified outputs is based on SGB (Species-level Genome Bins).
I’ve already used the sgb_to_gtdb_profile.py script (provided with MetaPhlAn 4) to convert MetaPhlAn taxonomic profiles to GTDB format.
Now, I would like to ask:
Can the same sgb_to_gtdb_profile.py script or a similar approach be applied to convert the taxonomy in the HUMAnN output files to GTDB taxonomy?
If not directly, is there a recommended workflow or mapping file that can help translate the g__Genus|s__Species labels in the HUMAnN stratified outputs to GTDB taxonomy?
Any suggestions or tools would be greatly appreciated!
Thank you.
Boram