ChocoPhlAn/UniRef 201901b vs 201901

zdwallen · August 31, 2021, 6:43pm

Hello,

I’ve recently come across mentions of updated HUMAnN ChocoPhlAn/UniRef databases (201901b vs 201901), but haven’t been able to find any posts describing the update or what has changed in each database file. Could you please provide a brief synopsis of the update, and say whether you would recommend re-running metagenomic samples previously run with the 201901 versions of these databases?

Thank you for your help,
Zach

franzosa · September 1, 2021, 10:38pm

Thanks for the reminder on this. I just posted not-so-brief release notes on HUMAnN 3.0.0 and the 201901b database update here:

Whether I would rerun a dataset depends on the cost of the compute (which ends up being subjective). Personally, if I were working with a few hundred samples I would probably justify the rerun, but not for a few thousand samples. It might also depend on whether you have a species of interest among the ~600 new pangenomes added with this update.

Some of the other changes in this update (e.g. MetaCyc 24.0, the new UniRef mappings, changes to infer_taxonomy) could all be (re)computed quickly from gene family abundance profiles generated by the previous HUMAnN 3 – whether or not you want to repeat the gene family quantification would be the tougher question.
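As a rough illustration of why those downstream steps are cheap: regrouping amounts to summing existing gene family abundances under a new mapping, with no read alignment involved. The sketch below is plain Python, not HUMAnN's own code; the `regroup` helper, the UniRef90 IDs, the reaction names, and the abundance values are all hypothetical.

```python
from collections import defaultdict


def regroup(gene_abundance, group_to_genes):
    """Sum gene-family abundances into groups (e.g. reactions).

    gene_abundance: dict of gene family ID -> abundance
    group_to_genes: dict of group ID -> list of gene family IDs
    Gene families absent from every group are pooled as "UNGROUPED".
    """
    totals = defaultdict(float)
    for group, genes in group_to_genes.items():
        for gene in genes:
            if gene in gene_abundance:
                totals[group] += gene_abundance[gene]

    # Anything not covered by the mapping is reported separately.
    mapped = {g for genes in group_to_genes.values() for g in genes}
    totals["UNGROUPED"] = sum(
        value for gene, value in gene_abundance.items() if gene not in mapped
    )
    return dict(totals)


# Hypothetical gene family profile from an earlier run.
genefamilies = {"UniRef90_A": 10.0, "UniRef90_B": 5.0, "UniRef90_C": 2.5}

# Hypothetical "old" and "updated" mappings; the update assigns UniRef90_C.
old_mapping = {"RXN-1": ["UniRef90_A"], "RXN-2": ["UniRef90_B"]}
new_mapping = {"RXN-1": ["UniRef90_A", "UniRef90_C"], "RXN-2": ["UniRef90_B"]}

print(regroup(genefamilies, old_mapping))
print(regroup(genefamilies, new_mapping))
```

Applying an updated mapping only needs the existing gene family table, which is why it is fast compared with repeating the quantification itself.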

zdwallen · September 2, 2021, 2:38pm

Got it, thank you for the info.

With the database updates, is the mpa_v30_CHOCOPhlAn_201901_marker_info.txt.bz2 file still the most up-to-date marker info file, or is there a “b” counterpart to this file as well?

franzosa · September 3, 2021, 9:24pm

No changes to the MetaPhlAn markers for this release. The 600 new pangenomes in ChocoPhlAn 201901b were already quantifiable with MetaPhlAn 3’s 201901 markers, but they were not available as pangenomes because of a synchronization issue between UniProt and GenBank when we built the original batch.
