Announcing MetaPhlAn 4.1.1 release

aitor.blancomiguez · May 27, 2024, 8:07am

Announcement: We are pleased to share that MetaPhlAn 4.1.1 is now available, which provides fixed and consistent taxonomic identification at high taxonomic ranks, allowing accurate relative abundance estimation also at high taxonomic levels. This is enabled by a new version of the MetaPhlAn database (vJun23_202403) containing the taxonomic updates. It is also possible to fix taxonomic profiles generated with previous versions of the MetaPhlAn database (i.e., vJun23_202307 and earlier) by running the new script fix_relab_mpa4.py.

For details on MetaPhlan 4, check announcing MetaPhlAn 4 or visit the MetaPhlAn 4 GitHub repository.

What is new in MetaPhlAn 4.1.1

New utility script (fix_relab_mpa4.py) to fix the profiles generated with previous versions of the MetaPhlAn database
Bug fix allowing MetaPhlAn to continue its execution even when the option --profile_vsc is set and no viral hits are found
Bug fix in the new implementation (since StrainPhlAn 4.1) of --print_clades_only
Implementation of the option --subsampling_paired [N_PAIRED_READS] to subsample paired-end input reads. This option allows to pass forward and reverse reads separately through arguments -1 and -2, enabling the use of paired-end information for the subsampling procedure (check the usage example). The previously described --subsampling [N_READS] is the choice for running single-end subsampling.
Disclaimer: if paired-end information is not used (i.e., by using --subsampling), all reads in the input data are considered independent. This comes with the caveat that - after subsampling paired-end datasets with deep and shallow samples -, the deep samples will span a higher diversity of reads (because the different ends in the paired-end data will be rarely selected, while this will be more often the case for shallow samples). As a consequence, it is only by using paired-end information (through --subsampling_paired) that MetaPhlAn will be able to effectively correct for varying sequencing depths in paired-end data.
Improved taxonomies for the previous two MetaPhlAn databases (now vJun23_202403 and vOct22_202403, see below)

What has changed in vOct22_202403 in comparison to vOct22_202212

The vOct22_202403 database spans the exact same set of SGBs and marker genes present in the previously announced vOct22_202212 database. However, the new database contains fixed and consistent NCBI-based taxonomic labels. Further, the taxonomy for 2,087 SGBs in vOct22_202212 was reassigned following the identification of a bug in the calculation of centroid-centroid distances in the aforementioned database (this did not affect vJun23 databases).

What has changed in vJun23_202403 in comparison to vJun23_202307

The vJun23_202403 database spans the exact same set of SGBs and marker genes present in the previously announced vJun23_202307 database. However, the new database contains fixed and consistent NCBI-based taxonomic labels.

How to make use of the MetaPhlAn 4.1.1 updates

How to install MetaPhlAn 4.1.1 in a new environment:
- MetaPhlAn 4 · biobakery/MetaPhlAn Wiki · GitHub 97
How to update the MetaPhlAn database from the vJun23_202307 (or earlier) version:
- $ metaphlan --install --force_download

dchas · July 31, 2024, 4:36pm

Thank you for this update!

If I ran HUMAnN using the output of MetaPhlAn with the vJun23_202307 database, will I need to rerun HUMAnN after running fix_relab_mpa4.py to fix the taxonomy? Or, are the levels used by HUMAnN unaffected? Thank you!

ProsperP · August 1, 2024, 7:22am

Hi,

Thank you for the new support in profiling viruses. However, I have noticed an issue where MetaPhlAn fails to display error messages as expected and terminates improperly when the VSG FASTA file is missing, inaccessible, or compressed in the .bz2 format.

To address this bug, I have submitted a pull request on the MetaPhlAn GitHub repository. In my proposed solution, I have ensured that the parentheses of the .format method close correctly. And I have also included support for reading .bz2-compressed VSG FASTA files as input. I believe these changes should resolve the problem. Thanks.

lindaerlina · December 31, 2024, 5:49am

Thank you for the updates,

I have already installed the newest MetaPhlan version 4.1.1, but I have stuck in downloading the index mpa_vJun23_CHOCOPhlAnSGB_202403_bt2.tar always in 57.87 MB for several days.

Here is the details:
metaphlan --install --force_download

Downloading MetaPhlAn database
Please note due to the size this might take a few minutes

\Downloading and uncompressing indexes

Downloading http://cmprod1.cibio.unitn.it/biobakery4/metaphlan_databases/bowtie2_indexes/mpa_vJun23_CHOCOPhlAnSGB_202403_bt2.tar
Downloading file of size: 21924.74 MB
57.87 MB 0.26 % 0.21 MB/sec 1729 min 58 sec

I also tried to download the original files directly from Index of /biobakery4/metaphlan_databases/bowtie2_indexes but it also stuck and failed.

can you help me how to solve this?

thank you so much.

zoey1 · March 30, 2025, 7:11am

It’s a pleasure to meet you. I deeply admire your contribution and update. This is an excellent job. I’m a beginner who has just started getting in touch with this field, and I have some questions that I hope you could answer. What is the difference between vOct22_202403 database and vJun23_202403 database? Which database is the latest？Looking forward to hearing from you.

Topic		Replies	Views
Announcing MetaPhlAn 4.1 release (new virome database and SGB database update) MetaPhlAn	10	2331	May 27, 2024
MetaPhlAn 4 published + database update MetaPhlAn	15	6126	November 22, 2023
MetaPhlAn 4.2.2 release (initial long-read sequencing support and database update) MetaPhlAn	2	882	June 11, 2025
I am not getting viruses abundance in MetaPhlAn4.1 using mpa_vJun23 database MetaPhlAn	1	237	March 5, 2024
Further explanation needed re oct22_fix_tax.tsv MetaPhlAn	1	115	May 27, 2024

Announcing MetaPhlAn 4.1.1 release

What is new in MetaPhlAn 4.1.1

What has changed in vOct22_202403 in comparison to vOct22_202212

What has changed in vJun23_202403 in comparison to vJun23_202307

How to make use of the MetaPhlAn 4.1.1 updates

Related topics