Metaphlan2 trial run result

I am running Metaphlan2 (2.96.1) with the mpa_v296_CHOCOPhlAn database. When I analyzed the subsampled data (https://bitbucket.org/biobakery/biobakery/raw/tip/demos/biobakery_demos/data/metaphlan2/input/SRS014476-Supragingival_plaque.fasta.gz) provided in the tutorial with my installed software, I found the result is different from what given in the tutorial. Here I’m attaching my output file.
Please tell me why I am getting discrepancies in my result?

Link to tutorial output file:
https://bitbucket.org/biobakery/biobakery/raw/tip/demos/biobakery_demos/data/metaphlan2/output/SRS014476-Supragingival_plaque_profile.txt
Thank You,

DEEP CHANDA

Hi,
the results showed in the tutorial were computed with a previous version of the MetaPhlAn2 database (v20_m200), while yours are obtained with a more recent and updated one.
The new database includes more species, compared to the previous one, so this is the reason of why more species are now present.

Best

1 Like

Hi fbeghini!!!
Can I use this output profiled by this database (mpa_v296_CHOCOPhlAn_201901) as input for Humann2?
I am using MetaPhlAn version 2.96.1 (02 Feb 2020)

Thanks and Regards,
DC7

Hi, yes, you can use the 2.9 version of HUMAnN https://github.com/biobakery/humann/tree/2.9

1 Like

Hi there,

I am new with Metaphlan. I am trying to follow the tutorial of metaphlan2 at bitbucket, but when I ran the command: metaphlan2.py SRS014476-Supragingival_plaque.fasta.gz --input_type fasta > SRS014476-Supragingival_plaque_profile.txt

I got an error as following:

Downloading https://bitbucket.org/biobakery/metaphlan2/downloads/mpa_latest

Warning: Unable to download https://bitbucket.org/biobakery/metaphlan2/downloads/mpa_latest
Traceback (most recent call last):
File “/usr/local/bin/metaphlan2.py”, line 1442, in
metaphlan2()
File “/usr/local/bin/metaphlan2.py”, line 1164, in metaphlan2
pars[‘index’] = check_and_install_database(pars[‘index’], pars[‘bowtie2db’], pars[‘bowtie2_build’], pars[‘nproc’], pars[‘force_download’])
File “/usr/local/bin/metaphlan2.py”, line 570, in check_and_install_database
index = resolve_latest_database(bowtie2_db, force_redownload_latest)
File “/usr/local/bin/metaphlan2.py”, line 549, in resolve_latest_database
with open(os.path.join(bowtie2_db,‘mpa_latest’)) as mpa_latest:
FileNotFoundError: [Errno 2] No such file or directory: ‘/usr/local/bin/metaphlan_databases/mpa_latest’

Could you please advise me how to solve it?

Thank you very much,

Best,

Toan

Hi Toan,
which version of MetaPhlAn2 have you downloaded? The latest working and updated version is MetaPhlAn2 2.8, you can get it from GitHub here

The version you are running probably is an older one, still pointing to the BitBucket resources

Hi Francesco,

Thanks a lot for your reply!

I installed it with bioconda, so maybe I got an old version. I will try with the the latest one now. Do you have it in bioconda yet (I can’t find it)?

Cheers,

Toan

Hi Francesco,

It is working with the updated version MetaPhlAn2 2.8. I am running it on google colab. How long it usually takes to complete the run of a sample in the tutorial (it is taking quite long for me)? And what is the message when it is done?

Thank you,

Toan

But why I’m seeing MetaPhlAn version 2.96.1 (02 Feb 2020) !!! :roll_eyes: :roll_eyes:

It depends on how many threads have you specified, it can take up to one hour.
Nothing is prompted when the profiling is finished, you’ll have back your shell

You have to specify that you need to install version 2.8 (metaphlan2=2.8)

Hi Francesco,

Thank you for your reply!

I could make it done!

Cheers,

Toan

Hi Francesco,

Sorry for many questions! I am trying to re-generate the heatmap and cladogram from the tutorial of MetaPhlan2, but I could not install hclust2 and GraPhlAn either from biobakery or directly from source.

I also tried with metaphlan_hclust_heatmap.py from the Metaphlan 2.8, but it showed:

image

Could you please explain to me how to solve this?

Thank you very much,

Best,

Toan

metaphlan_hclust_heatmap.py is a older version of hclust2 and was not updated anymore, which problems have you encountered while installing hclust2 and/or graphlan? hclust2 is also available on bioconda, you can install it with conda install -c bioconda hclust2

Sir @fbeghini, can I run HUMAnN 2.9 with python version 3.7.6?

Hi @fbeghini,
I follow the tutorial on https://bitbucket.org/biobakery/biobakery/wiki/metaphlan2, I don’t understand why I could not install hclust2 either via bioconda or biobakery. Even I tried to download it but the link also has error.
Here is the error message when I installed it with bioconda or biobakery:

Collecting package metadata (current_repodata.json): done
Solving environment: failed with initial frozen solve. Retrying with flexible solve.
Solving environment: failed with repodata from current_repodata.json, will retry with next repodata source.
Collecting package metadata (repodata.json): done
Solving environment: failed with initial frozen solve. Retrying with flexible solve.
Solving environment: -
Found conflicts! Looking for incompatible packages.
This can take several minutes. Press CTRL-C to abortfailed

UnsatisfiableError: The following specifications were found
to be incompatible with the existing python installation in your environment:

Specifications:

  • hclust2 -> python[version=‘2.7.*|<3|>=2.7,<2.8.0a0’]

Your python: python=3.7

If python is on the left-most side of the chain, that’s the version you’ve asked for.
When python appears to the right, that indicates that the thing on the left is somehow
not available for the python version you are constrained to. Note that conda will not
change your python version to a different minor version unless you explicitly specify
that.

The following specifications were found to be incompatible with your CUDA driver:

  • feature:/linux-64::__cuda==10.1=0

Your installed CUDA driver is: 10.1

Could you please help me to check it?
Many thanks!
Toan

I am having a problem with hclust2 too. I installed it via Conda and it finishes the installation apparently without problems. I then run the commands as in the Metaphlan tutorial (mentioned above) and nothing happens, when I call hclust2 from the terminal it says that it is not an instance. When I open anaconda and search for it, it is there. I reinstalled again and this time around it says “The condensed distance matrix must contain only finite values” I will do a google search to try to solve the issue. Is this an hclust2 issue, or a data issue? Any pointers would help. I am new to this. Thanks in advance.