Hello, I installed humann on a workstation and installed the nucleotide database using the following command:
humann_databases --download chocophlan full /home/anth445/humann_db/chocophlan
And I have the following configuration settings:
humann --version
humann v3.6.1
metaphlan --version
MetaPhlAn version 4.0.6 (1 Mar 2023)
humann_config --print
HUMAnN Configuration ( Section : Name = Value )
database_folders : nucleotide = /home/anth445/humann_db/chocophlan
database_folders : protein = /home/anth445/humann_db/uniref
database_folders : utility_mapping = /home/anth445/anaconda3/envs/mamba/envs/humann/lib/python3.10/site-packages/humann/data/misc
run_modes : resume = False
run_modes : verbose = False
run_modes : bypass_prescreen = False
run_modes : bypass_nucleotide_index = False
run_modes : bypass_nucleotide_search = False
run_modes : bypass_translated_search = False
run_modes : threads = 1
alignment_settings : evalue_threshold = 1.0
alignment_settings : prescreen_threshold = 0.01
alignment_settings : translated_subject_coverage_threshold = 50.0
alignment_settings : translated_query_coverage_threshold = 90.0
alignment_settings : nucleotide_subject_coverage_threshold = 50.0
alignment_settings : nucleotide_query_coverage_threshold = 90.0
output_format : output_max_decimals = 10
output_format : remove_stratified_output = False
output_format : remove_column_description_output = False
When I try and run the following command:
humann -i dataset/merged_reads/LG_merged.fastq_fixed.gz -o dataset/merged_reads/humann2_output/ --threads 3
Humann fails and and I get the following log:
03/08/2023 02:05:25 PM - humann.search.prescreen - INFO: Running metaphlan ........
03/08/2023 02:05:25 PM - humann.utilities - DEBUG: Using software: /home/anth445/anaconda3/envs/mamba/envs/humann/bin/metaphlan
03/08/2023 02:05:25 PM - humann.utilities - INFO: Execute command: /home/anth445/anaconda3/envs/mamba/envs/humann/bin/metaphlan /home/anth445/dataset/merged_reads/humann2_output/1-E_merged_humann_temp/tmpujun740b/tmpc6tyd1bn -t rel_ab -o /home/anth445/dataset/merged_reads/humann2_output/1-E_merged_humann_temp/1-E_merged_metaphlan_bugs_list.tsv --input_type fastq --bowtie2out /home/anth445/dataset/merged_reads/humann2_output/1-E_merged_humann_temp/1-E_merged_metaphlan_bowtie2.txt --nproc 5
03/08/2023 03:34:03 PM - humann.utilities - DEBUG: b'WARNING: The metagenome profile contains clades that represent multiple species merged into a single representant.\nAn additional column listing the merged species is added to the MetaPhlAn output.\n'
03/08/2023 03:34:03 PM - humann.humann - INFO: TIMESTAMP: Completed prescreen : 5318 seconds
03/08/2023 03:34:03 PM - humann.search.prescreen - DEBUG: Taxon not in mapping file: k__Bacteria|p__Proteobacteria|c__Alphaproteobacteria|o__Sphingomonadales|f__Sphingomonadaceae|g__GGB73774|s__GGB73774_SGB56310|t__SGB56310 2|1224|28211|204457|41297||| 54.82873
03/08/2023 03:34:03 PM - humann.search.prescreen - DEBUG: Taxon not in mapping file: k__Bacteria|p__Proteobacteria|c__CFGB76227|o__OFGB76227|f__FGB76227|g__GGB79622|s__GGB79622_SGB56399|t__SGB56399 2|1224|||||| 1.53298
03/08/2023 03:34:03 PM - humann.search.prescreen - ERROR: The MetaPhlAn taxonomic profile provided was not generated with the database version v3 or vJan21 . Please update your version of MetaPhlAn to at least v3.0 or if you are using MetaPhlAn v4 please use the database vJan21.
How do change the database to be vJan21? I tried to use the command:
metaphlan --install --index mpa_vJan21_CHOCOPhlAnSGB_202103 --bowtie2db <database folder>
which results in these files:
mpa_latest
mpa_vJan21_CHOCOPhlAnSGB_202103.1.bt2l
mpa_vJan21_CHOCOPhlAnSGB_202103.2.bt2l
mpa_vJan21_CHOCOPhlAnSGB_202103.3.bt2l
mpa_vJan21_CHOCOPhlAnSGB_202103.4.bt2l
mpa_vJan21_CHOCOPhlAnSGB_202103.pkl
mpa_vJan21_CHOCOPhlAnSGB_202103.rev.1.bt2l
mpa_vJan21_CHOCOPhlAnSGB_202103.rev.2.bt2l
mpa_vJan21_CHOCOPhlAnSGB_202103_VINFO.csv
mpa_vJan21_CHOCOPhlAnSGB_202103_VSG.fna
mpa_vOct22_CHOCOPhlAnSGB_202212_bt2.md5
mpa_vOct22_CHOCOPhlAnSGB_202212_bt2.tar
But when I ran humann with the metaphlan option:
humann -i dataset/merged_reads/LG_merged.fastq_fixed.gz -o dataset/merged_reads/humann2_output/ --threads 3 --metaphlan-options '--bowtie2db /home/anth445/chocophlan_update'
It again broke, and I got this log file:
03/08/2023 03:41:35 PM - humann.utilities - INFO: Execute command: /home/anth445/anaconda3/envs/mamba/envs/humann/bin/metaphlan /home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/tmpfxks24v5/tmpabt3cvlz --bowtie2db /home/anth445/chocophlan_update -o /home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/LG_merged_metaphlan_bugs_list.tsv --input_type fastq --bowtie2out /home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/LG_merged_metaphlan_bowtie2.txt --nproc 3
03/08/2023 03:41:54 PM - humann.utilities - CRITICAL: Error executing: /home/anth445/anaconda3/envs/mamba/envs/humann/bin/metaphlan /home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/tmpfxks24v5/tmpabt3cvlz --bowtie2db /home/anth445/chocophlan_update -o /home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/LG_merged_metaphlan_bugs_list.tsv --input_type fastq --bowtie2out /home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/LG_merged_metaphlan_bowtie2.txt --nproc 3
Error message returned from metaphlan :
Downloading MetaPhlAn database
Please note due to the size this might take a few minutes
\Downloading and uncompressing indexes
File /home/anth445/chocophlan_update/mpa_vOct22_CHOCOPhlAnSGB_202212_bt2.tar already present!
Downloading http://cmprod1.cibio.unitn.it/biobakery4/metaphlan_databases/bowtie2_indexes/mpa_vOct22_CHOCOPhlAnSGB_202212_bt2.md5
Downloading file of size: 0.00 MB
0.01 MB 11070.27 % 55.26 MB/sec 0 min -0 sec ^MMD5 checksums do not correspond! If this happens again, you should remove the database files and rerun MetaPhlAn so they are re-downloaded
03/08/2023 03:41:54 PM - humann.utilities - CRITICAL: TRACEBACK:
Traceback (most recent call last):
File "/home/anth445/anaconda3/envs/mamba/envs/humann/lib/python3.10/site-packages/humann/utilities.py", line 761, in execute_command
p_out = subprocess.check_output(cmd, stderr=subprocess.STDOUT)
File "/home/anth445/anaconda3/envs/mamba/envs/humann/lib/python3.10/subprocess.py", line 421, in check_output
return run(*popenargs, stdout=PIPE, timeout=timeout, check=True,
File "/home/anth445/anaconda3/envs/mamba/envs/humann/lib/python3.10/subprocess.py", line 526, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['/home/anth445/anaconda3/envs/mamba/envs/humann/bin/metaphlan', '/home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/tmpfxks24v5/tmpabt3cvlz', '--bowtie2db', '/home/anth445/chocophlan_update', '-o', '/home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/LG_merged_metaphlan_bugs_list.tsv', '--input_type', 'fastq', '--bowtie2out', '/home/anth445/dataset/merged_reads/humann2_output/LG_merged_humann_temp/LG_merged_metaphlan_bowtie2.txt', '--nproc', '3']' returned non-zero exit status 1.
Could you please advise what I should do next?
Thank you!