I will explain more first thing first I downloaded the Uniref90 from UniProt as a fasta file and after that converted it to Diamond extension using diamond makedb --in uniref90_201901b_fullt.fasta -d uniref90_201901b_full.dmnd now I have
and I run HUMAnN
humann -i “$input_fastq_dir” -o “$output_dir” --metaphlan “$metaphlan_database” --nucleotide-database “$chocophlan_database” --protein-database “$uniref_database” --search-mode uniref90 -v --metaphlan “$metaphlan_profile” --prescreen-threshold 0.01 --nucleotide-query-coverage-threshold 90.0 --nucleotide-subject-coverage-threshold 50.0 --evalue 1.0 --translated-identity-threshold 50.0 --translated-query-coverage-threshold 90.0 --translated-subject-coverage-threshold 50.0 --minpath on --pathways metacyc --annotation-gene-index 3 --log-level INFO --output-basename demo_m_samples --threads 8
Output files will be written to: /media/mo/New_Volume/HumanN_tot/output
Writing temp files to directory: /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp
File ( /media/mo/New_Volume/HumanN_tot/output/sample_name_humann_temp/tmpbm6io_rr/demo_m_samples.fastq ) is of format: fastq
Running metaphlan …
/home/mo/anaconda3/envs/HUMAnN/bin/metaphlan /media/mo/New_Volume/HumanN_tot/output/sample_name_humann_temp/tmpbm6io_rr/demo_m_samples.fastq -t rel_ab -o /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_metaphlan_bugs_list.tsv --input_type fastq --bowtie2out /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_metaphlan_bowtie2.txt --nproc 8
TIMESTAMP: Completed prescreen : 1294 seconds
Found t__SGB7967 : 97.10% of mapped reads ( s__Enterococcus_faecium,g__Enterococcus.s__Enterococcus_sp_BSD2780120874b_170522_B6,g__Enterococcus.s__Enterococcus_sp_HMSC035B04,g__Enterococcus.s__Enterococcus_sp_H57,g__Enterococcus.s__Enterococcus_sp_HMSC069A01,g__Enterococcus.s__Enterococcus_sp_HMSC063H10,g__Enterococcus.s__Enterococcus_sp_HMSC035C10,g__Enterococcus.s__Enterococcus_sp_GMD3E,g__Enterococcus.s__Enterococcus_sp_GMD4E,g__Enterococcus.s__Enterococcus_sp_GMD5E,g__Enterococcus.s__Enterococcus_sp_HMSC060D07,g__Enterococcus.s__Enterococcus_sp_GMD2E,g__Enterococcus.s__Enterococcus_sp_HMSC074F07,g__Enterococcus.s__Enterococcus_sp_HMSC34G12,g__Enterococcus.s__Enterococcus_sp_HMSC034B11,g__Enterococcus.s__Enterococcus_sp_GMD1E,g__Enterococcus.s__Enterococcus_sp_HMSC077E07,g__Enterococcus.s__Enterococcus_sp_HMSC070F12,g__Enterococcus.s__Enterococcus_sp_HMSC055G03,g__Enterococcus.s__Enterococcus_sp_HMSC058D07,g__Enterococcus.s__Enterococcus_sp_105332 )
Found t__SGB10115 : 2.24% of mapped reads ( s__Klebsiella_oxytoca,g__Klebsiella.s__Klebsiella_africana,g__Klebsiella.s__Klebsiella_sp_MS_92_3,g__Klebsiella.s__Klebsiella_sp_01_6622,g__Klebsiella.s__Klebsiella_sp_Kps,g__Klebsiella.s__Klebsiella_sp_K1,g__Klebsiella.s__Klebsiella_sp_KGM_IMP216,g__Klebsiella.s__Klebsiella_sp_28,g__Klebsiella.s__Klebsiella_sp_P1927,g__Klebsiella.s__Klebsiella_sp_HMSC16A12,g__Klebsiella.s__Klebsiella_sp_10,g__Klebsiella.s__Klebsiella_aerogenes,g__Klebsiella.s__Klebsiella_sp_4_1_44FAA,g__Klebsiella.s__Klebsiella_sp_AqSCr,g__Klebsiella.s__Klebsiella_sp_01_3681,g__Klebsiella.s__Klebsiella_sp_P1954,g__Klebsiella.s__Klebsiella_sp_K4,g__Klebsiella.s__Klebsiella_sp_KBG1 )
Found t__SGB8002 : 0.64% of mapped reads ( s__Streptococcus_thermophilus )
Total species selected from prescreen: 43
Selected species explain 99.99% of predicted community composition
Creating custom ChocoPhlAn database …
/usr/bin/gunzip -c /media/mo/New_Volume/HumanN_tot/chocophlan/g__Klebsiella.s__Klebsiella_aerogenes.centroids.v201901_v31.ffn.gz /media/mo/New_Volume/HumanN_tot/chocophlan/g__Klebsiella.s__Klebsiella_oxytoca.centroids.v201901_v31.ffn.gz /media/mo/New_Volume/HumanN_tot/chocophlan/g__Klebsiella.s__Klebsiella_pneumoniae.centroids.v201901_v31.ffn.gz /media/mo/New_Volume/HumanN_tot/chocophlan/g__Enterococcus.s__Enterococcus_faecium.centroids.v201901_v31.ffn.gz /media/mo/New_Volume/HumanN_tot/chocophlan/g__Streptococcus.s__Streptococcus_thermophilus.centroids.v201901_v31.ffn.gz
TIMESTAMP: Completed custom database creation : 9 seconds
Running bowtie2-build …
/home/mo/anaconda3/envs/HUMAnN/bin/bowtie2-build -f /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_custom_chocophlan_database.ffn /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_bowtie2_index
TIMESTAMP: Completed database index : 374 seconds
Running bowtie2 …
/home/mo/anaconda3/envs/HUMAnN/bin/bowtie2 -q -x /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_bowtie2_index -U /media/mo/New_Volume/HumanN_tot/output/sample_name_humann_temp/tmpbm6io_rr/demo_m_samples.fastq -S /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_bowtie2_aligned.sam -p 8 --very-sensitive
TIMESTAMP: Completed nucleotide alignment : 981 seconds
TIMESTAMP: Completed nucleotide alignment post-processing : 2053 seconds
Total bugs from nucleotide alignment: 5
g__Enterococcus.s__Enterococcus_faecium: 18887240 hits
g__Klebsiella.s__Klebsiella_pneumoniae: 993738 hits
g__Klebsiella.s__Klebsiella_oxytoca: 155895 hits
g__Streptococcus.s__Streptococcus_thermophilus: 97610 hits
g__Klebsiella.s__Klebsiella_aerogenes: 130875 hits
Total gene families from nucleotide alignment: 46095
Unaligned reads after nucleotide alignment: 24.9416395908 %
Running diamond …
Aligning to reference database: uniref90_201901b.fasta.gz.dmnd
/home/mo/anaconda3/envs/HUMAnN/bin/diamond blastx --query /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_bowtie2_unaligned.fa --evalue 1.0 --threads 8 --top 1 --outfmt 6 --db /media/mo/New_Volume/HumanN_tot/uniref90/uniref90_201901b.fasta.gz --out /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/tmpel7022_e/diamond_m8_r_p620bo --tmpdir /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/tmpel7022_e
CRITICAL ERROR: Error executing: /home/mo/anaconda3/envs/HUMAnN/bin/diamond blastx --query /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/demo_m_samples_bowtie2_unaligned.fa --evalue 1.0 --threads 8 --top 1 --outfmt 6 --db /media/mo/New_Volume/HumanN_tot/uniref90/uniref90_201901b.fasta.gz --out /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/tmpel7022_e/diamond_m8_r_p620bo --tmpdir /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/tmpel7022_e
Error message returned from diamond :
diamond v2.1.8.162 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of Tuebingen
Documentation, support and updates available at http://www.diamondsearch.org
Please cite: Sensitive protein alignments at tree-of-life scale using DIAMOND | Nature Methods Nature Methods (2021)
#CPU threads: 8
Scoring parameters: (Matrix=BLOSUM62 Lambda=0.267 K=0.041 Penalties=11/1)
Temporary directory: /media/mo/New_Volume/HumanN_tot/output/demo_m_samples_humann_temp/tmpel7022_e
Percentage range of top alignment score to report hits: 1
Opening the database…
Error opening file /media/mo/New_Volume/HumanN_tot/uniref90/uniref90_201901b.fasta.gz: No such file or directory