Our results, both bowtie2out and profile, of the MetaPhlan4.0 tutorial differ from the sample output files; even the sum of the abundances is not 100. Please help me identify where I did wrong. We use the biobakery/metaphlan:4.0.2 docker image and create a database using the --bowtie2db flag to specify the folder.
metaphlan --install --bowtie2db /mnt/share/metaphlan_bowtie2db/vOct22
Then, I ran metaphlan on the sample SRS014476-Supragingival_plaque.fasta.gz file, which was complete without error.
root@c0f5e1794384:/mnt/share/metaphlan_analysis# metaphlan SRS014476-Supragingival_plaque.fasta.gz --input_type fasta > SRS014476-Supragingival_plaque_profile.txt --bowtie2db /mnt/share/metaphlan_bowtie2db/vOct22
WARNING: The metagenome profile contains clades that represent multiple species merged into a single representant.
An additional column listing the merged species is added to the MetaPhlAn output.
Here are the first 12 lines of the profile result. 19048 reads processed are at least the same, but the abundances differ from the sample.
#mpa_vOct22_CHOCOPhlAnSGB_202212
#/usr/local/bin/metaphlan SRS014476-Supragingival_plaque.fasta.gz --input_type fasta --bowtie2db /mnt/share/metaphlan_bowtie2db/vOct22
#19048 reads processed
#SampleID Metaphlan_Analysis
#clade_name NCBI_tax_id relative_abundance additional_species
k__Bacteria 2 100.0
k__Bacteria|p__Actinobacteria 2|201174 55.36506
k__Bacteria|p__Firmicutes 2|1239 44.63494
k__Bacteria|p__Actinobacteria|c__Actinomycetia 2|201174|1760 55.36506
k__Bacteria|p__Firmicutes|c__Bacilli 2|1239|91061 44.63494
k__Bacteria|p__Firmicutes|c__Bacilli|o__Lactobacillales 2|1239|91061|186826 44.63494
k__Bacteria|p__Actinobacteria|c__Actinomycetia|o__Corynebacteriales|f__Corynebacteriaceae 2|201174|1760|85007|1653 55.36506
Sample result:
#mpa_vJan21_CHOCOPhlAnSGB_202103
#<metaphlan command>
#19048 reads processed
#SampleID Metaphlan_Analysis
#clade_name NCBI_tax_id relative_abundance additional_species
k__Bacteria 2 100.0
k__Bacteria|p__Actinobacteria 2|201174 94.8922
k__Bacteria|p__Proteobacteria 2|1224 5.1078
k__Bacteria|p__Actinobacteria|c__Actinobacteria 2|201174|1760 94.8922
k__Bacteria|p__Proteobacteria|c__Betaproteobacteria 2|1224|28216 5.1078
k__Bacteria|p__Actinobacteria|c__Actinobacteria|o__Corynebacteriales 2|201174|1760|85007 53.56955
k__Bacteria|p__Actinobacteria|c__Actinobacteria|o__Micrococcales 2|201174|1760|85006 40.15913
I attached our bowtie2out and profile result files.
SRS014476-Supragingival_plaque.fasta.gz.bowtie2out.txt (53.6 KB)
SRS014476-Supragingival_plaque_profile.txt (2.9 KB)