Hello I tried changing the values for the flags “–marker_in_n_samples” and “–sample_with_n_maker”. They did not work.
Many of my metagenome samples have no idenfied microbial species. I removed them and only included the ones with identified spiecies and abundance values from the profile.tsv files. The error message still pop up.
I went ahead and selected a subset of the metagenomes samples that have 100% abundance (making sure the markers should be identified) of the species I focus on, and rerun the strainphlan, however the same error message popped up.
Lastly I changed my species specification in strainphlan to a more common microbe and reran strainphlan. It still did not work.
When I look back on my error log, there was a bowtie issue occured during db marker fasta stage yet the error did not stop the bash flow. Would this be the issue? Could any one help address? I created a conda environment for the metaphlan package.
Thu Feb 17 15:16:39 2022: Start samples to markers execution
Thu Feb 17 15:16:39 2022: Decompressing samples…
Thu Feb 17 15:16:41 2022: Done.
Thu Feb 17 15:16:41 2022: Converting samples to BAM format…
Thu Feb 17 15:16:47 2022: Done.
Thu Feb 17 15:16:47 2022: Sorting BAM samples…
Thu Feb 17 15:16:54 2022: Done.
Thu Feb 17 15:16:54 2022: Getting consensus markers from samples…
Thu Feb 17 15:16:54 2022: Processing sample: consensus_markers/tmp/XXX.end1.fq.bam
Thu Feb 17 15:17:03 2022: Done.
Thu Feb 17 15:17:03 2022: Processing sample: consensus_markers/XXX.end1.fq.bam
Thu Feb 17 15:17:16 2022: Done.
Thu Feb 17 15:17:16 2022: Processing sample: consensus_markers/XXX.end2.fq.bam
Thu Feb 17 15:17:26 2022: Done.
Thu Feb 17 15:20:46 2022: Finish samples to markers execution (247.21 seconds): Results are stored at “/XX/Strainphlan_analyses/consensus_markers/”
Thu Feb 17 15:20:46 2022: Start extract markers execution
Thu Feb 17 15:20:46 2022: Generating DB markers FASTA…/miniconda3/envs/XXX/bin/bowtie2-inspect:24: DeprecationWarning: the imp module is deprecated in favour of importlib; see the module’s documentation for alternative uses
import imp
Thu Feb 17 15:21:49 2022: Done.
Thu Feb 17 15:21:49 2022: Loading MetaPhlan 3.0 database…
Thu Feb 17 15:21:58 2022: Done.
Thu Feb 17 15:21:58 2022: Number of markers for the clade “s__XXX”: 133
Thu Feb 17 15:21:58 2022: Exporting markers…
Thu Feb 17 15:22:14 2022: Done.
Thu Feb 17 15:22:14 2022: Finish extract markers execution (87.74 seconds): Results are stored at “/XX/Strainphlan_analyses/db_markers/”
Thu Feb 17 15:22:15 2022: Start StrainPhlAn 3.0 execution
Thu Feb 17 15:22:15 2022: Creating temporary directory…
Thu Feb 17 15:22:15 2022: Done.
Thu Feb 17 15:22:15 2022: Getting markers from main sample files…
Thu Feb 17 15:22:15 2022: Done.
Thu Feb 17 15:22:15 2022: Getting markers from main reference files…
Thu Feb 17 15:22:15 2022: Done.
Thu Feb 17 15:22:15 2022: Removing bad markers / samples…
[e] Phylogeny can not be inferred. Too many samples were discarded
Thu Feb 17 15:22:15 2022: Stop StrainPhlAn 3.0 execution.