🧬 Join us in Boston for the 2026 bioBakery Workshop
(Cambridge, MA · July 20–24) —
Learn more & apply
🔬 We’re also delighted to welcome you to the
8th Annual HCMPH Symposium
(May 18, 2026) —
Translating the Microbiome: Turning Discovery into Implementation
View symposium details
Hi community!!! @sma@franzosa@sagunmaharjann
I am using MetaPhlAn output data as input for lefse in Galaxy. But, I am seeing that a single output graph shows various taxa level within it. This is my output. But I want only a single taxa level (e.g. only species) to be shown in a single plot. How can I get that? Am I doing anything wrong?
LEfSe automatically goes up the taxonomy to test for features at all levels. Given your input is MetaPhlAn (so should have features at all taxonomy levels to begin with?), here is what I’d suggest:
First, filter your MetaPhlAn table so that it only contains species level features.
“Trick” LEfSe to not test features at all taxonomy levels. You can achieve this by manipulating the species names. You can either remove all the other taxa levels preceding the species name, e.g, “k__Bacteria|p__Actinobacteria|c__Actinobacteria|o__Actinomycetales|f__Corynebacteriaceae|g__Corynebacterium|s__Corynebacterium_matruchotii” -> “s__Corynebacterium_matruchotii”. Alternatively, replace the taxon separator “|” with something LEfSe won’t recognize, such as “;”, e.g., -> “k__Bacteria;p__Actinobacteria;c__Actinobacteria;o__Actinomycetales;f__Corynebacteriaceae;g__Corynebacterium;s__Corynebacterium_matruchotii”. The latter might be simpler to do, but gives you less pretty feature names (the full name will appear instead of just s__Corynebacterium_matruchotii).
Let me know if you’d need help on any of this.
Best,
Siyuan
Hi @sma I have already tested the first option. It did not work. I have seen that the first step (“format data for lefse”) was successfully completed. But, the second step (“LDA Effect size”) was failed.
Thanks,
dc
Hi @Kelsey_Thompson !!! This is the command I have used to generate species level merged abundance file:
grep -E “(s__)|(^clade_name)” merged_abundance_table.txt | grep -v “t__” | sed ‘s/^.*s__//g’ > merge_abundance_species.txt
here is a reproducible part from my data:
|type|highbp|control|control|
|---|---|---|---|
|clade_name|ERR185652_profile|ERR298068_profile|ERR321205_profile|
|Actinobaculum_sp_oral_taxon_183|0|0|0|
|Actinomyces_graevenitzii|0|0|0|
|Actinomyces_naeslundii|0|0|0|
|Actinomyces_odontolyticus|0|0|0.00341|
|Actinomyces_oris|0|0|0|
|Actinomyces_sp_HMSC035G02|0|0|0|
|Actinomyces_sp_HPA0247|0|0|0|
|Actinomyces_sp_ICM47|0|0|0.0042|
|Actinomyces_sp_S6_Spd3|0|0|0|
|Actinomyces_sp_oral_taxon_181|0|0|0|
|Actinomyces_sp_oral_taxon_414|0|0|0|
|Actinomyces_turicensis|0|0|0|
|Varibaculum_cambriense|0|0|0|
|Aeriscardovia_aeriphila|0.00454|0|0|
|Alloscardovia_omnicolens|0|0|0|
|Bifidobacterium_adolescentis|0.26989|0.21046|0|
|Bifidobacterium_animalis|0|0.03512|0|
|Bifidobacterium_bifidum|0|0.07668|0|
|Bifidobacterium_breve|0|0|0|
|Bifidobacterium_catenulatum|0|0.06641|0|
|Bifidobacterium_dentium|0|0|0|
|Bifidobacterium_longum|2.46071|1.68152|0|
|Bifidobacterium_pseudocatenulatum|0|0|0|
|Bifidobacterium_pullorum|0|0|0|
|Gardnerella_vaginalis|0|0|0|
Would you mind sharing your input table with us? It’s a bit difficult to see what the issue might be without the file. You can reach me at siyuanma@g.harvard.edu to share the file.