Thanks for using GraPhlAn. One solution to this problem might be to use shortened versions of the taxa names. You could change Lactobacillus to Lactob, Lactococcus to Lactoc, Faecalibacterium to Faecali, etc. You would do this by writing the annotation in your .annot.txt as follows:
#
#
#
Lactobacillus
annotation
Lactob:Lactobacillus
Lactococccus
annotation
Lactoc:Lactococcus
Faecalibacterium
annotation
Faecali:Faecalibacterium
Here, the first column is the name of your node, the second column says that you are annotating this node, and the third column is the label. As you can see, in the third column I’ve written the shorthand before the colon. For example, Lactob occurs before :Lactobacillus, which tells GraPhlAn that you want to use a shorthand annotation for the Lactobacillus node. Importantly, GraPhlAn will also produce a legend that explains what the abbreviations stand for.
I’d recommend doing this wherever there’s an overlap. It will involve some manual inspection of the tree but it should tidy up the labels.
Hope this helps! Let me know if you have any additional questions.
Thank you so much for your help and attention. Now I can shorten the taxa names, and, as a result, I have a much better GraPhlAn output.
I’d appreciate it if you could also let me know if it is possible to include all the taxon names in the output files. For instance, Treponema berlinense doesn’t show up on my output files. I have tried to change the settings of --abundance_threshold and --most_abundant without success.
Thank you, Aaron. I am delighted with my new GraPhlAn figures!