there are currently two scripts with similar names. panphlan_pangenome_generation.py and panphlan_new_pangenome_generation.py
The first one comes from previous version where the user should provide fasta file and annotation files to generate the pangenome himself (using the USEARCH software).
The second script assume that you already have everything and only need. For the next step (mapping with panphlan_map.py), one only need to generate the Bowtie2 indexes from fna files.
Anyway, I confess this isn’t the best option anyway. We are currently working on a full pangenome database, providing fasta, annotation and Bowtie2 indexes, that should be available soon.
Thank you. I am quite new to bioinformatics so was struggling a bit. I have the new version of panphlan so can I create my own pangenome with fasta files and annotation files?
creating your own pangenome with fasta and annotation is actually part of the “old code” ( panphlan_pangenome_generation.py) from PanPhlAn 1.2. For that you’ll need the USEARCH software for clustering (https://www.drive5.com/usearch/)
Thank you. I have just realized that I was using the 1.2 version so I should be able to make my own pangenome. Also, the panphlan_erectale16_pangenome.csv export from ChocoPhlan looks different to the panphlan_39491_pangenome.csv in the tutorial. Is there a reason one shows the UniProtID and the other the locus tags instead?