Picrust 2: A subset of reference sequences aligned poorly to the Reference sequences

Abel_Tan · August 26, 2024, 3:26am

Hi,

Im using the version of picrust2 as of May 2024.

I’m trying to validate the accuracy of picrust2 for predicting my own trait data.
I randomly sampled a subset of 16s RRNA sequences from the reference sequence fasta file (picrust2/picrust2/default_files/prokaryotic at master · picrust/picrust2 · GitHub)
/pro_ref/pro_ref.fna.

Then I removed gaps by replacing - with “”.

Then I ran place_seqs on the random subsets.

place_seqs.py -s input_seqs -o output_tree -p 1 --intermediate intermediate_folder

I got the error :
Stopping - all 4201 input sequences aligned poorly to reference sequences (–min_align option specified a minimum proportion of 0.8 aligning to reference sequences).

How can reference sequences be poorly aligned to refererence sequences?

Regards
Abel Tan

nearinj · September 3, 2024, 7:33pm

Hi @Abel_Tan,

I’m not exactly sure what is going on there. I would suggest making a post on the piecrust user group which the main authors actively watch.

https://groups.google.com/g/picrust-users

I think there might also be similar questions on that group around this type of analysis.

Cheers,
Jacob Nearing

Topic	Replies	Views
PICRSt2 Placing reads into reference tree error PICRUSt	258	April 3, 2023
Picrust2 - reference database issue PICRUSt	495	June 22, 2021
Test failing at test_place_seqs.py PICRUSt	775	June 23, 2023
Missing samples in PICRUSt2 output files PICRUSt	570	February 21, 2022
Please direct PICRUSt questions to its external user group (link here) PICRUSt	420	June 30, 2023

Picrust 2: A subset of reference sequences aligned poorly to the Reference sequences

Related topics