Uniref90 database

N_Hiroshi · April 8, 2022, 8:23am

Hello.
I performed metagenome analysis using humann3 with the full UniRef90 database (20.7GB). I found many UniRef100-IDs in the file analyzed with UniRef90 database. Why are UniRef100-IDs included in Uniref90 database? Do I mistake anywhere?
Thank you.

franzosa · April 8, 2022, 1:21pm

Do you mean 1) that you saw IDs that looked like UniRef100_XYZ or 2) that you saw IDs that looked like UniRef90_XYZ where XYZ is also a UniRef100 member? Note that UniRef90 is constructed by clustering UniRef100, so every UniRef90 representative will also be a UniRef100 representative.

N_Hiroshi · April 11, 2022, 3:46am

Dear Franzosa,

Thank you for your comment.
For example, Uniref90_A0A015URD2 looks like Uniref100-IDs. Uniref90_A0A015URD2 is a member of cluster-UniRef90_C7X9K7. So, should UniRef90_C7X9K7 be used instead of Uniref90_A0A015URD2?

franzosa · June 28, 2022, 8:44pm

Sorry for missing this reply. There is still some confusion though, as UniRef90_X can’t be a member of UniRef90_Y: UniRef90s are non-overlapping clusters. The extension after the “UniRef90_” prefix is just a UniProtKB identifier, and those can vary in form depending on the source proteome from which the corresponding protein derives. For example, “A0A015URD2” and “C7X9K7” both look to me like UniProt (protein) IDs which could (in theory) have been selected as representatives for UniRef90 clusters.

Topic		Replies	Views
Uniref50s in humann2 output when using "--search-mode uniref90" HUMAnN	1	846	November 16, 2019
Different UniRef90 ID has the same nucleotide sequences in ChocoPhlAn database HUMAnN	3	529	August 4, 2020
Eukaryotic Uniref90 Gene Families in Gene Family TSV Files HUMAnN	20	2274	April 2, 2020
UniRef90 to UniRef50 conversion using HUMAnN3.0 HUMAnN	1	217	October 20, 2023
Metaphlan-informed taxonomic stratification and its discrepancies with unirefids HUMAnN	2	61	August 27, 2025

Uniref90 database

Related topics