Hi HUMAnN2 team,
I’m working on assigning taxonomy to UniRef90 hits and would like to integrate into the HUMAnN2 environment for taxonomic assignments of hits. I have output from a diamond blastx vs UniRef90 run so UniRef90 IDs as well as a pipe character followed by a string, e.g., UniRef90_R5XQM2
and UniRef90_R5XQM2|1416
. I tried joining the UniRef90_R5XQM2
on your uniref90-tol-lca.dat
file but this offers pretty limited taxonomic information where I would expect something deeper. Is there a way to use your humann2_infer_taxonomy on diamond blastx vs UniRef90 runs?
The reason I’m not using the HUMAnN2 pipeline is that I’m working in an environment with a ton of novelty, so had to build my own gene catalogue and am checking annotations across several databases (UniRef, PFAM, KEGG etc.). I understand this is outside the scope of using the HUMAnN2 pipeline but was hoping you can offer some guidance nonetheless. Thank you!