Hello, HUMAnN developers:
I have a question about regrouping genefamilies files into broader categories when the number of gene families is too large. Below is the workflow I followed based on the manual. Could you please check whether the order is correct?
First, perform humann_regroup_table on the genefamilies files obtained for each sample.
Next, apply humann_renorm_table to the regrouped genefamilies file for each sample (UNMAPPED is also included in this normalization).
Then, use humann_join_tables on all samples that have been regrouped and renormalized.
Finally, run humann_split_stratified_table on the joined file obtained from the third step.
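In case it makes the order easier to check, here is a rough sketch of the four steps as a shell script. The sample names, directory names, the uniref90_rxn grouping, and the cpm units are illustrative placeholders, not necessarily what I used or what is appropriate. By default the script only prints the commands (dry run); setting DRY_RUN=0 would execute them, which assumes the HUMAnN utility scripts are on the PATH.

```shell
#!/bin/sh
# Dry-run sketch of the order described above: regroup -> renorm -> join -> split.
# ASSUMPTIONS: sample names, directory names, the uniref90_rxn grouping,
# and the cpm units are hypothetical placeholders chosen for illustration.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands; 0 = actually run them
PLAN=""                   # records every command in the order it is issued

run() {
    PLAN="${PLAN}$*
"
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

run mkdir -p regrouped renormed split

for sample in sampleA sampleB; do
    # Step 1: regroup each sample's gene families
    run humann_regroup_table --input "${sample}_genefamilies.tsv" \
        --groups uniref90_rxn --output "regrouped/${sample}_rxn.tsv"
    # Step 2: renormalize each regrouped table (UNMAPPED still present)
    run humann_renorm_table --input "regrouped/${sample}_rxn.tsv" \
        --units cpm --output "renormed/${sample}_rxn_cpm.tsv"
done

# Step 3: join the regrouped, renormalized per-sample tables
run humann_join_tables --input renormed --output joined_rxn_cpm.tsv \
    --file_name rxn_cpm

# Step 4: split the joined table into stratified and unstratified tables
run humann_split_stratified_table --input joined_rxn_cpm.tsv --output split
```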
Additionally, I’d like to ask whether the “UNMAPPED” and “UNGROUPED” rows in the final output can simply be deleted, and if so, whether another round of normalization is needed afterward.
Thank you!