I am interested in visualizing HUMANn2 output in WHAM! (https://ruggleslab.shinyapps.io/wham_v1/). WHAM! requires a certain type of input data. The sample file contains three columns called “Acc,” “Feature,” and “Taxa” and lists the gene content as read counts.
By contrast, the HUMANn2 output has one column that combines Acc, Feature, and Taxa and gives gene content as RPK.
I am interested in converting my HUMANn2 output file into a format that can be analyzed in WHAM! or other programs that use read counts as input. My understanding is that I would need to multiply each number by the kilobases in each gene (RPK = reads per kilobase). Is there a simple way to find out the kilobases in each gene so that this conversion can be done?
Also, I can probably write an R script to split up the one column from HUMANn2 output into three columns, but is there any easier way to accomplish this?