I really like what your package has to offer but I’m having a difficult time understanding my output. I expect to get one dataset but it seems like I’m getting three distinct count datasets and metadata within SyntheticMicrobiome-Count.pcl.
Here are my questions:
When I specify number_metadata = 2, I get 8 rows of metadata. Why is the number of continuous metadata doubled? I see that the function says this will happen, but I don’t know why.
The second chunk is the ‘null community.’ What do you mean by this? If I don’t want any outliers or correlations, do I just grab the ‘null community’ and ignore everything else?
What is the outlier chunk? What does this mean? Outlier Swap: Feature_Outlier_137 Sample: 45
If I’m looking for correlations with my metadata, do I just grab the “feature spiked” chunk? Or do I need the null community AND the feature spiked chunk?
Also, is there a minimum number of samples? If I try to run 10 samples with 50 microbes, I get an error.
Finally, is there a paper on sparseDOSSA that I’ve missed? I can’t find it.