A substantial part of my thesis work consists of analyzing metagenomic datasets and their taxonomic classification. In particular, I am interested in studying the differences in the human gut microbiome between healthy subjects and IBD/UC/CD patients, and in understanding them with the tools of statistical physics. To carry out this analysis I am collecting a variety of datasets, and the one used in your article “Gut microbiome structure and metabolic activity in inflammatory bowel disease” is a good candidate for the study. We would therefore like to ask whether it is possible to access the diagnostic metadata needed to distinguish control/healthy subjects from clinical patients. Ideally, we would like to obtain a label for each of the samples available online at NCBI [1], extending the metadata CSV for the 220 samples as in the following example:
RUN SUBJECT_ID CLINIC
SRR6468499 XYZ IBD/CONTROL
SRR6468500 XYT IBD/CONTROL