Questions about ibdmdb datasets

sagunmaharjann · April 20, 2021, 3:34pm

I have been working on the development of statistical models for microbiome data analysis. Recently, I have been developing a statistical model for omics data analysis with one of my students.

While looking for interesting datasets to illustrate our model, I found the datasets used for your paper, Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases (Nature, 2019). I am especially interested in the metagenomics and metatranscriptomics datasets, which showed high association in the paper. I see merged tables available from https://ibdmdb.org/ for the metagenomics and metatranscriptomics datasets.

I see that your merged tables have relative abundances estimated by MetaPhlAn for the metagenomics data and RPKs for the metatranscriptomics data.

I wonder if you have the data as estimated or raw counts. From my experience analyzing 16S sequencing data, I think that how a sample's sequencing depth is normalized may affect the final inferences. Also, our method does model-based sample normalization, so normalization prior to analysis is not required, and working with counts gives us more flexibility. Can I find count data on the website?

sagunmaharjann · April 20, 2021, 3:49pm

Hi user,
The MetaPhlAn estimates can be “back-calculated” to count-like values based on the sequencing depths. Unfortunately, there is no simple count interpretation like there is for 16S, since the reference sequences are of different lengths (unlike amplicons).
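
As a rough illustration of that back-calculation, here is a minimal sketch in Python. It assumes the relative abundances for a sample are percentages at a single taxonomic level and that you have a per-sample read depth available; the function name, the taxon labels, and the numbers are all hypothetical, and the result is only count-like, not a true read count.

```python
def back_calculate_counts(rel_abundance, total_reads, scale=100.0):
    """Scale relative abundances to count-like values for one sample.

    rel_abundance: dict mapping taxon name -> relative abundance
                   (assumed to be percentages when scale=100.0).
    total_reads:   sequencing depth of the sample (assumed known).
    scale:         value the abundances sum to (100.0 for percent,
                   1.0 for fractions).
    """
    if total_reads < 0:
        raise ValueError("total_reads must be non-negative")
    if scale <= 0:
        raise ValueError("scale must be positive")
    # Multiply first, then divide, and round to the nearest integer.
    return {
        taxon: round(abundance * total_reads / scale)
        for taxon, abundance in rel_abundance.items()
    }


# Hypothetical sample: three species and a depth of one million reads.
sample = {"s__A": 60.0, "s__B": 30.0, "s__C": 10.0}
counts = back_calculate_counts(sample, total_reads=1_000_000)
print(counts)
```

Keep in mind that these values do not correct for the differing reference sequence lengths mentioned above, so they should not be treated like 16S amplicon counts.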

Also, the raw data itself is available on the following pages:
MTX Raw Files | IBDMDB
Metagenomes Raw Files | IBDMDB

Regards,
Sagun
