Input_data and input_metadata Specifications

Dario · July 15, 2021, 11:00am

How does the software match between data and metadata? Are the two data frames required to be in the same order or does the software match by row or column names? In the documentation,

input_data: The tab-delimited input file of features.
input_metadata: The tab-delimited input file of metadata.

Could more informative descriptions be provided instead with precise specifications of them? I tried providing a data frame with only a subset of samples, so the dimensions of the two data frames are different and it didn’t produce any errors. Did it work correctly? Perhaps there should be a \details{} section in the Rd file explaining about row and column names and expectations.

Kelsey_Thompson · July 15, 2021, 12:15pm

Hi @Dario,

Sorry for any confusion and thanks for the suggestions! Before MaAsLin runs the models it first identifies if the samples are in the rows or columns - by using the intersect command in R between the rownames and colnames of the input data tables. Then matches based on the previous step to filter down to only those samples with matching features and metadata. Then finally it reorders the two resulting data frames to be in the same order. Thus, in answer to your question if you provided a subset of the samples only and it ran error-free, then it did identify which rows/columns your samples were in, subset the two feature tables to match, and then ran without incident. If you want further information about a MaAsLin run you can always check the log file that is produced and it should log how many samples were included in the analysis etc.

We do provide more details about the requirements for the file types in our tutorial and the HTML vignette associated with Bioconductor. Again, apologies for any confusion that arose while running MaAsLin, and thank you for the additional suggestions on where to place this information to avoid confusion.

I hope this helps!
Best,
Kelsey

Topic		Replies	Views
Can't recognize input data in setting up MaAsLin	1	483	June 17, 2022
Potential Bug in `maaslin3` with `input_metadata` Parameter Downstream analysis and statistics	1	31	January 6, 2025
Error running Maaslin2 MaAsLin	8	3626	March 22, 2022
Tutorial help-newbie struggling MaAsLin	9	1294	September 16, 2022
Error running Maaslin2 about input MaAsLin	3	1483	February 2, 2021

Input_data and input_metadata Specifications

Related topics