CLR normalisation output

Katb · December 15, 2023, 6:15am

Hello,

I’m trying to run the CLR normalisation option in MaAsLin2, and in the significant results output file the ‘N not 0’ column is incorrect. It says one of my taxa is present in only 6 samples, but it is present in 209 samples!
Because of this, I am concerned about the accuracy of the output and am wondering whether I am doing something incorrectly.

I input a table of relative abundance data with the taxa as columns and the samples as rows.

Also please note I haven’t done anything with the zeros in the relative abundance table. I would really appreciate your guidance. Thanks!

for (p in variable_list) {
  output_file <- paste0('maaslin_allweeks_CLR_confounders', p)

  Maaslin2('taxa_maaslin',
           'meta_out_maaslin',
           output_file,  # Use the generated output file name
           min_abundance = 0.001,
           min_prevalence = 0.1,
           transform = 'NONE',
           normalization = 'CLR',
           cores = 6,
           random_effects = c('PID'),
           fixed_effects = c('Week', p, 'Age', 'BMI', 'Delivery_birth', 'Farm', 'Gender'),
           reference = NULL,
           correction = 'BH')
}

nearinj · January 2, 2024, 7:38pm

Hi there,

Thanks for looking into this. We corrected this with the most recent update to the GitHub version of Maaslin2 found here: GitHub - biobakery/Maaslin2: MaAsLin2: Microbiome Multivariate Association with Linear Models

However, it hasn’t been pushed to the Bioconductor version. Your output should be fine except for that column value.
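If you want to cross-check that column yourself in the meantime, one option is to count the non-zero entries per taxon directly from the input table. The snippet below is only a sketch on made-up data (the `taxa` data frame and its values are hypothetical), laid out with samples as rows and taxa as columns as described above:

```r
# Toy relative-abundance table (hypothetical values):
# samples as rows, taxa as columns.
taxa <- data.frame(
  taxon_A = c(0.50, 0.00, 0.20, 0.10),
  taxon_B = c(0.00, 0.00, 0.30, 0.00),
  taxon_C = c(0.50, 1.00, 0.50, 0.90),
  row.names = c("S1", "S2", "S3", "S4")
)

# Number of samples in which each taxon has a non-zero abundance.
n_not_zero <- colSums(taxa > 0)

# The same counts expressed as a fraction of all samples.
prevalence <- n_not_zero / nrow(taxa)

print(n_not_zero)
print(prevalence)
```

Running the same two lines on your own table should give the sample counts you expect (for example, 209 for the taxon in question).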

Sorry for the slow response.

Jacob Nearing

Katb · January 3, 2024, 11:44pm

Thanks Jacob - could you please also confirm whether the Maaslin CLR option adds a pseudocount to the relative abundance data to eliminate zeros prior to the CLR transformation? Or is that something I need to do before loading the data into Maaslin?

nearinj · January 4, 2024, 3:23am

Hi there,

Yes, it adds a pseudo-count to deal with zeros in the data, so there is no need to do so beforehand.

Cheers,
Jacob

immunochem · January 5, 2024, 7:09pm

Hi, following up on the pseudo-count question: how are the pseudo-counts determined? Say my data already contains very small relative abundances (e.g. 6.851430e-08); is the pseudo-count determined proportionally to this?

nearinj · January 5, 2024, 8:00pm

Hi there,

Maaslin2 will add a value of 1 to the raw input data before computing CLR values.
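To illustrate what a fixed +1 pseudo-count does on different input scales, here is a minimal base-R sketch. It is not MaAsLin2's internal code: the function name `clr_with_pseudocount`, the toy data, and the use of the natural log are all assumptions made for illustration.

```r
# Minimal CLR sketch: add a pseudo-count, then compute
# log(x / geometric mean of the sample) for every feature.
# Natural log is an assumption; a different base only rescales the values.
clr_with_pseudocount <- function(x, pseudocount = 1) {
  x <- as.matrix(x) + pseudocount
  log_x <- log(x)
  # Subtracting the row mean of the logs equals dividing by the geometric mean.
  sweep(log_x, 1, rowMeans(log_x), "-")
}

# Hypothetical data: the same compositions as relative abundances and as counts.
rel <- matrix(c(0.7, 0.3, 0.0,
                0.1, 0.1, 0.8),
              nrow = 2, byrow = TRUE,
              dimnames = list(c("S1", "S2"),
                              c("taxon_A", "taxon_B", "taxon_C")))
counts <- rel * 10000

clr_rel <- clr_with_pseudocount(rel)
clr_counts <- clr_with_pseudocount(counts)

# Each sample (row) of a CLR-transformed matrix sums to zero.
print(round(rowSums(clr_rel), 10))

# Compare the spread of transformed values on the two input scales.
print(range(clr_rel))
print(range(clr_counts))
```

On a 0-1 scale the pseudo-count is larger than every observed value, and log(1 + x) is close to x for small x, so the transformed values span a much narrower range than they do for count-scale input.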

Cheers,
Jacob Nearing

shibataryohei · January 16, 2025, 4:27pm

Hi Jacob,

Quick question.

Will Maaslin2 apply the +1 shift even to relative abundance data, with values between 0 and 1? Do you think that is appropriate? And will the shifted values be divided by their total sum before the CLR, since CLR is a transformation for compositional data?

Thanks,
Ryohei

nearinj · January 23, 2025, 7:32pm

Hello @shibataryohei,

Yes, when using CLR, a pseudo-count of 1 is added. Whether or not that is appropriate depends on your data, as CLR transformations with pseudo-counts can sometimes cause issues when the covariate of interest is associated with read depth.

I’m not sure I understand your second question but the formula used for the normalization can be found here:

Cheers
Jacob
