Problem using LEfSe on Galaxy - can't format data for LEfSe

I am new to LEfSe and am trying to do a LEfSe analysis for a microbial dataset I have, and I seem to be having difficulty getting LEfSe to read in my file (can’t seem to attach as a new user) and format it for analysis. I can upload my File into Galaxy, but then I get stuck. Unfortunately, the link to download the example LEfSe input file in the explanation of the steps results in a 404 error.

Once I hit “Execute” on the dialog box to Format Data for LEfSe, I get this error (I’ve tried various options in all places in the drop-down boxes to no avail):
Traceback (most recent call last): File “/shed_tools/testtoolshed.g2.bx.psu.edu/repos/george-weingart/lefse/a6284ef17bf3/lefse/format_input.py”, line 435, in feats = numerical_values(feats,params[‘norm_v’]) File "/shed_tools/testtoolshed

I’ve tried to google the error, and all that comes up are unresolved questions in various forums - one answer suggests that the problem arises from special characters being in the metadata file, but my file does not have special characters.

I’d really appreciate some guidance!

Hope you are all safe and well,
Alissa

[Here’s the input file I can’t seem to upload]

sample AHC01 AHC02 AHC03 AHC04 AHC05 AHC06 AHC07 AHC08 AHC09 AHC10 AHC11 AHC12 AHC14 AHC15 AHC16 AHC17 AHC18 AHC19 AHC20 AHC21 AHC22 AHC23 AHC25 AHC26 AHC28 AHC29 AHC30 AHC32 AHC33 AHC34 AHC35 AHC38 AHC39 AHC40 AHC41 AHC43
Sampling_ID C3MB C2MB C3UB C1UB C1MB C2UB C1LB C2LB R2MB R1UB C3UR C1MR C2UR R2LB C3LB R3UB R3MB E2MB E1UB R2UB E3MB E1MB E3UB E3LB R3LB R1MB R2UR R2LR E2UR E3UR R1UR R2MR R3UR R3LR E3MR E2MR
Depth Middle Middle Upper Upper Middle Upper Lower Lower Middle Upper Upper Middle Upper Lower Lower Upper Middle Middle Upper Upper Middle Middle Upper Lower Lower Middle Upper Lower Upper Upper Upper Middle Upper Lower Middle Middle
Sample_Type Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Rhizosphere Rhizosphere Rhizosphere Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Bulk_Soil Rhizosphere Rhizosphere Rhizosphere Rhizosphere Rhizosphere Rhizosphere Rhizosphere Rhizosphere Rhizosphere Rhizosphere
ade5092696cca194ca857a05b9718444 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
c14367bb06b926b2b5e883a558d6fca5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0
f19396208c8df37cf8e4d3a7fc773539 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0
e9822fcf48a8da07d328eac1ed4926db 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0
c210205a36e63d886994863fa4dbb078 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
1dea6fdf50cbddb2bdff2a458ee0b7e0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
30e86531040d5278e5207a681bd0a183 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0
e97e6c980420343168e2c86d4828a017 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
623a266e1a1dc3c3ec31e23824055614 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0
f3d79f4a473bf2321a93a303f6573021 0 0 1 4 10 1 0 0 8 0 0 0 3 0 0 2 0 3 0 0 0 7 0 0 0 0 0 0 0 4 0 7 3 0 0 0
62d5e0685613aa05ac96bf4dc1a05c31 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
ccfa885a96eb4063d6f59f7ba54617e0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
d463e4afe73b4b1769744b0c7f4097ce 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
98ea864bea57fb9533d14c7f75132b41 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 0
b0220866345c7236e4b4b2f972ba01d9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
e9eac09b62aef6bae57cd5dcfa0778cf 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
f92cda9389d47aeac8ee7bad79bb196f 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
f5772908ea5d0d94f854323569a3af38 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
3e8a6c29ec09db0843c48f594bfede14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
5ce477ce6618824b82a56f4d0572aa3c 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
d90d29d0baf82626fa3dd0918908076e 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
16a965795186d8fb234e85a2e74319a3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
6ad6e9eccf8a15fc5760ed9c23f58e02 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
a681c67fa4e7ae55a6e3b4e7ad87eb1c 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
21c6b311f462cccd040143cd165bb105 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
744ea5bab523a943666bfbb5030d07f1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0
54897405d8f33e1d286571d5b17aca0f 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0
a3cc7badc7e034c064771b2658b28e11 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0
c77905290ec657dbfc999bbf01b67998 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0
130a8e93111f6c3e68eab9336f413aae 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0
b231847777963cf594f96e68074ee665 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
cf7bf3e6639ce1a089d6e88b6f078425 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
b854bd37fc93ebd995c4b7982f4125f8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0
518d39787fcd51238a38aa4f855525e4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
ea5954ba41090ec0903924233f39da0e 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0
eab57a0b62d42cafe5cbb4c942d26f3d 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0
f55e3255ea3a3a8af9f0c5930869793c 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
e9bdad9e4fd65a1d38da5e3a53c4ec47 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
f521e9e92a9ee18e93450d9f44199258 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
8e1a6d6dfc0dfdf5735c1778ee9b5a8d 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
2ee095bdccfe5937a803e42b8935128d 6 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
7f07e7a9ec60dcd67ae0e584afb0c08c 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
4cbdaa6ac2018a23607eb6750e83a211 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
7d610f474f5628248ed696156659d26c 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
88505ce8aec0c275af1d5f4d512dbcce 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
6a17e06eaf33f50c4105aad6050e406e 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
f12c8a47d26cc214d54aff4a3a7de6a8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0
4aac54b5d7cb6b0a042ce7b125ea7607 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
d6f684b679435943564d15a9efad4ffd 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
6b96197d4f392a85d7b1eac794293b5e 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0b47ed8713c8a26dbfd1b640d4509eb3 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
4b5279fb6bf7cabc2aa70d395c30941e 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
574f869264a352ff7d010ec73fc31cdc 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
1612eadf68ce66aaa987c72ca5f48ebc 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
90515a9a8aae1c64e4f2556e8c5bc36a 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0
2830db2ae378f6da4fba8a0b67c9dcfa 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0
21a3e1af75c6d6a016da1f35d1156c5b 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0
15cc364a4f35f40ed3f0d42120d57fce 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
c9ede7953abf1e7713cc273a7c63665a 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0
3d1cab1999c15c684b39b014df8d2acc 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
0abf1f5b40b2b7b40806b1c9c842412a 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 2 0 0 0 0 0 0 0 0 0 0 0
6028469df1d48689ad6738cddeb7f8e9 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
3d533a8868f59b7328f7f75c10e8d007 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6
db359588bc9c1d055b5fe46d1fec9cde 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 6 0 0 0 0 0 0 0
52da1e59be9161f8f4a363ac02952ed5 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 1 2 0 0
fa11c2622318b6d44ef1ced3b37092aa 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
779dc92d5387a7010e39ff71e85850df 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
cdc7438efb1bb84fc0b687ec71b82744 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
a5c0142aeab6dd951f8d29efc681e3d5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0
926e6ef9402efb84a4e08181b795973f 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0
22771c5187db6b5f3be176816b9fbb85 0 0 3 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
181707d3e1ff6fb3c70a829d1bc312fd 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0
7272cd542de89934539d5674f4c3a73e 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0
58a0b4560cbea0bc7c51632bd9cd913d 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 3
4d4e23eca3860def53570fb648c40b98 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
75381669b9559310551133e8c7bbca49 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 5 0 3 0 2 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
7ca9a3300909b4b6bf44731b50e139a2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0
19f12bfbbc0c5a158b9b42a71958b64d 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
06f73e3517d3594c8d9df93c640047ab 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1
5db7603cdabd934dbf9ab6127a11f952 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
540c3f98daab58191f8797aab1059365 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
8013b5108922e2c22a22179d3c7b843f 0 0 0 0 0 4 0 0 0 0 0 0 0 9 0 1 0 0 0 0 0 1 0 6 0 0 0 8 0 0 0 0 0 0 6 0
56c626be7f67051bb3003b114b3c7866 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
9ae0c41b0f89bf4f0b118e2658c6ea80 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0

Hi,

Thanks for providing the input! Standard input for LEfSe shouldn’t have sample ID rows (example input) and I think that might be what’s causing the issue. Taking the first two rows out makes formatting run successfully.

Thanks!
Siyuan

Hi Siyuan,

I have the same problem of Alibba,
I got this error report

"Traceback (most recent call last):
File “/shed_tools/testtoolshed.g2.bx.psu.edu/repos/george-weingart/lefse/a6284ef17bf3/lefse/format_input.py”, line 435, in
feats = numerical_values(feats,params[‘norm_v’])
File “/shed_tools/testtoolshed”

I eliminated all the special characters after reviewed your example.
Lefse_VOJ|690x430
Lefse_VOJ_2|690x429

Thanks in advance,
Vanessa

Hi -
It’s not clear to me what’s causing the issue. I did notice repeated features (multiple “Bacteria” rows), which I think LEfSe does not expect and could be problematic? If you’d like, you can email an example input to siyuanma@g.harvard.edu.

1 Like

Greetings,

i am using the example input file but constantly having this error.
Traceback (most recent call last):
File “/usr/bin/lefse_format_input.py”, line 10, in
from importlib.metadata import distribution
ModuleNotFoundError: No module named ‘importlib.metadata’

Hi @Mahrukh_Butt,

I believe the error that you are getting is due to our transition to python3 on Galaxy. LEfSe is a tool not under current development and thus we needed to revisit the code base to get it to work with python3. We will let everyone know when we have updated this in Galaxy.

In the meantime, you can use MaAsLin in R to complete a complementary task.

Best,
Kelsey

Greetings,

Thank you so much for replying. Can you tell me an estimate of how long it will take?

Best Regards