Details
Bug
Resolution: Unresolved
Major
Lustre 2.16.0
Description
This is a relatively low-frequency crash that several other tickets have attempted to fix, but it still persists. I see it in boilpot a few times a year, and maloo has also hit it at least once (reported in LU-15444):
LustreError: 955:0:(llog_osd.c:624:llog_osd_write_rec()) lustre-MDT0001-osp-MDT0000: index 2790 already set in llog bitmap [0x240000402:0x4:0x0]
LustreError: 955:0:(llog_osd.c:626:llog_osd_write_rec()) LBUG
Pid: 955, comm: mdt00_011 3.10.0-7.9-debug #2 SMP Tue Feb 1 18:17:58 EST 2022
Call Trace:
[<0>] libcfs_call_trace+0x90/0xf0 [libcfs]
[<0>] lbug_with_loc+0x4c/0xa0 [libcfs]
[<0>] llog_osd_write_rec+0x172c/0x1ba0 [obdclass]
[<0>] llog_write_rec+0x290/0x590 [obdclass]
[<0>] llog_cat_add_rec+0x201/0xa10 [obdclass]
[<0>] llog_add+0x17f/0x1f0 [obdclass]
[<0>] sub_updates_write+0x309/0xe55 [ptlrpc]
[<0>] top_trans_stop+0x49a/0xfb0 [ptlrpc]
[<0>] lod_trans_stop+0x264/0x350 [lod]
[<0>] mdd_trans_stop+0x28/0x16e [mdd]
[<0>] mdd_attr_set+0x897/0x11e0 [mdd]
[<0>] mdt_reint_setattr+0xab3/0x19a0 [mdt]
[<0>] mdt_reint_rec+0x87/0x240 [mdt]
[<0>] mdt_reint_internal+0x76c/0xba0 [mdt]
[<0>] mdt_reint+0x67/0x150 [mdt]
[<0>] tgt_request_handle+0x93a/0x19c0 [ptlrpc]
[<0>] ptlrpc_server_handle_request+0x251/0xc00 [ptlrpc]
[<0>] ptlrpc_main+0xc21/0x15f0 [ptlrpc]
[<0>] kthread+0xe4/0xf0
[<0>] ret_from_fork_nospec_begin+0x7/0x21
[<0>] 0xfffffffffffffffe
https://knox.linuxhacker.ru/crashdb_ui_external.py.cgi?newid=68534
https://knox.linuxhacker.ru/crashdb_ui_external.py.cgi?newid=68364
https://knox.linuxhacker.ru/crashdb_ui_external.py.cgi?newid=68010
https://knox.linuxhacker.ru/crashdb_ui_external.py.cgi?newid=67398
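
For anyone reading the trace without the source at hand: the error text suggests that the write path claims a record index by setting its bit in the llog header bitmap, and treats an already-set bit as fatal. The following is only a minimal Python sketch of that kind of test-and-set guard, not the Lustre code. The names `Bitmap`, `test_and_set`, `write_rec`, and `LlogBitmapError` are hypothetical, and the exception merely stands in for the LBUG.

```python
class LlogBitmapError(Exception):
    """Raised when a record index is claimed twice (stands in for the LBUG)."""


class Bitmap:
    """Fixed-size bitmap backed by a bytearray (hypothetical helper)."""

    def __init__(self, nbits):
        self.nbits = nbits
        self.bits = bytearray((nbits + 7) // 8)

    def test_and_set(self, index):
        """Set bit `index` and return whether it was already set."""
        if not 0 <= index < self.nbits:
            raise IndexError(index)
        byte, mask = index // 8, 1 << (index % 8)
        was_set = bool(self.bits[byte] & mask)
        self.bits[byte] |= mask
        return was_set


def write_rec(bitmap, index):
    """Claim `index` for a new record; a second claim is treated as fatal."""
    if bitmap.test_and_set(index):
        raise LlogBitmapError(f"index {index} already set in llog bitmap")
```

In this model, two callers that both believe index 2790 is the next free slot would trigger the failure on the second `write_rec(bm, 2790)`. Whether the real crash comes from a race of that kind or from a stale index is exactly what remains open here.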