[LU-14952] recovery-small test_149 crash: ASSERTION( !lu_object_is_dying(dt->do_lu.lo_header) ) failed
Created: 18/Aug/21 Updated: 01/Sep/21 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Maloo | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
This issue was created by maloo for S Buisson <sbuisson@ddn.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/7e66ae7d-f144-4979-8d3f-826e0614bbd0

test_149 failed with the following error:

trevis-48vm5 crashed during recovery-small test_149

Secondary MDS crashed with the following stack trace:

[ 7711.822050] LustreError: 14020:0:(osd_handler.c:4362:osd_ref_del()) lustre-MDT0001: nlink == 0 on [0x240001b70:0x2:0x0], maybe an upgraded file? (LU-3915)
[ 7711.824585] LustreError: 14020:0:(osd_handler.c:3754:osd_destroy()) ASSERTION( !lu_object_is_dying(dt->do_lu.lo_header) ) failed:
[ 7711.826678] LustreError: 14020:0:(osd_handler.c:3754:osd_destroy()) LBUG
[ 7711.828012] Pid: 14020, comm: mdt_out00_002 4.18.0-240.22.1.el8_lustre.x86_64 #1 SMP Fri Jul 30 19:47:15 UTC 2021
[ 7711.829959] Call Trace TBD:
[ 7711.830735] [<0>] libcfs_call_trace+0x6f/0x90 [libcfs]
[ 7711.831878] [<0>] lbug_with_loc+0x43/0x80 [libcfs]
[ 7711.833138] [<0>] osd_destroy+0x37c/0x4f0 [osd_ldiskfs]
[ 7711.834618] [<0>] out_obj_destroy+0x92/0x320 [ptlrpc]
[ 7711.835601] [<0>] out_tx_destroy_exec+0x1b/0x190 [ptlrpc]
[ 7711.836636] [<0>] out_tx_end+0x166/0x5c0 [ptlrpc]
[ 7711.837563] [<0>] out_handle+0x1703/0x1f70 [ptlrpc]
[ 7711.838497] [<0>] tgt_request_handle+0xc90/0x1940 [ptlrpc]
[ 7711.839552] [<0>] ptlrpc_server_handle_request+0x323/0xbc0 [ptlrpc]
[ 7711.840732] [<0>] ptlrpc_main+0xba2/0x1490 [ptlrpc]
[ 7711.841669] [<0>] kthread+0x112/0x130
[ 7711.842367] [<0>] ret_from_fork+0x35/0x40
[ 7711.843106] Kernel panic - not syncing: LBUG
[ 7711.843888] CPU: 1 PID: 14020 Comm: mdt_out00_002 Kdump: loaded Tainted: G OE --------- - - 4.18.0-240.22.1.el8_lustre.x86_64 #1
[ 7711.846140] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[ 7711.847169] Call Trace:
[ 7711.847654] dump_stack+0x5c/0x80
[ 7711.848281] panic+0xe7/0x2a9
[ 7711.848830] ? ret_from_fork+0x35/0x40
[ 7711.849516] lbug_with_loc.cold.10+0x18/0x18 [libcfs]
[ 7711.850439] osd_destroy+0x37c/0x4f0 [osd_ldiskfs]
[ 7711.851314] ? _cond_resched+0x15/0x30
[ 7711.852030] out_obj_destroy+0x92/0x320 [ptlrpc]
[ 7711.852912] out_tx_destroy_exec+0x1b/0x190 [ptlrpc]
[ 7711.853852] out_tx_end+0x166/0x5c0 [ptlrpc]
[ 7711.854670] out_handle+0x1703/0x1f70 [ptlrpc]
[ 7711.855485] ? libcfs_log_return+0x1e/0x30 [libcfs]
[ 7711.856368] ? libcfs_log_return+0x1e/0x30 [libcfs]
[ 7711.857290] tgt_request_handle+0xc90/0x1940 [ptlrpc]
[ 7711.858246] ptlrpc_server_handle_request+0x323/0xbc0 [ptlrpc]
[ 7711.859332] ptlrpc_main+0xba2/0x1490 [ptlrpc]
[ 7711.860141] ? __schedule+0x2cc/0x700
[ 7711.860850] ? ptlrpc_wait_event+0x500/0x500 [ptlrpc]
[ 7711.861767] kthread+0x112/0x130
[ 7711.862359] ? kthread_flush_work_fn+0x10/0x10
[ 7711.863158] ret_from_fork+0x35/0x40

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV |
| Comments |
| Comment by Patrick Farrell [ 24/Aug/21 ] |
|
Another: |
| Comment by Alex Zhuravlev [ 01/Sep/21 ] |
|
I'm not 100% sure, but tend to think this is another symptom of |