[LU-14952] recovery-small test_149 crash: ASSERTION( !lu_object_is_dying(dt->do_lu.lo_header) ) failed Created: 18/Aug/21  Updated: 01/Sep/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for S Buisson <sbuisson@ddn.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/7e66ae7d-f144-4979-8d3f-826e0614bbd0

test_149 failed with the following error:

trevis-48vm5 crashed during recovery-small test_149

Secondary MDS crashed with the following stack trace:

[ 7711.822050] LustreError: 14020:0:(osd_handler.c:4362:osd_ref_del()) lustre-MDT0001: nlink == 0 on [0x240001b70:0x2:0x0], maybe an upgraded file? (LU-3915)
[ 7711.824585] LustreError: 14020:0:(osd_handler.c:3754:osd_destroy()) ASSERTION( !lu_object_is_dying(dt->do_lu.lo_header) ) failed: 
[ 7711.826678] LustreError: 14020:0:(osd_handler.c:3754:osd_destroy()) LBUG
[ 7711.828012] Pid: 14020, comm: mdt_out00_002 4.18.0-240.22.1.el8_lustre.x86_64 #1 SMP Fri Jul 30 19:47:15 UTC 2021
[ 7711.829959] Call Trace TBD:
[ 7711.830735] [<0>] libcfs_call_trace+0x6f/0x90 [libcfs]
[ 7711.831878] [<0>] lbug_with_loc+0x43/0x80 [libcfs]
[ 7711.833138] [<0>] osd_destroy+0x37c/0x4f0 [osd_ldiskfs]
[ 7711.834618] [<0>] out_obj_destroy+0x92/0x320 [ptlrpc]
[ 7711.835601] [<0>] out_tx_destroy_exec+0x1b/0x190 [ptlrpc]
[ 7711.836636] [<0>] out_tx_end+0x166/0x5c0 [ptlrpc]
[ 7711.837563] [<0>] out_handle+0x1703/0x1f70 [ptlrpc]
[ 7711.838497] [<0>] tgt_request_handle+0xc90/0x1940 [ptlrpc]
[ 7711.839552] [<0>] ptlrpc_server_handle_request+0x323/0xbc0 [ptlrpc]
[ 7711.840732] [<0>] ptlrpc_main+0xba2/0x1490 [ptlrpc]
[ 7711.841669] [<0>] kthread+0x112/0x130
[ 7711.842367] [<0>] ret_from_fork+0x35/0x40
[ 7711.843106] Kernel panic - not syncing: LBUG
[ 7711.843888] CPU: 1 PID: 14020 Comm: mdt_out00_002 Kdump: loaded Tainted: G           OE    --------- -  - 4.18.0-240.22.1.el8_lustre.x86_64 #1
[ 7711.846140] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[ 7711.847169] Call Trace:
[ 7711.847654]  dump_stack+0x5c/0x80
[ 7711.848281]  panic+0xe7/0x2a9
[ 7711.848830]  ? ret_from_fork+0x35/0x40
[ 7711.849516]  lbug_with_loc.cold.10+0x18/0x18 [libcfs]
[ 7711.850439]  osd_destroy+0x37c/0x4f0 [osd_ldiskfs]
[ 7711.851314]  ? _cond_resched+0x15/0x30
[ 7711.852030]  out_obj_destroy+0x92/0x320 [ptlrpc]
[ 7711.852912]  out_tx_destroy_exec+0x1b/0x190 [ptlrpc]
[ 7711.853852]  out_tx_end+0x166/0x5c0 [ptlrpc]
[ 7711.854670]  out_handle+0x1703/0x1f70 [ptlrpc]
[ 7711.855485]  ? libcfs_log_return+0x1e/0x30 [libcfs]
[ 7711.856368]  ? libcfs_log_return+0x1e/0x30 [libcfs]
[ 7711.857290]  tgt_request_handle+0xc90/0x1940 [ptlrpc]
[ 7711.858246]  ptlrpc_server_handle_request+0x323/0xbc0 [ptlrpc]
[ 7711.859332]  ptlrpc_main+0xba2/0x1490 [ptlrpc]
[ 7711.860141]  ? __schedule+0x2cc/0x700
[ 7711.860850]  ? ptlrpc_wait_event+0x500/0x500 [ptlrpc]
[ 7711.861767]  kthread+0x112/0x130
[ 7711.862359]  ? kthread_flush_work_fn+0x10/0x10
[ 7711.863158]  ret_from_fork+0x35/0x40

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
recovery-small test_149 - trevis-48vm5 crashed during recovery-small test_149



 Comments   
Comment by Patrick Farrell [ 24/Aug/21 ]

Another:
https://testing.whamcloud.com/test_sets/7681e43b-c015-490e-92b8-098136213bb1

Comment by Alex Zhuravlev [ 01/Sep/21 ]

I'm not 100% sure, but tend to think this is another symptom of LU-13195

Generated at Sat Feb 10 03:14:10 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.