[LU-10573] mdt_destroy_export()) ASSERTION( list_empty(&exp->u.eu_mdt_data.med_open_head) ) failed Created: 28/Jan/18  Updated: 14/Aug/18

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Oleg Drokin Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Duplicate
is duplicated by LU-11232 replay-ost-single test_0b: BUG: unabl... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Just had this crash in recovery-small test 50 fail an assertion in my testing.

This is masteer-next as of today, but I don't see anything in master-next patches that would cause it so might be a generic master bug.

[25862.562574] Lustre: DEBUG MARKER: == recovery-small test 50: failover MDS under load =================================================== 14:20:27 (1517167227)
[25873.636422] Lustre: Failing over lustre-MDT0000
[25873.719531] LustreError: 24395:0:(ldlm_lockd.c:1362:ldlm_handle_enqueue0()) ### lock on destroyed export ffff8800961ee800 ns: mdt-lustre-MDT0000_UUID lock: ffff880323b2cd00/0x763bb02014e26b5c lrc: 3/0,0 mode: CR/CR res: [0x200002b11:0x11f:0x0].0x0 bits 0x8/0x0 rrc: 2 type: IBT flags: 0x50200000000000 nid: 0@lo remote: 0x763bb02014e26b40 expref: 3 pid: 24395 timeout: 0 lvb_type: 0
[25873.733250] LustreError: 12679:0:(client.c:1147:ptlrpc_import_delay_req()) @@@ IMP_CLOSED   req@ffff8802deaccc00 x1590863678811328/t0(0) o6->lustre-OST0001-osc-MDT0000@0@lo:28/4 lens 664/432 e 0 to 0 dl 0 ref 1 fl Rpc:/0/ffffffff rc 0/-1
[25873.769998] Lustre: lustre-MDT0000: Not available for connect from 0@lo (stopping)
[25873.824961] LustreError: 12625:0:(mdt_handler.c:5997:mdt_destroy_export()) ASSERTION( list_empty(&exp->u.eu_mdt_data.med_open_head) ) failed: 
[25873.827562] LustreError: 12625:0:(mdt_handler.c:5997:mdt_destroy_export()) LBUG
[25873.829635] Pid: 12625, comm: obd_zombid
[25873.830589] 
Call Trace:
[25873.837799]  [<ffffffffa02437ce>] libcfs_call_trace+0x4e/0x60 [libcfs]
[25873.838612]  [<ffffffffa024385c>] lbug_with_loc+0x4c/0xb0 [libcfs]
[25873.847602]  [<ffffffffa0cab2dc>] mdt_destroy_export+0x1fc/0x200 [mdt]
[25873.848676]  [<ffffffffa03ad385>] class_export_destroy+0xe5/0x490 [obdclass]
[25873.849634]  [<ffffffffa03b553c>] obd_zombie_impexp_cull+0x35c/0x530 [obdclass]
[25873.851052]  [<ffffffffa03b577d>] obd_zombie_impexp_thread+0x6d/0x1c0 [obdclass]
[25873.852465]  [<ffffffff810b7cc0>] ? default_wake_function+0x0/0x20
[25873.862811]  [<ffffffffa03b5710>] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[25873.864393]  [<ffffffff810a2eba>] kthread+0xea/0xf0
[25873.865344]  [<ffffffff810a2dd0>] ? kthread+0x0/0xf0
[25873.866310]  [<ffffffff8170fb98>] ret_from_fork+0x58/0x90
[25873.867276]  [<ffffffff810a2dd0>] ? kthread+0x0/0xf0
[25873.868241] 
[25873.869276] Kernel panic - not syncing: LBUG

Generated at Sat Feb 10 02:36:17 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.