Details
-
Bug
-
Resolution: Fixed
-
Major
-
None
-
Lustre 2.4.1
-
None
-
client: slessp2 2.4.1-3nas
-
3
-
12522
Description
3891131.052019] LustreError: 27617:0:(cl_lock.c:1964:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageWriteback(cl_page_vmpage(env, page)))) ) failed: ^M
[3891131.067131] LustreError: 27617:0:(cl_lock.c:1964:discard_cb()) LBUG^M
[3891131.073568] Pid: 27617, comm: geos.xco2.osse.^M
PID: 27617 TASK: ffff8809be7a04c0 CPU: 21 COMMAND: "geos.xco2.osse."
#0 [ffff88103321f620] panic at ffffffff81450800
#1 [ffff88103321f6a0] lbug_with_loc at ffffffffa04c2dc3 [libcfs]
#2 [ffff88103321f6c0] discard_cb at ffffffffa0633774 [obdclass]
#3 [ffff88103321f6f0] cl_page_gang_lookup at ffffffffa06309d6 [obdclass]
#4 [ffff88103321f780] cl_lock_discard_pages at ffffffffa063346a [obdclass]
#5 [ffff88103321f7c0] osc_lock_flush at ffffffffa08fe272 [osc]
#6 [ffff88103321f810] osc_lock_cancel at ffffffffa08fe4d9 [osc]
#7 [ffff88103321f850] cl_lock_cancel0 at ffffffffa0631155 [obdclass]
#8 [ffff88103321f870] cl_lock_cancel at ffffffffa0631eab [obdclass]
#9 [ffff88103321f890] osc_lock_blocking at ffffffffa08ff01d [osc]
#10 [ffff88103321f8c0] osc_dlm_blocking_ast0 at ffffffffa08ff8c9 [osc]
#11 [ffff88103321f900] osc_ldlm_blocking_ast at ffffffffa08ffa2c [osc]
#12 [ffff88103321f940] ldlm_cancel_callback at ffffffffa0742e0f [ptlrpc]
#13 [ffff88103321f950] ldlm_cli_cancel_local at ffffffffa075121f [ptlrpc]
#14 [ffff88103321f970] ldlm_cli_cancel_list_local at ffffffffa07545b2 [ptlrpc]
#15 [ffff88103321f9d0] ldlm_prep_elc_req at ffffffffa07563bf [ptlrpc]
#16 [ffff88103321fa40] ldlm_prep_enqueue_req at ffffffffa075648f [ptlrpc]
#17 [ffff88103321fa50] osc_enqueue_base at ffffffffa08e525f [osc]
#18 [ffff88103321faf0] osc_lock_enqueue at ffffffffa08ff440 [osc]
#19 [ffff88103321fb60] cl_enqueue_kick at ffffffffa0632652 [obdclass]
#20 [ffff88103321fb90] cl_enqueue_try at ffffffffa0635961 [obdclass]
#21 [ffff88103321fbc0] lov_lock_enqueue_one at ffffffffa0995d8d [lov]
#22 [ffff88103321fbf0] lov_lock_enqueue at ffffffffa09984bb [lov]
#23 [ffff88103321fc60] cl_enqueue_kick at ffffffffa0632652 [obdclass]
#24 [ffff88103321fc90] cl_enqueue_try at ffffffffa0635961 [obdclass]
#25 [ffff88103321fcc0] cl_enqueue_locked at ffffffffa0636717 [obdclass]
#26 [ffff88103321fcf0] cl_lock_request at ffffffffa06373e9 [obdclass]
#27 [ffff88103321fd40] cl_glimpse_lock at ffffffffa0a6ae5d [lustre]
#28 [ffff88103321fda0] cl_glimpse_size0 at ffffffffa0a6b31f [lustre]
#29 [ffff88103321fdf0] ll_glimpse_size at ffffffffa0a19695 [lustre]
#30 [ffff88103321fe10] ll_inode_revalidate_it at ffffffffa0a1eb68 [lustre]
#31 [ffff88103321fe40] ll_getattr_it at ffffffffa0a1ebae [lustre]
#32 [ffff88103321fe70] ll_getattr at ffffffffa0a1ecff [lustre]
#33 [ffff88103321fed0] vfs_fstat at ffffffff81155d27
#34 [ffff88103321fef0] sys_newfstat at ffffffff81155d6f
#35 [ffff88103321ff80] system_call_fastpath at ffffffff8145b412
RIP: 00002aaaacfe9394 RSP: 00007fffffff5a20 RFLAGS: 00000202
RAX: 0000000000000005 RBX: ffffffff8145b412 RCX: 0000000000000001
RDX: 00007fffffdafbf8 RSI: 00007fffffdafbf8 RDI: 0000000000000009
RBP: 00007fffffdb1cc0 R8: 0000000000000000 R9: 000000000000000a
R10: 00007fffffdac960 R11: 0000000000000246 R12: 0000000000000001
R13: 00007fffffdb1dc0 R14: 0000000000000000 R15: 00007fffffdb1cc0
ORIG_RAX: 0000000000000005 CS: 0033 SS: 002b
Attachments
Issue Links
Activity
Link | New: This issue is related to LDEV-134 [ LDEV-134 ] |
Resolution | New: Fixed [ 1 ] | |
Status | Original: Reopened [ 4 ] | New: Resolved [ 5 ] |
Comment | [ lustre log dump ] |
Comment |
[ We appear to be hitting this issue on Hyperion with 2.7 FE {code} Jul 30 16:33:37 iwc151 kernel: LustreError: 22705:0:(ldlm_resource.c:1418:ldlm_resource_dump()) Granted locks (in reverse order): Jul 30 16:33:37 iwc151 kernel: LustreError: 22705:0:(ldlm_resource.c:1421:ldlm_resource_dump()) ### ### ns: lustre-OST0009-osc-ffff8808726fb400 lock: ffff8808537b6b00/0x58cbc34f0b2d4dcb lrc: 2/0,0 mode: PW/PW res: [0xd4c6b8:0x0:0x0].0 rrc: 3 type: EXT [0->18446744073709551615] (req 0->8191) flags: 0x126400020000 nid: local remote: 0xdb6a7d473dfbfe9b expref: -99 pid: 22607 timeout: 0 lvb_type: 1 Jul 30 16:33:37 iwc151 kernel: Lustre: lustre-OST0009-osc-ffff8808726fb400: Connection restored to lustre-OST0009 (at 192.168.120.22@o2ib) Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageDirty(cl_page_vmpage(page)))) ) failed: Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) ASSERTION( (!(page->cp_type == CPT_CACHEABLE) || (!PageDirty(cl_page_vmpage(page)))) ) failed: Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) LBUG Jul 30 16:33:37 iwc151 kernel: LustreError: 91227:0:(osc_cache.c:3115:discard_cb()) LBUG Jul 30 16:33:37 iwc151 kernel: Pid: 91227, comm: ldlm_bl_79 Jul 30 16:33:37 iwc151 kernel: Jul 30 16:33:37 iwc151 kernel: Call Trace: Jul 30 16:33:37 iwc151 kernel: [<ffffffffa045e895>] libcfs_debug_dumpstack+0x55/0x80 [libcfs] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa045ee97>] lbug_with_loc+0x47/0xb0 [libcfs] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dccaa6>] discard_cb+0x156/0x190 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcce1c>] osc_page_gang_lookup+0x1ac/0x330 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcc950>] ? discard_cb+0x0/0x190 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcd0e4>] osc_lock_discard_pages+0x144/0x240 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dcc950>] ? discard_cb+0x0/0x190 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dc37cb>] osc_lock_flush+0x8b/0x260 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dc3c48>] osc_ldlm_blocking_ast+0x2a8/0x3c0 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c2a04c>] ldlm_cancel_callback+0x6c/0x1a0 [ptlrpc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c395ba>] ldlm_cli_cancel_local+0x8a/0x470 [ptlrpc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c3e040>] ldlm_cli_cancel+0x60/0x360 [ptlrpc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0dc3a7b>] osc_ldlm_blocking_ast+0xdb/0x3c0 [osc] Jul 30 16:33:37 iwc151 kernel: [<ffffffff8152cfae>] ? _spin_lock+0x1e/0x30 Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c417a0>] ldlm_handle_bl_callback+0x130/0x400 [ptlrpc] Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c41ce1>] ldlm_bl_thread_main+0x271/0x3f0 [ptlrpc] Jul 30 16:33:37 iwc151 kernel: [<ffffffff81064c00>] ? default_wake_function+0x0/0x20 Jul 30 16:33:37 iwc151 kernel: [<ffffffffa0c41a70>] ? ldlm_bl_thread_main+0x0/0x3f0 [ptlrpc] Jul 30 16:33:37 iwc151 kernel: [<ffffffff8109e78e>] kthread+0x9e/0xc0 Jul 30 16:33:37 iwc151 kernel: [<ffffffff8100c28a>] child_rip+0xa/0x20 Jul 30 16:33:37 iwc151 kernel: [<ffffffff8109e6f0>] ? kthread+0x0/0xc0 Jul 30 16:33:37 iwc151 kernel: [<ffffffff8100c280>] ? child_rip+0x0/0x20 Jul 30 16:33:37 iwc151 kernel: {code} ] |
Attachment | New: iwc151.lbug.txt [ 18536 ] |
Resolution | Original: Fixed [ 1 ] | |
Status | Original: Resolved [ 5 ] | New: Reopened [ 4 ] |
Resolution | New: Fixed [ 1 ] | |
Status | Original: Open [ 1 ] | New: Resolved [ 5 ] |
Labels | Original: p4d |
Looks like the patch we supplied has fixed this problem, as far as we can tell.
~ jfc.