[LU-2651] lov_object.c:595:lov_layout_change()) ASSERTION( list_empty(&hdr->coh_locks) ) failed Created: 18/Jan/13  Updated: 22/Jan/13  Resolved: 22/Jan/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: None

Type: Bug Priority: Blocker
Reporter: Oleg Drokin Assignee: Jinshan Xiong (Inactive)
Resolution: Duplicate Votes: 0
Labels: HB

Severity: 3
Rank (Obsolete): 6193

 Description   

Juts run racer on freshly tagged 2.3.59 and it died very fat with this assertion.

[  756.853670] LustreError: 25456:0:(lov_object.c:595:lov_layout_change()) ASSERTION( list_empty(&hdr->coh_locks) ) failed: 
[  756.854210] LustreError: 25456:0:(lov_object.c:595:lov_layout_change()) LBUG
[  756.854498] Pid: 25456, comm: ll_sa_25403
[  756.854724] 
[  756.854725] Call Trace:
[  756.855090]  [<ffffffffa0449915>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
[  756.855355]  [<ffffffffa0449f17>] lbug_with_loc+0x47/0xb0 [libcfs]
[  756.855617]  [<ffffffffa0a711af>] lov_layout_change+0x2df/0x350 [lov]
[  756.855874]  [<ffffffffa0a714b7>] lov_conf_set+0x297/0x6a0 [lov]
[  756.856124]  [<ffffffffa045a401>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
[  756.856384]  [<ffffffffa04629eb>] ? cfs_hash_add_unique+0x1b/0x40 [libcfs]
[  756.856807]  [<ffffffffa061ad08>] cl_conf_set+0x58/0x100 [obdclass]
[  756.857073]  [<ffffffffa0dd71c8>] ll_layout_conf+0x78/0x250 [lustre]
[  756.857334]  [<ffffffffa0de62ea>] ll_layout_lock_set+0x40a/0xd30 [lustre]
[  756.857599]  [<ffffffffa0947320>] ? lmv_set_lock_data+0x100/0x480 [lmv]
[  756.857862]  [<ffffffffa0de73ef>] ll_layout_refresh+0x7df/0xd80 [lustre]
[  756.858128]  [<ffffffffa0e0a560>] ? ll_md_blocking_ast+0x0/0x720 [lustre]
[  756.858411]  [<ffffffffa0765370>] ? ldlm_completion_ast+0x0/0x950 [ptlrpc]
[  756.858685]  [<ffffffffa0e2d2ed>] vvp_io_fini+0xfd/0x1b0 [lustre]
[  756.858939]  [<ffffffffa0a7b7ad>] ? lov_io_fini+0x16d/0x3d0 [lov]
[  756.859202]  [<ffffffffa062c807>] cl_io_fini+0x77/0x260 [obdclass]
[  756.859460]  [<ffffffffa0e24985>] cl_glimpse_size0+0xc5/0x1d0 [lustre]
[  756.859725]  [<ffffffffa0e1cde2>] ll_agl_trigger+0xd2/0x350 [lustre]
[  756.859983]  [<ffffffffa0e233a7>] ll_statahead_thread+0x2d7/0xf40 [lustre]
[  756.860245]  [<ffffffff81057d60>] ? default_wake_function+0x0/0x20
[  756.860501]  [<ffffffffa0e230d0>] ? ll_statahead_thread+0x0/0xf40 [lustre]
[  756.860776]  [<ffffffff8100c14a>] child_rip+0xa/0x20
[  756.861012]  [<ffffffffa0e230d0>] ? ll_statahead_thread+0x0/0xf40 [lustre]
[  756.861279]  [<ffffffffa0e230d0>] ? ll_statahead_thread+0x0/0xf40 [lustre]
[  756.861537]  [<ffffffff8100c140>] ? child_rip+0x0/0x20
[  756.861769] 
[  756.885413] Kernel panic - not syncing: LBUG

Crashdump and modules are in /exports/crashdumps/192.168.10.210-2013-01-18-21\:37\:33



 Comments   
Comment by Oleg Drokin [ 18/Jan/13 ]

Ok, this does seem to be hitting very often, so upgrading to blocker.
More crashes:
/exports/crashdumps/192.168.10.210-2013-01-18-22\:13\:44
/exports/crashdumps/192.168.10.218-2013-01-18-21\:57\:37

Comment by Jinshan Xiong (Inactive) [ 22/Jan/13 ]

LU-2652

Generated at Sat Feb 10 01:27:02 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.