[LU-7438] racer test_1 test failed: LustreError: 11008:0:(ldlm_lock.c:921:ldlm_lock_decref_and_cancel()) ASSERTION( lock != ((void *)0) ) failed: LustreError: 11008:0:(ldlm_lock.c:921:ldlm_lock_decref_and_cancel()) LBUG
Created: 17/Nov/15
Updated: 28/Jan/21
Resolved: 28/Jan/21

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: parinay v kondekar (Inactive)
Assignee: WC Triage
Resolution: Cannot Reproduce
Votes: 0
Labels: None
Environment:

Interop 2.5.x <-> master patchless client (2.7.62)


Attachments: 1.lctl.tgz (File), vmcore-dmesg.txt (Text File)
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Configuration: 4 nodes (1 MDS, 1 OSS, 2 clients)
Release
2.6.32_431.17.1.69.x86_64
2.6.32_431.29.2.el6.x86_64_g70e90c3

Server 2.5.1.x6
Client 2.7.62

1.console.fre0207.log
LustreError: 11008:0:(ldlm_lock.c:921:ldlm_lock_decref_and_cancel()) ASSERTION( lock != ((void *)0) ) failed: 
LustreError: 11008:0:(ldlm_lock.c:921:ldlm_lock_decref_and_cancel()) LBUG

dmesg.txt
<4>Lustre: Mounted lustre-client
<4>Lustre: DEBUG MARKER: Using TIMEOUT=20
<4>Lustre: DEBUG MARKER: == racer test 1: racer on clients: fre0207,fre0208 DURATION=900 == 00:33:34 (1447547614)
<0>LustreError: 11008:0:(ldlm_lock.c:921:ldlm_lock_decref_and_cancel()) ASSERTION( lock != ((void *)0) ) failed: 
<0>LustreError: 11008:0:(ldlm_lock.c:921:ldlm_lock_decref_and_cancel()) LBUG
<4>Pid: 11008, comm: getfattr
<4>
<4>Call Trace:
<4> [<ffffffffa02a7875>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
<4> [<ffffffffa02a7e77>] lbug_with_loc+0x47/0xb0 [libcfs]
<4> [<ffffffffa05be3ba>] ldlm_lock_decref_and_cancel+0x14a/0x150 [ptlrpc]
<4> [<ffffffffa092c4e0>] ll_xattr_cache_refill+0x9b0/0x1eb0 [lustre]
<4> [<ffffffffa09216b0>] ? ll_md_blocking_ast+0x0/0x7d0 [lustre]
<4> [<ffffffffa05d0530>] ? ldlm_completion_ast+0x0/0x9b0 [ptlrpc]
<4> [<ffffffffa092daa9>] ll_xattr_cache_get+0xc9/0x4e0 [lustre]
<4> [<ffffffffa09291e5>] ll_getxattr_common+0x375/0xee0 [lustre]
<4> [<ffffffffa092a8e8>] ll_listxattr+0x78/0x3b0 [lustre]
<4> [<ffffffff811b0a50>] vfs_listxattr+0x50/0x90
<4> [<ffffffff811b0ad0>] listxattr+0x40/0xf0
<4> [<ffffffff811b0c69>] sys_listxattr+0x59/0x90
<4> [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
<4>
<0>Kernel panic - not syncing: LBUG
<4>Pid: 11008, comm: getfattr Not tainted 2.6.32-431.29.2.el6.x86_64 #1
<4>Call Trace:
<4> [<ffffffff8152873c>] ? panic+0xa7/0x16f
<4> [<ffffffffa02a7ecb>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
<4> [<ffffffffa05be3ba>] ? ldlm_lock_decref_and_cancel+0x14a/0x150 [ptlrpc]
<4> [<ffffffffa092c4e0>] ? ll_xattr_cache_refill+0x9b0/0x1eb0 [lustre]
<4> [<ffffffffa09216b0>] ? ll_md_blocking_ast+0x0/0x7d0 [lustre]
<4> [<ffffffffa05d0530>] ? ldlm_completion_ast+0x0/0x9b0 [ptlrpc]
<4> [<ffffffffa092daa9>] ? ll_xattr_cache_get+0xc9/0x4e0 [lustre]
<4> [<ffffffffa09291e5>] ? ll_getxattr_common+0x375/0xee0 [lustre]
<4> [<ffffffffa092a8e8>] ? ll_listxattr+0x78/0x3b0 [lustre]
<4> [<ffffffff811b0a50>] ? vfs_listxattr+0x50/0x90
<4> [<ffffffff811b0ad0>] ? listxattr+0x40/0xf0
<4> [<ffffffff811b0c69>] ? sys_listxattr+0x59/0x90
<4> [<ffffffff8100b072>] ? system_call_fastpath+0x16/0x1b

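When comparing this trace with the similar failures, it can help to reduce the dmesg output to a list of frames. The following is a minimal sketch, not part of the original report: the helper name `parse_frames` and the `SAMPLE` excerpt (copied from the trace above) are illustrative assumptions, and the regular expression only targets the frame format seen in this log. Frames marked with `?` are flagged as unreliable, so the call path `ll_listxattr` -> `ll_xattr_cache_refill` -> `ldlm_lock_decref_and_cancel` stands out.

```python
import re

# Matches dmesg call-trace frames such as:
#   <4> [<ffffffffa05be3ba>] ? ldlm_lock_decref_and_cancel+0x14a/0x150 [ptlrpc]
# The "?" marker and the trailing "[module]" are both optional.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?P<unreliable>\?\s+)?"
    r"(?P<func>[\w.]+)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_frames(text):
    """Return one dict per stack frame found in a dmesg excerpt (hypothetical helper)."""
    frames = []
    for line in text.splitlines():
        m = FRAME_RE.search(line)
        if m:
            frames.append({
                "func": m.group("func"),
                # Frames without a [module] suffix belong to the core kernel.
                "module": m.group("module") or "kernel",
                "offset": int(m.group("off"), 16),
                # A "?" before the symbol means the unwinder is unsure of the frame.
                "reliable": m.group("unreliable") is None,
            })
    return frames


# Excerpt copied from the first call trace in this report.
SAMPLE = """\
<4> [<ffffffffa02a7e77>] lbug_with_loc+0x47/0xb0 [libcfs]
<4> [<ffffffffa05be3ba>] ldlm_lock_decref_and_cancel+0x14a/0x150 [ptlrpc]
<4> [<ffffffffa092c4e0>] ll_xattr_cache_refill+0x9b0/0x1eb0 [lustre]
<4> [<ffffffffa09216b0>] ? ll_md_blocking_ast+0x0/0x7d0 [lustre]
<4> [<ffffffff811b0a50>] vfs_listxattr+0x50/0x90
"""

if __name__ == "__main__":
    for frame in parse_frames(SAMPLE):
        if frame["reliable"]:
            print(f"{frame['module']}: {frame['func']}+{frame['offset']:#x}")
```

Running the same helper over the attached vmcore-dmesg.txt and over the log from another failing run would give two frame lists that can be compared directly.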


 Comments   
Comment by James Nunez (Inactive) [ 20/Nov/15 ]

Looks like a similar problem:
2015-11-19 10:39:53 - https://testing.hpdd.intel.com/test_sets/c908f466-8edf-11e5-b140-5254006e85c2

Comment by Andreas Dilger [ 28/Jan/21 ]

Have not seen reports of this failing again.
