[LU-6190] replay-single test_73a: (ldlm_lib.c:1389:target_finish_recovery()) LBUG Created: 02/Feb/15  Updated: 22/Jul/18  Resolved: 22/Jul/18

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: Mikhail Pershin
Resolution: Cannot Reproduce Votes: 0
Labels: zfs
Environment:

server and client: lustre-master build # 2835 RHEL6 zfs


Severity: 3
Rank (Obsolete): 17313

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/216f2a96-a7df-11e4-95cf-5254006e85c2.

The sub-test test_73a failed with the following error:

test failed to respond and timed out

MDT console

22:45:17:Lustre: lustre-MDT0000-lwp-MDT0000: Connection restored to lustre-MDT0000 (at 0@lo)
22:45:17:LustreError: 4271:0:(ldlm_lib.c:1387:target_finish_recovery()) lustre-MDT0000: Recovery queues ( lock ) are not empty
22:45:17:LustreError: 4271:0:(ldlm_lib.c:1389:target_finish_recovery()) LBUG
22:45:17:Pid: 4271, comm: tgt_recov
22:45:17:
22:45:17:Call Trace:
22:45:17: [<ffffffffa0606895>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
22:45:17: [<ffffffffa0606e97>] lbug_with_loc+0x47/0xb0 [libcfs]
22:45:17: [<ffffffffa095b328>] target_recovery_thread+0x19f8/0x1a10 [ptlrpc]
22:45:17: [<ffffffffa0959930>] ? target_recovery_thread+0x0/0x1a10 [ptlrpc]
22:45:17: [<ffffffff8109abf6>] kthread+0x96/0xa0
22:45:17: [<ffffffff8100c20a>] child_rip+0xa/0x20
22:45:17: [<ffffffff8109ab60>] ? kthread+0x0/0xa0
22:45:17: [<ffffffff8100c200>] ? child_rip+0x0/0x20
22:45:17:
22:45:17:Kernel panic - not syncing: LBUG
22:45:17:Pid: 4271, comm: tgt_recov Tainted: P           ---------------    2.6.32-431.29.2.el6_lustre.g52480e5.x86_64 #1
22:45:17:Call Trace:
22:45:17: [<ffffffff81528fdc>] ? panic+0xa7/0x16f
22:45:17: [<ffffffffa0606eeb>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
22:45:17: [<ffffffffa095b328>] ? target_recovery_thread+0x19f8/0x1a10 [ptlrpc]
22:45:17: [<ffffffffa0959930>] ? target_recovery_thread+0x0/0x1a10 [ptlrpc]
22:45:17: [<ffffffff8109abf6>] ? kthread+0x96/0xa0
22:45:17: [<ffffffff8100c20a>] ? child_rip+0xa/0x20
22:45:17: [<ffffffff8109ab60>] ? kthread+0x0/0xa0
22:45:17: [<ffffffff8100c200>] ? child_rip+0x0/0x20
22:45:17:Initializing cgroup subsys cpuset
22:45:17:Initializing cgroup subsys cpu


 Comments   
Comment by Vladimir Saveliev [ 08/Sep/17 ]
22:45:17:LustreError: 4271:0:(ldlm_lib.c:1387:target_finish_recovery()) lustre-MDT0000: Recovery queues ( lock ) are not empty

This could be a result of lightweight reconnect during recovery and could be fixed in course of -LU-6629-.

Generated at Sat Feb 10 01:58:02 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.