[LU-546] 1.8<->2.1 interop: LBUG: ASSERTION(service->srv_n_queued_reqs == 0) failed Created: 28/Jul/11  Updated: 29/Jul/11  Resolved: 29/Jul/11

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.0, Lustre 1.8.6
Fix Version/s: Lustre 2.1.0, Lustre 1.8.7

Type: Bug Priority: Major
Reporter: Jian Yu Assignee: Robert Read (Inactive)
Resolution: Duplicate Votes: 0
Labels: None
Environment:

Lustre Clients:
Tag: 1.8.6-wc1
Distro/Arch: RHEL6/x86_64 (kernel version: 2.6.32_131.2.1.el6)
Build: http://newbuild.whamcloud.com/job/lustre-b1_8/100/arch=x86_64,build_type=client,distro=el6,ib_stack=inkernel/
Network: IB (inkernel OFED)
ENABLE_QUOTA=yes

Lustre Servers:
Tag: v2_0_66_0
Distro/Arch: RHEL6/x86_64 (kernel version: 2.6.32-131.2.1.el6_lustre)
Build: http://newbuild.whamcloud.com/job/lustre-master/228/arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel/
Network: IB (inkernel OFED)


Severity: 3
Rank (Obsolete): 6574

 Description   

While running sanity-quota test 18b, unmounting the MDS hit the following LBUG on the MDS node:

LustreError: 28451:0:(service.c:2704:ptlrpc_unregister_service()) LBUG
Pid: 28451, comm: obd_zombid

Call Trace:
[<ffffffffa0427855>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
[<ffffffffa0427e95>] lbug_with_loc+0x75/0xe0 [libcfs]
[<ffffffffa0432cb6>] libcfs_assertion_failed+0x66/0x70 [libcfs]
[<ffffffffa09efa03>] ptlrpc_unregister_service+0xb83/0xc20 [ptlrpc]
[<ffffffff8105dc72>] ? default_wake_function+0x12/0x20
[<ffffffff8104af29>] ? __wake_up_common+0x59/0x90
[<ffffffff8104f843>] ? __wake_up+0x53/0x70
[<ffffffffa0b9facc>] mgs_cleanup+0x4c/0x220 [mgs]
[<ffffffffa088275a>] class_decref+0x19a/0x610 [obdclass]
[<ffffffff810dbebe>] ? call_rcu+0xe/0x10
[<ffffffffa0b9f32f>] ? mgs_destroy_export+0x3f/0x110 [mgs]
[<ffffffffa086dac5>] obd_zombie_impexp_cull+0x335/0x5a0 [obdclass]
[<ffffffff8108e51c>] ? remove_wait_queue+0x3c/0x50
[<ffffffffa086de35>] obd_zombie_impexp_thread+0x105/0x270 [obdclass]
[<ffffffff8105dc60>] ? default_wake_function+0x0/0x20
[<ffffffff8100c1ca>] child_rip+0xa/0x20
[<ffffffffa086dd30>] ? obd_zombie_impexp_thread+0x0/0x270 [obdclass]
[<ffffffff8100c1c0>] ? child_rip+0x0/0x20

Kernel panic - not syncing: LBUG
Pid: 28451, comm: obd_zombid Tainted: G           ---------------- T 2.6.32-131.2.1.el6_lustre.x86_64 #1
Call Trace:
[<ffffffff814db1b8>] ? panic+0x78/0x143
[<ffffffffa0427eeb>] ? lbug_with_loc+0xcb/0xe0 [libcfs]
[<ffffffffa0432cb6>] ? libcfs_assertion_failed+0x66/0x70 [libcfs]
[<ffffffffa09efa03>] ? ptlrpc_unregister_service+0xb83/0xc20 [ptlrpc]
[<ffffffff8105dc72>] ? default_wake_function+0x12/0x20
[<ffffffff8104af29>] ? __wake_up_common+0x59/0x90
[<ffffffff8104f843>] ? __wake_up+0x53/0x70
[<ffffffffa0b9facc>] ? mgs_cleanup+0x4c/0x220 [mgs]
[<ffffffffa088275a>] ? class_decref+0x19a/0x610 [obdclass]
[<ffffffff810dbebe>] ? call_rcu+0xe/0x10
[<ffffffffa0b9f32f>] ? mgs_destroy_export+0x3f/0x110 [mgs]
[<ffffffffa086dac5>] ? obd_zombie_impexp_cull+0x335/0x5a0 [obdclass]
[<ffffffff8108e51c>] ? remove_wait_queue+0x3c/0x50
[<ffffffffa086de35>] ? obd_zombie_impexp_thread+0x105/0x270 [obdclass]
[<ffffffff8105dc60>] ? default_wake_function+0x0/0x20
[<ffffffff8100c1ca>] ? child_rip+0xa/0x20
[<ffffffffa086dd30>] ? obd_zombie_impexp_thread+0x0/0x270 [obdclass]
[<ffffffff8100c1c0>] ? child_rip+0x0/0x20

Maloo report: https://maloo.whamcloud.com/test_sets/b4fa75a0-b8db-11e0-8bdf-52540025f9af



 Comments   
Comment by Jian Yu [ 28/Jul/11 ]

Is this a duplicate of LU-292?

Comment by Peter Jones [ 29/Jul/11 ]

Oleg agrees YuJian so closing as a duplicate

Generated at Sat Feb 10 01:08:06 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.