Details
-
Bug
-
Resolution: Duplicate
-
Minor
-
None
-
Lustre 2.7.0, Lustre 2.5.4
-
None
-
server: lustre-master build # 2639
client: 2.5.2
-
3
-
15678
Description
This issue was created by maloo for sarah <sarah@whamcloud.com>
This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/7e94d1f8-32e1-11e4-9c61-5254006e85c2.
The sub-test test_6 failed with the following error:
test failed to respond and timed out
test log
22:57:14:Lustre: DEBUG MARKER: == lustre-rsync-test test 6: lustre_rsync large no of hard links == 22:55:14 (1409637314) 22:57:14:BUG: soft lockup - CPU#0 stuck for 67s! [ll_sa_27678:27679] 22:57:14:Modules linked in: lustre(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) sha512_generic sha256_generic nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs] 22:57:14:BUG: soft lockup - CPU#1 stuck for 67s! [ptlrpcd_0:1399] 22:57:14:Modules linked in: lustre(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) sha512_generic sha256_generic nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs] 22:57:14:CPU 1 22:57:14:Modules linked in: lustre(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) sha512_generic sha256_generic nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs] 22:57:14: 22:57:14:Pid: 1399, comm: ptlrpcd_0 Not tainted 2.6.32-431.17.1.el6.x86_64 #1 Red Hat KVM 22:57:14:RIP: 0010:[<ffffffff8152a5ee>] [<ffffffff8152a5ee>] _spin_lock+0x1e/0x30 22:57:14:RSP: 0018:ffff88007c48dc20 EFLAGS: 00000206 22:57:14:RAX: 0000000000000002 RBX: ffff88007c48dc20 RCX: 5a5a5a5a5a5a5a5a 22:57:14:RDX: 0000000000000000 RSI: 0000000000000004 RDI: ffff88005302f440 22:57:14:RBP: ffffffff8100bb8e R08: 5a5a5a5a5a5a5a5a R09: 5a5a5a5a5a5a5a5a 22:57:14:R10: 0000000000000000 R11: 0000000000000078 R12: 0000000000000000 22:57:14:R13: ffff88007cbe12d8 R14: 0000000000000000 R15: ffffffff00000000 22:57:14:FS: 0000000000000000(0000) GS:ffff880002300000(0000) knlGS:0000000000000000 22:57:14:CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b 22:57:14:CR2: 0000000001eb81c0 CR3: 0000000001a85000 CR4: 00000000000006e0 22:57:14:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 22:57:14:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 22:57:14:Process ptlrpcd_0 (pid: 1399, threadinfo ffff88007c48c000, task ffff88007a29d540) 22:57:14:Stack: 22:57:14: ffff88007c48dc80 ffffffffa1083c5b 0000000000000000 ffff880049195f48 22:57:14:<d> ffff88005302f440 ffff880060ac6800 ffff880060ac6800 ffff88007adc0480 22:57:14:<d> ffff880060ac6800 ffff880049195e00 0000000000000000 ffff88007c641400 22:57:14:Call Trace: 22:57:14: [<ffffffffa1083c5b>] ? ll_statahead_interpret+0x7b/0x560 [lustre] 22:57:14: [<ffffffffa0efa736>] ? mdc_intent_getattr_async_interpret+0x1f6/0x540 [mdc] 22:57:14: [<ffffffffa0d3caec>] ? ptlrpc_check_set+0x2bc/0x1b50 [ptlrpc] 22:57:14: [<ffffffffa0d6804b>] ? ptlrpcd_check+0x53b/0x560 [ptlrpc] 22:57:14: [<ffffffffa0d6856b>] ? ptlrpcd+0x20b/0x370 [ptlrpc] 22:57:14: [<ffffffff81061d00>] ? default_wake_function+0x0/0x20 22:57:14: [<ffffffffa0d68360>] ? ptlrpcd+0x0/0x370 [ptlrpc] 22:57:14: [<ffffffff8109ab56>] ? kthread+0x96/0xa0 22:57:14: [<ffffffff8100c20a>] ? child_rip+0xa/0x20 22:57:14: [<ffffffff8109aac0>] ? kthread+0x0/0xa0 22:57:14: [<ffffffff8100c200>] ? child_rip+0x0/0x20 22:57:14:Code: 00 00 00 01 74 05 e8 22 40 d6 ff c9 c3 55 48 89 e5 0f 1f 44 00 00 b8 00 00 01 00 f0 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 <0f> 1f 44 00 00 83 3f 00 75 f4 eb df c9 c3 0f 1f 40 00 55 48 89 22:57:14:Call Trace: 22:57:14: [<ffffffffa1083c5b>] ? ll_statahead_interpret+0x7b/0x560 [lustre] 22:57:14: [<ffffffffa0efa736>] ? mdc_intent_getattr_async_interpret+0x1f6/0x540 [mdc] 22:57:14: [<ffffffffa0d3caec>] ? ptlrpc_check_set+0x2bc/0x1b50 [ptlrpc] 22:57:14: [<ffffffffa0d6804b>] ? ptlrpcd_check+0x53b/0x560 [ptlrpc] 22:57:14: [<ffffffffa0d6856b>] ? ptlrpcd+0x20b/0x370 [ptlrpc] 22:57:14: [<ffffffff81061d00>] ? default_wake_function+0x0/0x20 22:57:14: [<ffffffffa0d68360>] ? ptlrpcd+0x0/0x370 [ptlrpc] 22:57:14: [<ffffffff8109ab56>] ? kthread+0x96/0xa0 22:57:14: [<ffffffff8100c20a>] ? child_rip+0xa/0x20 22:57:14: [<ffffffff8109aac0>] ? kthread+0x0/0xa0 22:57:14: [<ffffffff8100c200>] ? child_rip+0x0/0x20 22:57:14:CPU 0 22:58:35:Modules linked in: lustre(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) libcfs(U) sha512_generic sha256_generic nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs] 22:58:35: 22:58:35:Pid: 27679, comm: ll_sa_27678 Not tainted 2.6.32-431.17.1.el6.x86_64 #1 Red Hat KVM 22:58:35:RIP: 0010:[<ffffffff8152a5ee>] [<ffffffff8152a5ee>] _spin_lock+0x1e/0x30 22:58:35:RSP: 0018:ffff880068113d50 EFLAGS: 00000206 22:58:35:RAX: 0000000000000001 RBX: ffff880068113d50 RCX: 0000000000000000 22:58:35:RDX: 0000000000000000 RSI: ffff8800531765c0 RDI: ffff88005302f440 22:58:35:RBP: ffffffff8100bb8e R08: 0000000000000000 R09: 00000000fffffffe 22:58:35:R10: 0000000000000000 R11: 0000000000000001 R12: ffff880068113d30 22:58:35:R13: ffff8800228a4400 R14: 0000000000001000 R15: 0000000000000000 22:58:35:FS: 0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000 22:58:35:CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b 22:58:35:CR2: 00000000019ea0f8 CR3: 00000000253dd000 CR4: 00000000000006f0 22:58:35:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 22:58:35:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 22:58:35:Process ll_sa_27678 (pid: 27679, threadinfo ffff880068112000, task ffff88002879aaa0) 22:58:35:Stack: 22:58:35: ffff880068113dc0 ffffffffa1084cc0 ffff88005302f178 ffff880033595670 22:58:35:<d> 0000000000000000 ffff88005302f080 ffff88005dc395c0 ffff88002513f600 22:58:35:<d> ffff880068113dc0 ffff880028433000 ffff880028433170 ffff88005302f400 22:58:35:Call Trace: 22:58:35: [<ffffffffa1084cc0>] ? ll_post_statahead+0x50/0xa20 [lustre] 22:58:35: [<ffffffffa1088ce8>] ? ll_statahead_thread+0x228/0xfb0 [lustre] 22:58:35: [<ffffffff81061d00>] ? default_wake_function+0x0/0x20 22:58:35: [<ffffffffa1088ac0>] ? ll_statahead_thread+0x0/0xfb0 [lustre] 22:58:35: [<ffffffff8109ab56>] ? kthread+0x96/0xa0 22:58:35: [<ffffffff8100c20a>] ? child_rip+0xa/0x20 22:58:35: [<ffffffff8109aac0>] ? kthread+0x0/0xa0 22:58:35: [<ffffffff8100c200>] ? child_rip+0x0/0x20 22:58:35:Code: 00 00 00 01 74 05 e8 22 40 d6 ff c9 c3 55 48 89 e5 0f 1f 44 00 00 b8 00 00 01 00 f0 0f c1 07 0f b7 d0 c1 e8 10 39 c2 74 0e f3 90 <0f> 1f 44 00 00 83 3f 00 75 f4 eb df c9 c3 0f 1f 40 00 55 48 89 22:58:35:Call Trace: 22:58:35: [<ffffffffa1084cc0>] ? ll_post_statahead+0x50/0xa20 [lustre] 22:58:35: [<ffffffffa1088ce8>] ? ll_statahead_thread+0x228/0xfb0 [lustre] 22:58:35: [<ffffffff81061d00>] ? default_wake_function+0x0/0x20 22:58:35: [<ffffffffa1088ac0>] ? ll_statahead_thread+0x0/0xfb0 [lustre] 22:58:35: [<ffffffff8109ab56>] ? kthread+0x96/0xa0 22:58:35: [<ffffffff8100c20a>] ? child_rip+0xa/0x20 22:58:35: [<ffffffff8109aac0>] ? kthread+0x0/0xa0 22:58:35: [<ffffffff8100c200>] ? child_rip+0x0/0x20
Info required for matching: lustre-rsync-test 6
Attachments
Issue Links
- duplicates
-
LU-4410 sanityn test 40a: BUG: soft lockup - CPU#0 stuck for 67s! [ptlrpcd_0:2892]
- Resolved