Details
-
Bug
-
Resolution: Unresolved
-
Minor
-
None
-
None
-
None
-
3
-
9223372036854775807
Description
This issue was created by maloo for Bob Glossman <bob.glossman@intel.com>
This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/20f5e3d8-9bfe-11e6-a46c-5254006e85c2.
The sub-test test_10a failed with the following error:
test failed to respond and timed out
The test got a timeout during:
umount -d /mnt/mds3
This may be the same as some previous bug in sanity-scrub.
I can't tell so I am raising it as a new bug. I'll let an expert decide if it's a dup.
Info required for matching: sanity-scrub 10a
from session console log https://testing.hpdd.intel.com/test_logs/932207f2-8e88-11e7-882a-5254006e85c2/show_text
22:20:50:LustreError: 17101:0:(ldlm_lib.c:2565:target_stop_recovery_thread()) lustre-MDT0002: Aborting recovery
22:20:50:general protection fault: 0000 [#1] SMP
22:20:50:last sysfs file: /sys/devices/system/cpu/online
22:20:50:CPU 0
22:20:50:Modules linked in: osp(U) mdd(U) lod(U) mdt(U) lfsck(U) mgs(U) mgc(U) osd_ldiskfs(U) lquota(U) lustre(U) lov(U) mdc(U) fid(U) lmv(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) sha512_generic crc32c_intel libcfs(U) ldiskfs(U) jbd2 nfsd exportfs autofs4 nfs lockd fscache auth_rpcgss nfs_acl sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr ipv6 microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
22:20:50:
22:20:50:Pid: 16031, comm: tgt_recover_2 Not tainted 2.6.32-573.26.1.el6_lustre.g948c890.x86_64 #1 Red Hat KVM
22:20:50:RIP: 0010:[<ffffffffa0824e75>] [<ffffffffa0824e75>] update_recovery_exec+0xe5/0x1d20 [ptlrpc]
22:20:50:RSP: 0018:ffff880043d47c70 EFLAGS: 00010246
22:20:50:RAX: ffff880046a5a080 RBX: ffff88005b84d048 RCX: ffff8800787c95d0
22:20:50:RDX: ffff88005b84d048 RSI: 5a5a5a5a5a5a5a5a RDI: ffff88007b6fad40
22:20:50:RBP: ffff880043d47d60 R08: 0000000000000000 R09: 00000000000006b1
22:20:50:R10: 000000000000000a R11: 8000000000000000 R12: 0000000000000001
22:20:50:R13: ffff88004e973920 R14: ffff88007d1202c0 R15: ffff88005b84d000
22:20:50:FS: 0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
22:20:50:CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
22:20:50:CR2: 00007fa180d8d000 CR3: 00000000784f0000 CR4: 00000000000406f0
22:20:50:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
22:20:50:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
22:20:50:Process tgt_recover_2 (pid: 16031, threadinfo ffff880043d44000, task ffff880043d43520)
22:20:50:Stack:
22:20:50: ffff880043d47cc0 0000000000000001 ffff88005b84d108 ffff88005b84d010
22:20:50:<d> 0000000000080000 ffff880043d44000 0000000000008050 ffff88004e971490
22:20:50:<d> ffff88005b84d000 0000000000000000 ffff88005797fdc0 ffff88005797f8f8
22:20:50:Call Trace:
22:20:50: [<ffffffffa08286b1>] distribute_txn_replay_handle+0x271/0xcf0 [ptlrpc]
22:20:50: [<ffffffffa076b532>] target_recovery_thread+0xa12/0x1dd0 [ptlrpc]
22:20:50: [<ffffffff81067662>] ? default_wake_function+0x12/0x20
22:20:50: [<ffffffffa076ab20>] ? target_recovery_thread+0x0/0x1dd0 [ptlrpc]
22:20:50: [<ffffffff810a138e>] kthread+0x9e/0xc0
22:20:50: [<ffffffff8100c28a>] child_rip+0xa/0x20
22:20:50: [<ffffffff810a12f0>] ? kthread+0x0/0xc0
22:20:50: [<ffffffff8100c280>] ? child_rip+0x0/0x20
22:20:50:Code: 48 89 8d 68 ff ff ff 0f 1f 80 00 00 00 00 66 83 7b 10 10 0f 84 9b 02 00 00 48 8b 45 98 48 8b 7d a8 45 31 c0 48 89 da 48 8b 70 20 <48> 8b 46 18 48 8b 48 10 e8 1e 76 d6 ff 48 3d 00 f0 ff ff 49 89
22:20:50:RIP [<ffffffffa0824e75>] update_recovery_exec+0xe5/0x1d20 [ptlrpc]
22:20:50: RSP <ffff880043d47c70>
22:20:50:Initializing cgroup subsys cpuset
22:20:50:Initializing cgroup subsys cpu
22:20:50:Linux version 2.6.32-573.26.1.el6_lustre.g948c890.x86_64 (jenkins@onyx-13-sdb1-el6-x8664.onyx.hpdd.intel.com) (gcc version 4.4.7 20120313 (Red Hat 4.4.7-16) (GCC) ) #1 SMP Wed Sep 28 13:53:41 PDT 2016
22:20:50:Command line: ro root=UUID=9267feeb-da3f-41dc-b7d8-f0fc891ad430 rd_NO_LUKS rd_NO_LVM LANG=en_US.UTF-8 rd_NO_MD console=tty0 SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM console=ttyS0,115200 irqpoll nr_cpus=1 reset_devices cgroup_disable=memory mce=off acpi_no_memhotplug disable_cpu_apicid=0 memmap=exactmap memmap=627K@4K memmap=131449K@49779K elfcorehdr=181228K memmap=4K$0K memmap=9K$631K memmap=64K$960K memmap=12K$2097140K memmap=272K$4194032K
Attachments
Issue Links
- is related to
-
LU-8472 sanity-scrub test_5 times out
- Resolved