Details
Bug
Resolution: Cannot Reproduce
Major
None
Lustre 2.7.0
MDSCOUNT=4
3
17254
Description
While verifying patch http://review.whamcloud.com/11750 on master branch with MDSCOUNT=4, conf-sanity test 32c hung as follows:
CMD: onyx-40vm3 umount -d /tmp/t32/mnt/mdt
Console log on MDS:
Lustre: DEBUG MARKER: umount -d /tmp/t32/mnt/mdt
Lustre: Failing over t32fs-MDT0000
Lustre: Skipped 2 previous similar messages
general protection fault: 0000 [#1] SMP
last sysfs file: /sys/devices/system/cpu/online
CPU 1
Modules linked in: osd_ldiskfs(U) ldiskfs(U) lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) mdd(U) mgs(U) lquota(U) lfsck(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc_gss(U) ptlrpc(U) obdclass(U) ksocklnd(U) lnet(U) libcfs(U) sha512_generic sha256_generic jbd2 nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk pata_acpi ata_generic ata_piix virtio_pci virtio_ring virtio dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]

Pid: 22975, comm: umount Not tainted 2.6.32-431.29.2.el6_lustre.gffd1fc2.x86_64 #1 Red Hat KVM
RIP: 0010:[<ffffffffa03d4e8b>] [<ffffffffa03d4e8b>] fld_local_lookup+0x5b/0x290 [fld]
RSP: 0018:ffff88006493f858 EFLAGS: 00010282
RAX: ffff880065c62640 RBX: ffff880065c62640 RCX: 0000000000000000
RDX: ffff88007ad94000 RSI: ffffffffa03dc580 RDI: ffff88006493fbe8
RBP: ffff88006493f898 R08: 20737365636f7250 R09: 0a64657265746e65
R10: 20737365636f7250 R11: 0a64657265746e65 R12: 5a5a5a5a5a5a5a5a
R13: 0000000000000245 R14: ffff8800649e1088 R15: ffff8800649e0000
FS: 00007ffac1edb740(0000) GS:ffff880002300000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00007fe5c9eb3000 CR3: 0000000055c17000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process umount (pid: 22975, threadinfo ffff88006493e000, task ffff8800649ceaa0)
Stack:
 ffff880000000010 ffff88006493f8b8 ffff88006493f878 0000000000000000
<d> 0000000000000245 ffff880063a52000 ffff88006493fbe8 ffff88005ab0b040
<d> ffff88006493f8a8 ffffffffa1616fd9 ffff88006493f8f8 ffffffffa161755e
Call Trace:
 [<ffffffffa1616fd9>] osd_fld_lookup+0x49/0xd0 [osd_ldiskfs]
 [<ffffffffa161755e>] osd_remote_fid+0xce/0x450 [osd_ldiskfs]
 [<ffffffffa162713e>] osd_index_ea_lookup+0x68e/0xd30 [osd_ldiskfs]
 [<ffffffffa04111b1>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [<ffffffffa058c24f>] dt_lookup_dir+0x7f/0x1e0 [obdclass]
 [<ffffffffa054e365>] llog_osd_open+0x475/0xb30 [obdclass]
 [<ffffffffa053f515>] llog_open+0x145/0x470 [obdclass]
 [<ffffffffa053fe8b>] llog_erase+0x10b/0x1e0 [obdclass]
 [<ffffffffa0dcb5f0>] mgs_erase_log+0x80/0x2c0 [mgs]
 [<ffffffffa0dd3952>] mgs_erase_logs+0x3e2/0x4d0 [mgs]
 [<ffffffffa0dd3a55>] mgs_params_fsdb_cleanup+0x15/0x20 [mgs]
 [<ffffffffa0dc453d>] mgs_device_fini+0x11d/0x590 [mgs]
 [<ffffffffa0575072>] class_cleanup+0x552/0xd10 [obdclass]
 [<ffffffffa0555b56>] ? class_name2dev+0x56/0xe0 [obdclass]
 [<ffffffffa057781a>] class_process_config+0x1fea/0x27c0 [obdclass]
 [<ffffffffa04111b1>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [<ffffffffa0570825>] ? lustre_cfg_new+0x435/0x630 [obdclass]
 [<ffffffffa0578111>] class_manual_cleanup+0x121/0x870 [obdclass]
 [<ffffffffa0555b56>] ? class_name2dev+0x56/0xe0 [obdclass]
 [<ffffffffa05b06bf>] server_put_super+0x81f/0xe50 [obdclass]
 [<ffffffff8118b61b>] generic_shutdown_super+0x5b/0xe0
 [<ffffffff8118b706>] kill_anon_super+0x16/0x60
 [<ffffffffa057a366>] lustre_kill_super+0x36/0x60 [obdclass]
 [<ffffffff8118bea7>] deactivate_super+0x57/0x80
 [<ffffffff811ab8af>] mntput_no_expire+0xbf/0x110
 [<ffffffff811ac3fb>] sys_umount+0x7b/0x3a0
 [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
Code: 74 0e 8b 35 f8 d8 05 00 85 f6 0f 88 e0 00 00 00 48 89 df 48 c7 c6 80 c5 3d a0 e8 51 fc 1a 00 48 85 c0 48 89 c3 0f 84 1e 01 00 00 <49> 8b 7c 24 18 48 8d 50 18 4c 89 ee e8 64 c1 ff ff 85 c0 75 78
RIP [<ffffffffa03d4e8b>] fld_local_lookup+0x5b/0x290 [fld]
 RSP <ffff88006493f858>
Maloo report: https://testing.hpdd.intel.com/test_sets/6ac30566-a440-11e4-a785-5254006e85c2
Close old bug that hasn't been seen in a long time.