[LU-6164] conf-sanity test 32c: RIP: fld_local_lookup+0x5b/0x290 [fld]

Details

    • Type: Bug
    • Resolution: Cannot Reproduce
    • Priority: Major
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.7.0
    • Environment: MDSCOUNT=4
    • Severity: 3
    • Rank (Obsolete): 17254

    Description

      While verifying patch http://review.whamcloud.com/11750 on the master branch with MDSCOUNT=4, conf-sanity test 32c hung while unmounting the MDT:

      CMD: onyx-40vm3 umount -d /tmp/t32/mnt/mdt
      

      Console log on MDS:

      Lustre: DEBUG MARKER: umount -d /tmp/t32/mnt/mdt
      Lustre: Failing over t32fs-MDT0000
      Lustre: Skipped 2 previous similar messages
      general protection fault: 0000 [#1] SMP
      last sysfs file: /sys/devices/system/cpu/online
      CPU 1
      Modules linked in: osd_ldiskfs(U) ldiskfs(U) lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) mdd(U) mgs(U) lquota(U) lfsck(U) obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc_gss(U) ptlrpc(U) obdclass(U) ksocklnd(U) lnet(U) libcfs(U) sha512_generic sha256_generic jbd2 nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk pata_acpi ata_generic ata_piix virtio_pci virtio_ring virtio dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]
      
      Pid: 22975, comm: umount Not tainted 2.6.32-431.29.2.el6_lustre.gffd1fc2.x86_64 #1 Red Hat KVM
      RIP: 0010:[<ffffffffa03d4e8b>]  [<ffffffffa03d4e8b>] fld_local_lookup+0x5b/0x290 [fld]
      RSP: 0018:ffff88006493f858  EFLAGS: 00010282 
      RAX: ffff880065c62640 RBX: ffff880065c62640 RCX: 0000000000000000
      RDX: ffff88007ad94000 RSI: ffffffffa03dc580 RDI: ffff88006493fbe8
      RBP: ffff88006493f898 R08: 20737365636f7250 R09: 0a64657265746e65
      R10: 20737365636f7250 R11: 0a64657265746e65 R12: 5a5a5a5a5a5a5a5a
      R13: 0000000000000245 R14: ffff8800649e1088 R15: ffff8800649e0000
      FS:  00007ffac1edb740(0000) GS:ffff880002300000(0000) knlGS:0000000000000000
      CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
      CR2: 00007fe5c9eb3000 CR3: 0000000055c17000 CR4: 00000000000006e0
      DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      Process umount (pid: 22975, threadinfo ffff88006493e000, task ffff8800649ceaa0)
      Stack:
       ffff880000000010 ffff88006493f8b8 ffff88006493f878 0000000000000000
      <d> 0000000000000245 ffff880063a52000 ffff88006493fbe8 ffff88005ab0b040
      <d> ffff88006493f8a8 ffffffffa1616fd9 ffff88006493f8f8 ffffffffa161755e
      Call Trace:
       [<ffffffffa1616fd9>] osd_fld_lookup+0x49/0xd0 [osd_ldiskfs]
       [<ffffffffa161755e>] osd_remote_fid+0xce/0x450 [osd_ldiskfs]
       [<ffffffffa162713e>] osd_index_ea_lookup+0x68e/0xd30 [osd_ldiskfs]
       [<ffffffffa04111b1>] ? libcfs_debug_msg+0x41/0x50 [libcfs] 
       [<ffffffffa058c24f>] dt_lookup_dir+0x7f/0x1e0 [obdclass]
       [<ffffffffa054e365>] llog_osd_open+0x475/0xb30 [obdclass]
       [<ffffffffa053f515>] llog_open+0x145/0x470 [obdclass]
       [<ffffffffa053fe8b>] llog_erase+0x10b/0x1e0 [obdclass]
       [<ffffffffa0dcb5f0>] mgs_erase_log+0x80/0x2c0 [mgs]
       [<ffffffffa0dd3952>] mgs_erase_logs+0x3e2/0x4d0 [mgs]
       [<ffffffffa0dd3a55>] mgs_params_fsdb_cleanup+0x15/0x20 [mgs]
       [<ffffffffa0dc453d>] mgs_device_fini+0x11d/0x590 [mgs]
       [<ffffffffa0575072>] class_cleanup+0x552/0xd10 [obdclass]
       [<ffffffffa0555b56>] ? class_name2dev+0x56/0xe0 [obdclass]
       [<ffffffffa057781a>] class_process_config+0x1fea/0x27c0 [obdclass]
       [<ffffffffa04111b1>] ? libcfs_debug_msg+0x41/0x50 [libcfs] 
       [<ffffffffa0570825>] ? lustre_cfg_new+0x435/0x630 [obdclass]
       [<ffffffffa0578111>] class_manual_cleanup+0x121/0x870 [obdclass]
       [<ffffffffa0555b56>] ? class_name2dev+0x56/0xe0 [obdclass]
       [<ffffffffa05b06bf>] server_put_super+0x81f/0xe50 [obdclass]
       [<ffffffff8118b61b>] generic_shutdown_super+0x5b/0xe0
       [<ffffffff8118b706>] kill_anon_super+0x16/0x60
       [<ffffffffa057a366>] lustre_kill_super+0x36/0x60 [obdclass]
       [<ffffffff8118bea7>] deactivate_super+0x57/0x80
       [<ffffffff811ab8af>] mntput_no_expire+0xbf/0x110
       [<ffffffff811ac3fb>] sys_umount+0x7b/0x3a0
       [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
      Code: 74 0e 8b 35 f8 d8 05 00 85 f6 0f 88 e0 00 00 00 48 89 df 48 c7 c6 80 c5 3d a0 e8 51 fc 1a 00 48 85 c0 48 89 c3 0f 84 1e 01 00 00 <49> 8b 7c 24 18 48 8d 50 18 4c 89 ee e8 64 c1 ff ff 85 c0 75 78 
      RIP  [<ffffffffa03d4e8b>] fld_local_lookup+0x5b/0x290 [fld]
       RSP <ffff88006493f858>
      

      Maloo report: https://testing.hpdd.intel.com/test_sets/6ac30566-a440-11e4-a785-5254006e85c2
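
Triage note: a quick way to read a register dump like the one in the console log above is to flag values that cannot be valid kernel pointers. The sketch below is only an aid, not a root-cause analysis. It assumes 48-bit virtual addressing (4-level paging), and all helper names are hypothetical. A register holding one repeated byte such as 0x5a resembles a memory poison pattern. Lustre's free macros are understood to poison freed memory with 0x5a, which would suggest a use-after-free, but that is an assumption this ticket does not confirm.

```python
import re

# Register lines copied from the console log above.
OOPS = """\
RAX: ffff880065c62640 RBX: ffff880065c62640 RCX: 0000000000000000
RDX: ffff88007ad94000 RSI: ffffffffa03dc580 RDI: ffff88006493fbe8
RBP: ffff88006493f898 R08: 20737365636f7250 R09: 0a64657265746e65
R10: 20737365636f7250 R11: 0a64657265746e65 R12: 5a5a5a5a5a5a5a5a
R13: 0000000000000245 R14: ffff8800649e1088 R15: ffff8800649e0000
"""


def parse_registers(text):
    """Return {name: value} for every 'REG: <16 hex digits>' pair."""
    pairs = re.findall(r"\b(R[A-Z0-9]{2}): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}


def is_canonical(addr):
    """Assuming 48-bit addressing: bits 63..47 must be all 0s or all 1s."""
    top = addr >> 47
    return top == 0 or top == 0x1FFFF


def repeated_byte(value):
    """Return the byte if all eight bytes are identical, else None."""
    raw = value.to_bytes(8, "little")
    return raw[0] if len(set(raw)) == 1 else None


def as_ascii(value):
    """Decode the value as little-endian text if every byte is printable."""
    raw = value.to_bytes(8, "little")
    if all(b == 0x0A or 0x20 <= b < 0x7F for b in raw):
        return raw.decode("ascii")
    return None


def suspicious(regs):
    """Map each non-canonical register to a short description."""
    notes = {}
    for name, value in regs.items():
        if is_canonical(value):
            continue
        byte = repeated_byte(value)
        if byte is not None:
            notes[name] = "repeated byte 0x%02x" % byte
        else:
            notes[name] = "text %r" % as_ascii(value)
    return notes


if __name__ == "__main__":
    for name, note in sorted(suspicious(parse_registers(OOPS)).items()):
        print(name, note)
```

On this dump the check flags R12, the base register of the faulting instruction (`<49> 8b 7c 24 18` decodes to `mov 0x18(%r12),%rdi`). Dereferencing a non-canonical address raises a general protection fault on x86-64 rather than a page fault, which matches the first line of the oops. R08 through R11 are also flagged, but they hold leftover text bytes and are not dereferenced at the faulting instruction.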

        Activity

          adilger Andreas Dilger added a comment - Close old bug that hasn't been seen in a long time.

          People

            Assignee: wc-triage WC Triage
            Reporter: yujian Jian Yu
            Votes: 0
            Watchers: 6
