Lustre / LU-6720

recovery-small test_111: mds crashed in lod_sub_recovery_thread

Details

    • Bug
    • Resolution: Duplicate
    • Minor

    Description

      This issue was created by maloo for Bob Glossman <bob.glossman@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/74fb7d4a-11d9-11e5-a29c-5254006e85c2.

      The sub-test test_111 failed with the following error:

      test failed to respond and timed out
      

      The MDS panicked. From the logs:

      04:59:24:BUG: unable to handle kernel NULL pointer dereference at 0000000000000088
      04:59:25:IP: [<ffffffffa1b17f66>] lod_sub_recovery_thread+0x896/0x980 [lod]
      04:59:25:PGD 63d7e067 PUD 68064067 PMD 0 
      04:59:25:Oops: 0002 [#1] SMP 
      04:59:25:last sysfs file: /sys/devices/pci0000:00/0000:00:04.0/virtio0/block/vda/queue/scheduler
      04:59:25:CPU 1 
      04:59:25:Modules linked in: osp(U) mdd(U) lod(U) mdt(U) lfsck(U) mgs(U) mgc(U) osd_ldiskfs(U) lquota(U) lustre(U) lov(U) mdc(U) fid(U) lmv(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) libcfs(U) ldiskfs(U) sha512_generic jbd2 nfs fscache nfsd lockd nfs_acl auth_rpcgss sunrpc exportfs autofs4 ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr ipv6 microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: libcfs]
      04:59:25:
      04:59:25:Pid: 11201, comm: lod0000_rec0001 Not tainted 2.6.32-504.16.2.el6_lustre.x86_64 #1 Red Hat KVM
      04:59:25:RIP: 0010:[<ffffffffa1b17f66>]  [<ffffffffa1b17f66>] lod_sub_recovery_thread+0x896/0x980 [lod]
      04:59:25:RSP: 0018:ffff88007149fe50  EFLAGS: 00010246
      04:59:25:RAX: 0000000000000000 RBX: ffff88005b00c720 RCX: 0000000000000000
      04:59:25:RDX: 0000000000000001 RSI: 0000000000000003 RDI: ffff880037aba800
      04:59:25:RBP: ffff88007149fee0 R08: 64707520746f6720 R09: 73676f6c20657461
      04:59:26:R10: 6c61206d6f726620 R11: 0a2e7354444d206c R12: ffff88007afbb140
      04:59:26:R13: ffff88004c0c90b0 R14: 0000000000000000 R15: ffff88007149fe70
      04:59:26:FS:  0000000000000000(0000) GS:ffff880002300000(0000) knlGS:0000000000000000
      04:59:26:CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      04:59:26:CR2: 0000000000000088 CR3: 000000007cb16000 CR4: 00000000000006e0
      04:59:26:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      04:59:26:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      04:59:26:Process lod0000_rec0001 (pid: 11201, threadinfo ffff88007149e000, task ffff88007149d520)
      04:59:27:Stack:
      04:59:27: 0000000000000000 ffff88007c331ac0 ffff8800559eb800 ffff88007afbb178
      04:59:27:<d> 0000000210000081 0000000000000000 ffff8800645bb400 ffff88007149fe88
      04:59:27:<d> ffff88007149fe88 0000000000000389 0000000000000000 0000000000000000
      04:59:27:Call Trace:
      04:59:27: [<ffffffffa1b176d0>] ? lod_sub_recovery_thread+0x0/0x980 [lod]
      04:59:27: [<ffffffff8109e71e>] kthread+0x9e/0xc0
      04:59:27: [<ffffffff8100c20a>] child_rip+0xa/0x20
      04:59:27: [<ffffffff8109e680>] ? kthread+0x0/0xc0
      04:59:27: [<ffffffff8100c200>] ? child_rip+0x0/0x20
      04:59:27:Code: 00 00 c7 05 d1 73 04 00 00 00 08 00 31 c0 49 8b 55 00 48 83 c2 0c e8 ba dc c7 fe 49 8b 45 10 31 c9 ba 01 00 00 00 be 03 00 00 00 <80> 88 88 00 00 00 01 49 8b 7d 00 48 81 c7 a8 03 00 00 e8 83 3d 
      04:59:27:RIP  [<ffffffffa1b17f66>] lod_sub_recovery_thread+0x896/0x980 [lod]
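
The register dump above can be cross-checked with a short script. This is a minimal sketch, not part of the original report. It assumes the marked bytes in the Code line (`<80> 88 88 00 00 00 01`) decode to an 8-bit OR of 1 into the byte at RAX + 0x88, preceded by a load of RAX from 0x10(%r13); that decoding is a reading of the opcode bytes, not something the report states. The helper names are illustrative only. The script also decodes R08 to R11, which appear to hold leftover ASCII text rather than pointers.

```python
import re
import struct

# Register lines copied from the oops above (timestamps dropped).
OOPS = """\
RAX: 0000000000000000 RBX: ffff88005b00c720 RCX: 0000000000000000
RBP: ffff88007149fee0 R08: 64707520746f6720 R09: 73676f6c20657461
R10: 6c61206d6f726620 R11: 0a2e7354444d206c R12: ffff88007afbb140
CR2: 0000000000000088 CR3: 000000007cb16000 CR4: 00000000000006e0
"""


def parse_registers(text):
    """Map register names such as 'RAX' or 'CR2' to integer values."""
    pairs = re.findall(r"\b([A-Z][A-Z0-9]{1,2}): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}


def as_text(value):
    """Interpret a 64-bit register as 8 little-endian bytes of text."""
    return struct.pack("<Q", value).decode("latin-1")


regs = parse_registers(OOPS)

# Assumed displacement taken from the faulting instruction bytes.
DISPLACEMENT = 0x88
fault_address = regs["RAX"] + DISPLACEMENT

# R08..R11 read as consecutive chunks of one string.
message = "".join(as_text(regs[r]) for r in ("R08", "R09", "R10", "R11"))

print("computed fault address:", hex(fault_address))
print("matches CR2:", fault_address == regs["CR2"])
print("text in R08..R11:", repr(message))
```

Under these assumptions the computed address equals CR2 (0x88), which is consistent with the pointer loaded into RAX being NULL at the time of the write. The report itself does not state the root cause, so this only narrows down where to look in lod_sub_recovery_thread.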
      

      Info required for matching: recovery-small 111

            People

              Assignee: Di Wang (di.wang)
              Reporter: Maloo (maloo)
              Votes: 0
              Watchers: 3
