Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7782

sanity-scrub test_2: NULL pointer dereference at 0x10 in lu_context_key_get() on mds2

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.9.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Andreas Dilger <andreas.dilger@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/f0270640-d4f4-11e5-9e3f-5254006e85c2.

      This is testing patch http://review.whamcloud.com/18442 which is changing sanity-scrub.sh scrub_prep() to use test_mkdir -i instead of mkdir and lfs mkdir explicitly for testing, so that it works with the upstream kernel (which doesn't have DNE support).

      The sub-test test_2 failed with the following error on MDS2:

      08:17:18:LustreError: 29698:0:(client.c:1133:ptlrpc_import_delay_req()) Skipped 40 previous similar messages
      08:17:18:BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
      08:17:18:IP: [<ffffffffa057bb57>] lu_context_key_get+0x1
      08:17:18:CPU 0 
      08:17:18:Pid: 29699, comm: osp_up2-1 Not tainted 2.6.32-573.12.1.el6_lustre.g93f956d.x86_64 #1 Red Hat KVM
      08:17:18:Call Trace:
      08:17:18: [<ffffffffa09ee83f>] fld_local_lookup+0x4f/0x290 [fld]
      08:17:18: [<ffffffffa09eec83>] fld_server_lookup+0x53/0x330 [fld]
      08:17:18: [<ffffffffa0e6e38f>] lod_fld_lookup+0x34f/0x520 [lod]
      08:17:18: [<ffffffffa0e84243>] lod_object_init+0x103/0x3c0 [lod]
      08:17:18: [<ffffffffa057f198>] lu_object_alloc+0xd8/0x320 [obdclass]
      08:17:18: [<ffffffffa0580581>] lu_object_find_try+0x151/0x260 [obdclass]
      08:17:18: [<ffffffffa0580741>] lu_object_find_at+0xb1/0xe0 [obdclass]
      08:17:18: [<ffffffffa05807af>] lu_object_find_slice+0x1f/0x80 [obdclass]
      08:17:18: [<ffffffffa0f79a4e>] osp_trans_stop_cb+0x1be/0x2d0 [osp]
      08:17:18: [<ffffffffa0f7b2be>] osp_update_interpret+0x21e/0x4a0 [osp]
      08:17:18: [<ffffffffa07900b5>] ptlrpc_check_set+0x615/0x1da0 [ptlrpc]
      08:17:18: [<ffffffffa0791b9a>] ptlrpc_set_wait+0x35a/0x960 [ptlrpc]
      08:17:18: [<ffffffffa0792221>] ptlrpc_queue_wait+0x81/0x220 [ptlrpc]
      08:17:18: [<ffffffffa0f7b9c6>] osp_send_update_req+0x256/0x850 [osp]
      08:17:18: [<ffffffffa0f7c63f>] osp_send_update_thread+0x20f/0x7ac [osp]
      08:17:18: [<ffffffff810a0fce>] kthread+0x9e/0xc0
      

      Please provide additional information about the failure here.

      It shouldn't be possible to cause the node to crash, no matter how the test directories are being created.

      Info required for matching: sanity-scrub 2

      Attachments

        Issue Links

          Activity

            [LU-7782] sanity-scrub test_2: NULL pointer dereference at 0x10 in lu_context_key_get() on mds2
            pjones Peter Jones added a comment -

            Landed for 2.9

            pjones Peter Jones added a comment - Landed for 2.9

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/21506/
            Subject: LU-7782 scrub: handle slave obj of striped directory
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 842bda9c5b41eef9e43dc3e00f05767147611677

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/21506/ Subject: LU-7782 scrub: handle slave obj of striped directory Project: fs/lustre-release Branch: master Current Patch Set: Commit: 842bda9c5b41eef9e43dc3e00f05767147611677

            Fan Yong (fan.yong@intel.com) uploaded a new patch: http://review.whamcloud.com/21506
            Subject: LU-7782 scrub: handle slave obj of striped directory
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: bcde560652d19f66c0ddf650e895d620e87e3537

            gerrit Gerrit Updater added a comment - Fan Yong (fan.yong@intel.com) uploaded a new patch: http://review.whamcloud.com/21506 Subject: LU-7782 scrub: handle slave obj of striped directory Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: bcde560652d19f66c0ddf650e895d620e87e3537
            green Oleg Drokin added a comment -

            The patch here was reverted because it appears to be causing multiple issues tracked under LU-8399, LU-8416 and others and just the fix in LU-8399 was not enough to fix it.

            green Oleg Drokin added a comment - The patch here was reverted because it appears to be causing multiple issues tracked under LU-8399 , LU-8416 and others and just the fix in LU-8399 was not enough to fix it.

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/21313/
            Subject: Revert "LU-7782 scrub: handle slave obj of striped directory"
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 0f37c051158a399f7b00536eeec27f5dbdd54168

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/21313/ Subject: Revert " LU-7782 scrub: handle slave obj of striped directory" Project: fs/lustre-release Branch: master Current Patch Set: Commit: 0f37c051158a399f7b00536eeec27f5dbdd54168

            Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: http://review.whamcloud.com/21313
            Subject: Revert "LU-7782 scrub: handle slave obj of striped directory"
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 60f7f0815007dffd709de93698bba3bd2380535c

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) uploaded a new patch: http://review.whamcloud.com/21313 Subject: Revert " LU-7782 scrub: handle slave obj of striped directory" Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 60f7f0815007dffd709de93698bba3bd2380535c

            Patches have landed to master for 2.9.0

            jgmitter Joseph Gmitter (Inactive) added a comment - Patches have landed to master for 2.9.0

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/18801/
            Subject: LU-7782 scrub: handle slave obj of striped directory
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 80fe81c5b14835bbd5d751e878edbd00fe90f797

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/18801/ Subject: LU-7782 scrub: handle slave obj of striped directory Project: fs/lustre-release Branch: master Current Patch Set: Commit: 80fe81c5b14835bbd5d751e878edbd00fe90f797

            Bob Glossman (bob.glossman@intel.com) uploaded a new patch: http://review.whamcloud.com/19313
            Subject: LU-7782 osp: save env for update callback
            Project: fs/lustre-release
            Branch: b2_8
            Current Patch Set: 1
            Commit: 079971d9943fa6f218c3b4188f0f6574e97b341d

            gerrit Gerrit Updater added a comment - Bob Glossman (bob.glossman@intel.com) uploaded a new patch: http://review.whamcloud.com/19313 Subject: LU-7782 osp: save env for update callback Project: fs/lustre-release Branch: b2_8 Current Patch Set: 1 Commit: 079971d9943fa6f218c3b4188f0f6574e97b341d

            seen in b2_8. I think the fix only went into master after the branch was made.

            https://testing.hpdd.intel.com/test_sets/b2502df8-f950-11e5-812a-5254006e85c2

            bogl Bob Glossman (Inactive) added a comment - seen in b2_8. I think the fix only went into master after the branch was made. https://testing.hpdd.intel.com/test_sets/b2502df8-f950-11e5-812a-5254006e85c2

            People

              yong.fan nasf (Inactive)
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: