Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7782

sanity-scrub test_2: NULL pointer dereference at 0x10 in lu_context_key_get() on mds2

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.9.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Andreas Dilger <andreas.dilger@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/f0270640-d4f4-11e5-9e3f-5254006e85c2.

      This is testing patch http://review.whamcloud.com/18442 which is changing sanity-scrub.sh scrub_prep() to use test_mkdir -i instead of mkdir and lfs mkdir explicitly for testing, so that it works with the upstream kernel (which doesn't have DNE support).

      The sub-test test_2 failed with the following error on MDS2:

      08:17:18:LustreError: 29698:0:(client.c:1133:ptlrpc_import_delay_req()) Skipped 40 previous similar messages
      08:17:18:BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
      08:17:18:IP: [<ffffffffa057bb57>] lu_context_key_get+0x1
      08:17:18:CPU 0 
      08:17:18:Pid: 29699, comm: osp_up2-1 Not tainted 2.6.32-573.12.1.el6_lustre.g93f956d.x86_64 #1 Red Hat KVM
      08:17:18:Call Trace:
      08:17:18: [<ffffffffa09ee83f>] fld_local_lookup+0x4f/0x290 [fld]
      08:17:18: [<ffffffffa09eec83>] fld_server_lookup+0x53/0x330 [fld]
      08:17:18: [<ffffffffa0e6e38f>] lod_fld_lookup+0x34f/0x520 [lod]
      08:17:18: [<ffffffffa0e84243>] lod_object_init+0x103/0x3c0 [lod]
      08:17:18: [<ffffffffa057f198>] lu_object_alloc+0xd8/0x320 [obdclass]
      08:17:18: [<ffffffffa0580581>] lu_object_find_try+0x151/0x260 [obdclass]
      08:17:18: [<ffffffffa0580741>] lu_object_find_at+0xb1/0xe0 [obdclass]
      08:17:18: [<ffffffffa05807af>] lu_object_find_slice+0x1f/0x80 [obdclass]
      08:17:18: [<ffffffffa0f79a4e>] osp_trans_stop_cb+0x1be/0x2d0 [osp]
      08:17:18: [<ffffffffa0f7b2be>] osp_update_interpret+0x21e/0x4a0 [osp]
      08:17:18: [<ffffffffa07900b5>] ptlrpc_check_set+0x615/0x1da0 [ptlrpc]
      08:17:18: [<ffffffffa0791b9a>] ptlrpc_set_wait+0x35a/0x960 [ptlrpc]
      08:17:18: [<ffffffffa0792221>] ptlrpc_queue_wait+0x81/0x220 [ptlrpc]
      08:17:18: [<ffffffffa0f7b9c6>] osp_send_update_req+0x256/0x850 [osp]
      08:17:18: [<ffffffffa0f7c63f>] osp_send_update_thread+0x20f/0x7ac [osp]
      08:17:18: [<ffffffff810a0fce>] kthread+0x9e/0xc0
      

      Please provide additional information about the failure here.

      It shouldn't be possible to cause the node to crash, no matter how the test directories are being created.

      Info required for matching: sanity-scrub 2

      Attachments

        Issue Links

          Activity

            People

              yong.fan nasf (Inactive)
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: