Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-6272

sanity-lfsck test_17: MDS deadlock

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Blocker
    • Lustre 2.7.0
    • Lustre 2.7.0
    • None
    • 3
    • 17586

    Description

      This issue was created by maloo for Oleg Drokin <green@whamcloud.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/2e423416-ba54-11e4-a7c7-5254006e85c2.

      The sub-test test_17 failed with the following error:

      test failed to respond and timed out
      

      It looks like there a MDS deadlock

      23:07:57:INFO: task mdt00_000:6380 blocked for more than 120 seconds.
      23:07:57:      Not tainted 2.6.32-504.8.1.el6_lustre.g0ef66b1.x86_64 #1
      23:07:57:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      23:07:57:mdt00_000     D 0000000000000001     0  6380      2 0x00000080
      23:07:57: ffff88006ca2b940 0000000000000046 0000000000000000 0000000000000000
      23:07:57: ffff88007af106c0 ffff880079699300 ffff88007b104000 ffff880079699300
      23:07:57: ffff88006ca2b940 ffffffffa05f62af ffff880078429098 ffff88006ca2bfd8
      23:07:57:Call Trace:
      23:07:57: [<ffffffffa05f62af>] ? lu_object_find_try+0x9f/0x260 [obdclass]
      23:07:57: [<ffffffffa05f64ad>] lu_object_find_at+0x3d/0xe0 [obdclass]
      23:07:57: [<ffffffffa0fad725>] ? lod_index_lookup+0x25/0x30 [lod]
      23:07:57: [<ffffffff81064b90>] ? default_wake_function+0x0/0x20
      23:07:57: [<ffffffffa05f6566>] lu_object_find+0x16/0x20 [obdclass]
      23:07:57: [<ffffffffa0ebe056>] mdt_object_find+0x56/0x170 [mdt]
      23:07:57: [<ffffffffa0ef5407>] mdt_reint_open+0x1527/0x2c70 [mdt]
      23:07:57: [<ffffffffa04ae83c>] ? upcall_cache_get_entry+0x29c/0x880 [libcfs]
      23:07:57: [<ffffffffa06130b0>] ? lu_ucred+0x20/0x30 [obdclass]
      23:07:57: [<ffffffffa0edd0cd>] mdt_reint_rec+0x5d/0x200 [mdt]
      23:07:57: [<ffffffffa0ec123b>] mdt_reint_internal+0x4cb/0x7a0 [mdt]
      23:07:57: [<ffffffffa0ec1706>] mdt_intent_reint+0x1f6/0x430 [mdt]
      23:07:57: [<ffffffffa0ebfcf4>] mdt_intent_policy+0x494/0xce0 [mdt]
      23:07:57: [<ffffffffa07c24f9>] ldlm_lock_enqueue+0x129/0x9d0 [ptlrpc]
      23:07:57: [<ffffffffa07ee48b>] ldlm_handle_enqueue0+0x51b/0x13f0 [ptlrpc]
      23:07:57: [<ffffffffa086e951>] tgt_enqueue+0x61/0x230 [ptlrpc]
      23:07:57: [<ffffffffa086f59e>] tgt_request_handle+0x8be/0x1000 [ptlrpc]
      23:07:57: [<ffffffffa081f5c1>] ptlrpc_main+0xe41/0x1960 [ptlrpc]
      23:07:57: [<ffffffffa081e780>] ? ptlrpc_main+0x0/0x1960 [ptlrpc]
      23:07:57: [<ffffffff8109e66e>] kthread+0x9e/0xc0
      23:07:57: [<ffffffff8100c20a>] child_rip+0xa/0x20
      23:07:57: [<ffffffff8109e5d0>] ? kthread+0x0/0xc0
      23:07:57: [<ffffffff8100c200>] ? child_rip+0x0/0x20
      

      Info required for matching: sanity-lfsck 17

      Attachments

        Issue Links

          Activity

            People

              yong.fan nasf (Inactive)
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: