Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8267

sanity test_129: (osd_internal.h:1137:osd_trans_exec_check()) LBUG

    XMLWordPrintable

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Minor
    • None
    • Lustre 2.9.0
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Minh Diep <minh.diep@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/54c6bed6-2fe2-11e6-80b9-5254006e85c2.

      The sub-test test_129 failed with the following error:

      test failed to respond and timed out
      

      22:48:48:LustreError: 5717:0:(osd_internal.h:1137:osd_trans_exec_check()) LBUG
      22:48:48:Pid: 5717, comm: mdt00_001
      22:48:48:
      22:48:48:Call Trace:
      22:48:48: [<ffffffffa0471875>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
      22:48:48: [<ffffffffa0471e77>] lbug_with_loc+0x47/0xb0 [libcfs]
      22:48:48: [<ffffffffa0c493ea>] osd_write+0x31a/0x5b0 [osd_ldiskfs]
      22:48:48: [<ffffffffa058d72d>] dt_record_write+0x3d/0x130 [obdclass]
      22:48:48: [<ffffffffa054fcd5>] llog_osd_write_rec+0xd55/0x1ad0 [obdclass]
      22:48:48: [<ffffffffa0c5bb9b>] ? dynlock_unlock+0x16b/0x1d0 [osd_ldiskfs]
      22:48:48: [<ffffffffa053d506>] llog_write_rec+0xb6/0x270 [obdclass]
      22:48:48: [<ffffffffa0546f23>] llog_cat_add_rec+0x1c3/0x7b0 [obdclass]
      22:48:48: [<ffffffffa053d319>] llog_add+0x89/0x1c0 [obdclass]
      22:48:48: [<ffffffffa0fbc48d>] osp_sync_add_rec+0x26d/0x9c0 [osp]
      22:48:48: [<ffffffffa0fbcc87>] osp_sync_add+0x77/0x80 [osp]
      22:48:48: [<ffffffffa0eeb05e>] ? lod_sub_get_thandle+0x24e/0x3c0 [lod]
      22:48:48: [<ffffffffa0fad823>] osp_object_destroy+0x173/0x230 [osp]
      22:48:48: [<ffffffffa0eee5dd>] lod_sub_object_destroy+0x1fd/0x440 [lod]
      22:48:48: [<ffffffffa0eeee5d>] ? lod_sub_object_ref_del+0x1fd/0x440 [lod]
      22:48:48: [<ffffffffa0ee24cb>] lod_object_destroy+0x36b/0x770 [lod]
      22:48:48: [<ffffffffa0f54605>] mdd_create+0xdb5/0x1770 [mdd]
      22:48:48: [<ffffffffa0e19418>] mdo_create+0x18/0x50 [mdt]
      22:48:48: [<ffffffffa0e21df3>] mdt_reint_open+0x1fc3/0x2fd0 [mdt]
      22:48:48: [<ffffffffa05a1c8c>] ? upcall_cache_get_entry+0x29c/0x880 [obdclass]
      22:48:48: [<ffffffffa05a6bf0>] ? lu_ucred+0x20/0x30 [obdclass]
      22:48:48: [<ffffffffa0dec145>] ? mdt_ucred+0x15/0x20 [mdt]
      22:48:48: [<ffffffffa0e0a11d>] mdt_reint_rec+0x5d/0x200 [mdt]
      22:48:48: [<ffffffffa0df58ab>] mdt_reint_internal+0x62b/0x9f0 [mdt]
      22:48:48: [<ffffffffa0df5e66>] mdt_intent_reint+0x1f6/0x440 [mdt]
      22:48:48: [<ffffffffa0df39ce>] mdt_intent_policy+0x4be/0xc70 [mdt]
      22:48:48: [<ffffffffa07406c7>] ldlm_lock_enqueue+0x127/0x990 [ptlrpc]
      22:48:48: [<ffffffffa076b467>] ldlm_handle_enqueue0+0x807/0x14d0 [ptlrpc]
      22:48:48: [<ffffffffa07f1ab1>] tgt_enqueue+0x61/0x230 [ptlrpc]
      22:48:48: [<ffffffffa07f256c>] tgt_request_handle+0x8ec/0x1440 [ptlrpc]
      22:48:48: [<ffffffffa079f101>] ptlrpc_main+0xd21/0x1800 [ptlrpc]
      22:48:48: [<ffffffffa079e3e0>] ? ptlrpc_main+0x0/0x1800 [ptlrpc]
      22:48:48: [<ffffffff810a138e>] kthread+0x9e/0xc0
      22:48:48: [<ffffffff8100c28a>] child_rip+0xa/0x20
      22:48:48: [<ffffffff810a12f0>] ? kthread+0x0/0xc0
      22:48:48: [<ffffffff8100c280>] ? child_rip+0x0/0x20
      22:48:48:
      22:48:48:Kernel panic - not syncing: LBUG
      22:48:48:Pid: 5717, comm: mdt00_001 Not tainted 2.6.32-573.26.1.el6_lustre.x86_64 #1
      22:48:48:Call Trace:
      22:48:48: [<ffffffff81539407>] ? panic+0xa7/0x16f
      22:48:48: [<ffffffffa0471ecb>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
      22:48:48: [<ffffffffa0c493ea>] ? osd_write+0x31a/0x5b0 [osd_ldiskfs]
      22:48:48: [<ffffffffa058d72d>] ? dt_record_write+0x3d/0x130 [obdclass]
      22:48:48: [<ffffffffa054fcd5>] ? llog_osd_write_rec+0xd55/0x1ad0 [obdclass]
      22:48:48: [<ffffffffa0c5bb9b>] ? dynlock_unlock+0x16b/0x1d0 [osd_ldiskfs]
      22:48:48: [<ffffffffa053d506>] ? llog_write_rec+0xb6/0x270 [obdclass]
      22:48:48: [<ffffffffa0546f23>] ? llog_cat_add_rec+0x1c3/0x7b0 [obdclass]
      22:48:48: [<ffffffffa053d319>] ? llog_add+0x89/0x1c0 [obdclass]
      22:48:48: [<ffffffffa0fbc48d>] ? osp_sync_add_rec+0x26d/0x9c0 [osp]
      22:48:48: [<ffffffffa0fbcc87>] ? osp_sync_add+0x77/0x80 [osp]
      22:48:48: [<ffffffffa0eeb05e>] ? lod_sub_get_thandle+0x24e/0x3c0 [lod]
      22:48:48: [<ffffffffa0fad823>] ? osp_object_destroy+0x173/0x230 [osp]
      22:48:48: [<ffffffffa0eee5dd>] ? lod_sub_object_destroy+0x1fd/0x440 [lod]
      22:48:48: [<ffffffffa0eeee5d>] ? lod_sub_object_ref_del+0x1fd/0x440 [lod]
      22:48:48: [<ffffffffa0ee24cb>] ? lod_object_destroy+0x36b/0x770 [lod]
      22:48:48: [<ffffffffa0f54605>] ? mdd_create+0xdb5/0x1770 [mdd]
      22:48:48: [<ffffffffa0e19418>] ? mdo_create+0x18/0x50 [mdt]
      22:48:48: [<ffffffffa0e21df3>] ? mdt_reint_open+0x1fc3/0x2fd0 [mdt]
      22:48:48: [<ffffffffa05a1c8c>] ? upcall_cache_get_entry+0x29c/0x880 [obdclass]
      22:48:48: [<ffffffffa05a6bf0>] ? lu_ucred+0x20/0x30 [obdclass]
      22:48:48: [<ffffffffa0dec145>] ? mdt_ucred+0x15/0x20 [mdt]
      22:48:48: [<ffffffffa0e0a11d>] ? mdt_reint_rec+0x5d/0x200 [mdt]
      22:48:48: [<ffffffffa0df58ab>] ? mdt_reint_internal+0x62b/0x9f0 [mdt]
      22:48:48: [<ffffffffa0df5e66>] ? mdt_intent_reint+0x1f6/0x440 [mdt]
      22:48:48: [<ffffffffa0df39ce>] ? mdt_intent_policy+0x4be/0xc70 [mdt]
      22:48:48: [<ffffffffa07406c7>] ? ldlm_lock_enqueue+0x127/0x990 [ptlrpc]
      22:48:48: [<ffffffffa076b467>] ? ldlm_handle_enqueue0+0x807/0x14d0 [ptlrpc]
      22:48:48: [<ffffffffa07f1ab1>] ? tgt_enqueue+0x61/0x230 [ptlrpc]
      22:48:48: [<ffffffffa07f256c>] ? tgt_request_handle+0x8ec/0x1440 [ptlrpc]
      22:48:48: [<ffffffffa079f101>] ? ptlrpc_main+0xd21/0x1800 [ptlrpc]
      22:48:48: [<ffffffffa079e3e0>] ? ptlrpc_main+0x0/0x1800 [ptlrpc]
      22:48:48: [<ffffffff810a138e>] ? kthread+0x9e/0xc0
      22:48:48: [<ffffffff8100c28a>] ? child_rip+0xa/0x20
      22:48:48: [<ffffffff810a12f0>] ? kthread+0x0/0xc0
      22:48:48: [<ffffffff8100c280>] ? child_rip+0x0/0x20
      22:48:48:Initializing cgroup subsys cpuset
      22:48:48:Initializing cgroup subsys cpu
      Please provide additional information about the failure here.

      Info required for matching: sanity 129

      Attachments

        Issue Links

          Activity

            People

              bzzz Alex Zhuravlev
              mdiep Minh Diep
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: