Lustre / LU-1554

LBUG when doing system cleanup after clean upgrade from 1.8.8 to 2.3

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • None
    • None
    • None
    • 3
    • 6375

    Description

      A clean upgrade from 1.8.8 to 2.3 completed successfully, and the post-upgrade system checks passed (quota, pools, data verification). Afterwards, while cleaning up the system, the MDS hit an LBUG and restarted. Here is the console message:

      Lustre: DEBUG MARKER: ===== Pass ==================================================================
      Lustre: DEBUG MARKER: Using TIMEOUT=20
      LNet: 11093:0:(debug.c:324:libcfs_debug_str2mask()) You are trying to use a numerical value for the mask - this will be deprecated in a future release.
      LustreError: 8010:0:(osd_internal.h:665:osd_fid2oi()) ASSERTION( !fid_is_igif(fid) ) failed:
      LustreError: 8010:0:(osd_internal.h:665:osd_fid2oi()) LBUG
      Pid: 8010, comm: mdt_02

      Call Trace:
      [<ffffffffa03a3905>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
      [<ffffffffa03a3f17>] lbug_with_loc+0x47/0xb0 [libcfs]
      [<ffffffffa0e2d505>] osd_oi_delete+0x2e5/0x470 [osd_ldiskfs]
      [<ffffffffa0e26154>] osd_object_destroy+0x234/0x420 [osd_ldiskfs]
      [<ffffffffa0cf9e80>] mdd_object_kill+0xb0/0x290 [mdd]
      [<ffffffffa0d106c9>] mdd_finish_unlink+0x1f9/0x2f0 [mdd]
      [<ffffffffa0d16609>] mdd_unlink+0xa09/0xd60 [mdd]
      [<ffffffffa064e8f0>] ? ldlm_completion_ast+0x0/0x730 [ptlrpc]
      [<ffffffffa0d77a30>] ? mdt_blocking_ast+0x0/0x2a0 [mdt]
      [<ffffffffa0678294>] ? lustre_msg_get_versions+0xa4/0x120 [ptlrpc]
      [<ffffffffa0e7f027>] cml_unlink+0x97/0x200 [cmm]
      [<ffffffffa0d93b2f>] ? mdt_version_get_save+0x8f/0xd0 [mdt]
      [<ffffffffa0d957b4>] mdt_reint_unlink+0x634/0x9e0 [mdt]
      [<ffffffffa0d92b51>] mdt_reint_rec+0x41/0xe0 [mdt]
      [<ffffffffa0d8c3aa>] mdt_reint_internal+0x50a/0x810 [mdt]
      [<ffffffffa0d8c6f4>] mdt_reint+0x44/0xe0 [mdt]
      [<ffffffffa0d7e2a2>] mdt_handle_common+0x922/0x1740 [mdt]
      [<ffffffffa0d7f195>] mdt_regular_handle+0x15/0x20 [mdt]
      [<ffffffffa06858a2>] ptlrpc_server_handle_request+0x412/0xeb0 [ptlrpc]
      [<ffffffffa03a465e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
      [<ffffffffa03b4daf>] ? lc_watchdog_touch+0x6f/0x180 [libcfs]
      [<ffffffffa067e6d2>] ? ptlrpc_wait_event+0xb2/0x2c0 [ptlrpc]
      [<ffffffff81051ba3>] ? __wake_up+0x53/0x70
      [<ffffffffa0686b17>] ptlrpc_main+0x7d7/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c14a>] child_rip+0xa/0x20
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c140>] ? child_rip+0x0/0x20

      Kernel panic - not syncing: LBUG
      Pid: 8010, comm: mdt_02 Not tainted 2.6.32-220.17.1.el6_lustre.x86_64 #1
      Call Trace:
      [<ffffffff814eccea>] ? panic+0x78/0x143
      [<ffffffffa03a3f6b>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
      [<ffffffffa0e2d505>] ? osd_oi_delete+0x2e5/0x470 [osd_ldiskfs]
      [<ffffffffa0e26154>] ? osd_object_destroy+0x234/0x420 [osd_ldiskfs]
      [<ffffffffa0cf9e80>] ? mdd_object_kill+0xb0/0x290 [mdd]
      [<ffffffffa0d106c9>] ? mdd_finish_unlink+0x1f9/0x2f0 [mdd]
      [<ffffffffa0d16609>] ? mdd_unlink+0xa09/0xd60 [mdd]
      [<ffffffffa064e8f0>] ? ldlm_completion_ast+0x0/0x730 [ptlrpc]
      [<ffffffffa0d77a30>] ? mdt_blocking_ast+0x0/0x2a0 [mdt]
      [<ffffffffa0678294>] ? lustre_msg_get_versions+0xa4/0x120 [ptlrpc]
      [<ffffffffa0e7f027>] ? cml_unlink+0x97/0x200 [cmm]
      [<ffffffffa0d93b2f>] ? mdt_version_get_save+0x8f/0xd0 [mdt]
      [<ffffffffa0d957b4>] ? mdt_reint_unlink+0x634/0x9e0 [mdt]
      [<ffffffffa0d92b51>] ? mdt_reint_rec+0x41/0xe0 [mdt]
      [<ffffffffa0d8c3aa>] ? mdt_reint_internal+0x50a/0x810 [mdt]
      [<ffffffffa0d8c6f4>] ? mdt_reint+0x44/0xe0 [mdt]
      [<ffffffffa0d7e2a2>] ? mdt_handle_common+0x922/0x1740 [mdt]
      [<ffffffffa0d7f195>] ? mdt_regular_handle+0x15/0x20 [mdt]
      [<ffffffffa06858a2>] ? ptlrpc_server_handle_request+0x412/0xeb0 [ptlrpc]
      [<ffffffffa03a465e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
      [<ffffffffa03b4daf>] ? lc_watchdog_touch+0x6f/0x180 [libcfs]
      [<ffffffffa067e6d2>] ? ptlrpc_wait_event+0xb2/0x2c0 [ptlrpc]
      [<ffffffff81051ba3>] ? __wake_up+0x53/0x70
      [<ffffffffa0686b17>] ? ptlrpc_main+0x7d7/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c14a>] ? child_rip+0xa/0x20
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c140>] ? child_rip+0x0/0x20
      Initializing cgroup subsys cpuset
      Initializing cgroup subsys cpu
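
For triage, the console output above can be reduced to the failed assertion plus the stack frames the kernel did not mark with `?` (a `?` frame is an address found on the stack that the unwinder could not confirm). The following is a minimal Python sketch and is not part of the original report. The helper names `find_assertion` and `reliable_frames` are hypothetical, and `SAMPLE` is a short excerpt copied from the log above.

```python
import re

# Stack frame lines look like:
#   [<ffffffffa0e2d505>] ? osd_oi_delete+0x2e5/0x470 [osd_ldiskfs]
# Group 1 is the optional "?" marker, group 2 the symbol, group 3 the module.
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(\?\s+)?([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s+\[(\w+)\])?"
)

# Assertion lines look like:
#   (osd_internal.h:665:osd_fid2oi()) ASSERTION( !fid_is_igif(fid) ) failed:
ASSERT_RE = re.compile(
    r"\(([\w.]+):(\d+):(\w+)\(\)\) ASSERTION\( (.+?) \) failed"
)


def find_assertion(log_text):
    """Return the first failed assertion in the log as a dict, or None."""
    match = ASSERT_RE.search(log_text)
    if match is None:
        return None
    source_file, line_no, function, expr = match.groups()
    return {
        "file": source_file,
        "line": int(line_no),
        "function": function,
        "expr": expr,
    }


def reliable_frames(log_text):
    """Return (symbol, module) pairs for frames not marked with '?'."""
    frames = []
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match and match.group(1) is None:
            frames.append((match.group(2), match.group(3)))
    return frames


# Excerpt from the console message in this report.
SAMPLE = """\
LustreError: 8010:0:(osd_internal.h:665:osd_fid2oi()) ASSERTION( !fid_is_igif(fid) ) failed:
[<ffffffffa03a3f17>] lbug_with_loc+0x47/0xb0 [libcfs]
[<ffffffffa0e2d505>] osd_oi_delete+0x2e5/0x470 [osd_ldiskfs]
[<ffffffffa064e8f0>] ? ldlm_completion_ast+0x0/0x730 [ptlrpc]
[<ffffffff81051ba3>] ? __wake_up+0x53/0x70
[<ffffffff8100c14a>] child_rip+0xa/0x20
"""

if __name__ == "__main__":
    print(find_assertion(SAMPLE))
    for symbol, module in reliable_frames(SAMPLE):
        # Frames without a bracketed module belong to the core kernel.
        print(symbol, module or "(core kernel)")
```

Run against the first call trace, this leaves the unlink path (`mdt_reint_unlink`, `cml_unlink`, `mdd_unlink`, `mdd_finish_unlink`, `mdd_object_kill`, `osd_object_destroy`, `osd_oi_delete`) that ends in the `osd_fid2oi()` assertion. In the second trace, printed after the kernel panic, every frame carries the `?` marker, so the filter should only be applied to the first one.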

            People

              wc-triage WC Triage
              sarah Sarah Liu