Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1554

LBUG when doing system cleanup after clean upgrade from 1.8.8 to 2.3

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Minor Minor
    • None
    • None
    • None
    • 3
    • 6375

      Clean upgrade from 1.8.8 to 2.3 successfully, system checking pass(quota, pools, verify data), after that when cleaning the system, MDS hit LBUG and restarted. Here is the console message:

      Lustre: DEBUG MARKER: ===== Pass ==================================================================
      Lustre: DEBUG MARKER: Using TIMEOUT=20
      LNet: 11093:0:(debug.c:324:libcfs_debug_str2mask()) You are trying to use a numerical value for the mask - this will be deprecated in a future release.
      LustreError: 8010:0:(osd_internal.h:665:osd_fid2oi()) ASSERTION( !fid_is_igif(fid) ) failed:
      LustreError: 8010:0:(osd_internal.h:665:osd_fid2oi()) LBUG
      Pid: 8010, comm: mdt_02

      Call Trace:
      [<ffffffffa03a3905>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
      [<ffffffffa03a3f17>] lbug_with_loc+0x47/0xb0 [libcfs]
      [<ffffffffa0e2d505>] osd_oi_delete+0x2e5/0x470 [osd_ldiskfs]
      [<ffffffffa0e26154>] osd_object_destroy+0x234/0x420 [osd_ldiskfs]
      [<ffffffffa0cf9e80>] mdd_object_kill+0xb0/0x290 [mdd]
      [<ffffffffa0d106c9>] mdd_finish_unlink+0x1f9/0x2f0 [mdd]
      [<ffffffffa0d16609>] mdd_unlink+0xa09/0xd60 [mdd]
      [<ffffffffa064e8f0>] ? ldlm_completion_ast+0x0/0x730 [ptlrpc]
      [<ffffffffa0d77a30>] ? mdt_blocking_ast+0x0/0x2a0 [mdt]
      [<ffffffffa0678294>] ? lustre_msg_get_versions+0xa4/0x120 [ptlrpc]
      [<ffffffffa0e7f027>] cml_unlink+0x97/0x200 [cmm]
      [<ffffffffa0d93b2f>] ? mdt_version_get_save+0x8f/0xd0 [mdt]
      [<ffffffffa0d957b4>] mdt_reint_unlink+0x634/0x9e0 [mdt]
      [<ffffffffa0d92b51>] mdt_reint_rec+0x41/0xe0 [mdt]
      [<ffffffffa0d8c3aa>] mdt_reint_internal+0x50a/0x810 [mdt]
      [<ffffffffa0d8c6f4>] mdt_reint+0x44/0xe0 [mdt]
      [<ffffffffa0d7e2a2>] mdt_handle_common+0x922/0x1740 [mdt]
      [<ffffffffa0d7f195>] mdt_regular_handle+0x15/0x20 [mdt]
      [<ffffffffa06858a2>] ptlrpc_server_handle_request+0x412/0xeb0 [ptlrpc]
      [<ffffffffa03a465e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
      [<ffffffffa03b4daf>] ? lc_watchdog_touch+0x6f/0x180 [libcfs]
      [<ffffffffa067e6d2>] ? ptlrpc_wait_event+0xb2/0x2c0 [ptlrpc]
      [<ffffffff81051ba3>] ? __wake_up+0x53/0x70
      [<ffffffffa0686b17>] ptlrpc_main+0x7d7/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c14a>] child_rip+0xa/0x20
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c140>] ? child_rip+0x0/0x20

      Kernel panic - not syncing: LBUG
      Pid: 8010, comm: mdt_02 Not tainted 2.6.32-220.17.1.el6_lustre.x86_64 #1
      Call Trace:
      [<ffffffff814eccea>] ? panic+0x78/0x143
      [<ffffffffa03a3f6b>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
      [<ffffffffa0e2d505>] ? osd_oi_delete+0x2e5/0x470 [osd_ldiskfs]
      [<ffffffffa0e26154>] ? osd_object_destroy+0x234/0x420 [osd_ldiskfs]
      [<ffffffffa0cf9e80>] ? mdd_object_kill+0xb0/0x290 [mdd]
      [<ffffffffa0d106c9>] ? mdd_finish_unlink+0x1f9/0x2f0 [mdd]
      [<ffffffffa0d16609>] ? mdd_unlink+0xa09/0xd60 [mdd]
      [<ffffffffa064e8f0>] ? ldlm_completion_ast+0x0/0x730 [ptlrpc]
      [<ffffffffa0d77a30>] ? mdt_blocking_ast+0x0/0x2a0 [mdt]
      [<ffffffffa0678294>] ? lustre_msg_get_versions+0xa4/0x120 [ptlrpc]
      [<ffffffffa0e7f027>] ? cml_unlink+0x97/0x200 [cmm]
      [<ffffffffa0d93b2f>] ? mdt_version_get_save+0x8f/0xd0 [mdt]
      [<ffffffffa0d957b4>] ? mdt_reint_unlink+0x634/0x9e0 [mdt]
      [<ffffffffa0d92b51>] ? mdt_reint_rec+0x41/0xe0 [mdt]
      [<ffffffffa0d8c3aa>] ? mdt_reint_internal+0x50a/0x810 [mdt]
      [<ffffffffa0d8c6f4>] ? mdt_reint+0x44/0xe0 [mdt]
      [<ffffffffa0d7e2a2>] ? mdt_handle_common+0x922/0x1740 [mdt]
      [<ffffffffa0d7f195>] ? mdt_regular_handle+0x15/0x20 [mdt]
      [<ffffffffa06858a2>] ? ptlrpc_server_handle_request+0x412/0xeb0 [ptlrpc]
      [<ffffffffa03a465e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
      [<ffffffffa03b4daf>] ? lc_watchdog_touch+0x6f/0x180 [libcfs]
      [<ffffffffa067e6d2>] ? ptlrpc_wait_event+0xb2/0x2c0 [ptlrpc]
      [<ffffffff81051ba3>] ? __wake_up+0x53/0x70
      [<ffffffffa0686b17>] ? ptlrpc_main+0x7d7/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c14a>] ? child_rip+0xa/0x20
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffffa0686340>] ? ptlrpc_main+0x0/0x1610 [ptlrpc]
      [<ffffffff8100c140>] ? child_rip+0x0/0x20
      Initializing cgroup subsys cpuset
      Initializing cgroup subsys cpu

            wc-triage WC Triage
            sarah Sarah Liu
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Created:
              Updated:
              Resolved: