Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-2856

Kernel panic - not syncing: LBUG - ASSERTION( lh->mlh_pdo_hash != 0 )

Details

    • Bug
    • Resolution: Fixed
    • Major
    • Lustre 2.4.0
    • Lustre 2.4.0, Lustre 2.1.4
    • RHEL6 Update 3
    • 3
    • 6918

    Description

      In a fresh install of RHEL 6 Update 3 (x86_64)

      1 Physical server (1 MDT / 8 OST). Local Mount (over eth0) and 1 export NFS

      I have a Kernel Panic :

      crash> log
      LustreError: 4085:0:(mdt_reint.c:916:mdt_pdir_hash_lock()) ASSERTION( lh->mlh_pdo_hash != 0 ) failed:
      LustreError: 4085:0:(mdt_reint.c:916:mdt_pdir_hash_lock()) LBUG
      Pid: 4085, comm: mdt_04
      
      Call Trace:
       [<ffffffffa03347f5>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
       [<ffffffffa0334e07>] lbug_with_loc+0x47/0xb0 [libcfs]
       [<ffffffffa0c65c6b>] mdt_reint_rename+0x1acb/0x1d70 [mdt]
       [<ffffffffa05a5520>] ? ldlm_blocking_ast+0x0/0x180 [ptlrpc]
       [<ffffffffa05a68c0>] ? ldlm_completion_ast+0x0/0x720 [ptlrpc]
       [<ffffffffa034b18e>] ? upcall_cache_get_entry+0x28e/0x944 [libcfs]
       [<ffffffffa0c5c4ec>] ? mdt_root_squash+0x2c/0x3e0 [mdt]
       [<ffffffffa0c60c51>] mdt_reint_rec+0x41/0xe0 [mdt]
       [<ffffffffa0c57ed4>] mdt_reint_internal+0x544/0x8e0 [mdt]
       [<ffffffffa0c582b4>] mdt_reint+0x44/0xe0 [mdt]
       [<ffffffffa0c4c772>] mdt_handle_common+0x932/0x1750 [mdt]
       [<ffffffffa0c4d665>] mdt_regular_handle+0x15/0x20 [mdt]
       [<ffffffffa05ddc5e>] ptlrpc_main+0xc4e/0x1a40 [ptlrpc]
       [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc]
       [<ffffffff8100c14a>] child_rip+0xa/0x20
       [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc]
       [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc]
       [<ffffffff8100c140>] ? child_rip+0x0/0x20
      
      Kernel panic - not syncing: LBUG
      Pid: 4085, comm: mdt_04 Not tainted 2.6.32-279.14.1.el6_lustre.x86_64 #1
      Call Trace:
       [<ffffffff814fdcba>] ? panic+0xa0/0x168
       [<ffffffffa0334e5b>] ? lbug_with_loc+0x9b/0xb0 [libcfs]
       [<ffffffffa0c65c6b>] ? mdt_reint_rename+0x1acb/0x1d70 [mdt]
       [<ffffffffa05a5520>] ? ldlm_blocking_ast+0x0/0x180 [ptlrpc]
       [<ffffffffa05a68c0>] ? ldlm_completion_ast+0x0/0x720 [ptlrpc]
       [<ffffffffa034b18e>] ? upcall_cache_get_entry+0x28e/0x944 [libcfs]
       [<ffffffffa0c5c4ec>] ? mdt_root_squash+0x2c/0x3e0 [mdt]
       [<ffffffffa0c60c51>] ? mdt_reint_rec+0x41/0xe0 [mdt]
       [<ffffffffa0c57ed4>] ? mdt_reint_internal+0x544/0x8e0 [mdt]
       [<ffffffffa0c582b4>] ? mdt_reint+0x44/0xe0 [mdt]
       [<ffffffffa0c4c772>] ? mdt_handle_common+0x932/0x1750 [mdt]
       [<ffffffffa0c4d665>] ? mdt_regular_handle+0x15/0x20 [mdt]
       [<ffffffffa05ddc5e>] ? ptlrpc_main+0xc4e/0x1a40 [ptlrpc]
       [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc]
       [<ffffffff8100c14a>] ? child_rip+0xa/0x20
       [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc]
       [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc]
       [<ffffffff8100c140>] ? child_rip+0x0/0x20
      
      crash> bt -l
      PID: 4085   TASK: ffff8806283c0040  CPU: 6   COMMAND: "mdt_04"
       #0 [ffff8805ea1d59b8] machine_kexec at ffffffff8103284b
          /usr/src/debug/kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/arch/x86/kernel/machine_kexec_64.c: 336
       #1 [ffff8805ea1d5a18] crash_kexec at ffffffff810ba982
          /usr/src/debug/kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/kernel/kexec.c: 1106
       #2 [ffff8805ea1d5ae8] panic at ffffffff814fdcc1
          /usr/src/debug/kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/kernel/panic.c: 103
       #3 [ffff8805ea1d5b68] lbug_with_loc at ffffffffa0334e5b [libcfs]
       #4 [ffff8805ea1d5b88] mdt_reint_rename at ffffffffa0c65c6b [mdt]
       #5 [ffff8805ea1d5cd8] mdt_reint_rec at ffffffffa0c60c51 [mdt]
       #6 [ffff8805ea1d5cf8] mdt_reint_internal at ffffffffa0c57ed4 [mdt]
       #7 [ffff8805ea1d5d48] mdt_reint at ffffffffa0c582b4 [mdt]
       #8 [ffff8805ea1d5d68] mdt_handle_common at ffffffffa0c4c772 [mdt]
       #9 [ffff8805ea1d5db8] mdt_regular_handle at ffffffffa0c4d665 [mdt]
      #10 [ffff8805ea1d5dc8] ptlrpc_main at ffffffffa05ddc5e [ptlrpc]
      #11 [ffff8805ea1d5f48] kernel_thread at ffffffff8100c14a
          /usr/src/debug///////////////////////////////////////////////////////////////////////////////////////////kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/arch/x86/kernel/entry_64.S: 1213
      

      Attachments

        Activity

          People

            cliffw Cliff White (Inactive)
            julien.paret Julien Paret
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: