Details
-
Bug
-
Resolution: Fixed
-
Major
-
Lustre 2.4.0, Lustre 2.1.4
-
RHEL6 Update 3
-
3
-
6918
Description
In a fresh install of RHEL 6 Update 3 (x86_64)
1 Physical server (1 MDT / 8 OST). Local Mount (over eth0) and 1 export NFS
I have a Kernel Panic :
crash> log LustreError: 4085:0:(mdt_reint.c:916:mdt_pdir_hash_lock()) ASSERTION( lh->mlh_pdo_hash != 0 ) failed: LustreError: 4085:0:(mdt_reint.c:916:mdt_pdir_hash_lock()) LBUG Pid: 4085, comm: mdt_04 Call Trace: [<ffffffffa03347f5>] libcfs_debug_dumpstack+0x55/0x80 [libcfs] [<ffffffffa0334e07>] lbug_with_loc+0x47/0xb0 [libcfs] [<ffffffffa0c65c6b>] mdt_reint_rename+0x1acb/0x1d70 [mdt] [<ffffffffa05a5520>] ? ldlm_blocking_ast+0x0/0x180 [ptlrpc] [<ffffffffa05a68c0>] ? ldlm_completion_ast+0x0/0x720 [ptlrpc] [<ffffffffa034b18e>] ? upcall_cache_get_entry+0x28e/0x944 [libcfs] [<ffffffffa0c5c4ec>] ? mdt_root_squash+0x2c/0x3e0 [mdt] [<ffffffffa0c60c51>] mdt_reint_rec+0x41/0xe0 [mdt] [<ffffffffa0c57ed4>] mdt_reint_internal+0x544/0x8e0 [mdt] [<ffffffffa0c582b4>] mdt_reint+0x44/0xe0 [mdt] [<ffffffffa0c4c772>] mdt_handle_common+0x932/0x1750 [mdt] [<ffffffffa0c4d665>] mdt_regular_handle+0x15/0x20 [mdt] [<ffffffffa05ddc5e>] ptlrpc_main+0xc4e/0x1a40 [ptlrpc] [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc] [<ffffffff8100c14a>] child_rip+0xa/0x20 [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc] [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc] [<ffffffff8100c140>] ? child_rip+0x0/0x20 Kernel panic - not syncing: LBUG Pid: 4085, comm: mdt_04 Not tainted 2.6.32-279.14.1.el6_lustre.x86_64 #1 Call Trace: [<ffffffff814fdcba>] ? panic+0xa0/0x168 [<ffffffffa0334e5b>] ? lbug_with_loc+0x9b/0xb0 [libcfs] [<ffffffffa0c65c6b>] ? mdt_reint_rename+0x1acb/0x1d70 [mdt] [<ffffffffa05a5520>] ? ldlm_blocking_ast+0x0/0x180 [ptlrpc] [<ffffffffa05a68c0>] ? ldlm_completion_ast+0x0/0x720 [ptlrpc] [<ffffffffa034b18e>] ? upcall_cache_get_entry+0x28e/0x944 [libcfs] [<ffffffffa0c5c4ec>] ? mdt_root_squash+0x2c/0x3e0 [mdt] [<ffffffffa0c60c51>] ? mdt_reint_rec+0x41/0xe0 [mdt] [<ffffffffa0c57ed4>] ? mdt_reint_internal+0x544/0x8e0 [mdt] [<ffffffffa0c582b4>] ? mdt_reint+0x44/0xe0 [mdt] [<ffffffffa0c4c772>] ? mdt_handle_common+0x932/0x1750 [mdt] [<ffffffffa0c4d665>] ? mdt_regular_handle+0x15/0x20 [mdt] [<ffffffffa05ddc5e>] ? ptlrpc_main+0xc4e/0x1a40 [ptlrpc] [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc] [<ffffffff8100c14a>] ? child_rip+0xa/0x20 [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc] [<ffffffffa05dd010>] ? ptlrpc_main+0x0/0x1a40 [ptlrpc] [<ffffffff8100c140>] ? child_rip+0x0/0x20
crash> bt -l PID: 4085 TASK: ffff8806283c0040 CPU: 6 COMMAND: "mdt_04" #0 [ffff8805ea1d59b8] machine_kexec at ffffffff8103284b /usr/src/debug/kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/arch/x86/kernel/machine_kexec_64.c: 336 #1 [ffff8805ea1d5a18] crash_kexec at ffffffff810ba982 /usr/src/debug/kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/kernel/kexec.c: 1106 #2 [ffff8805ea1d5ae8] panic at ffffffff814fdcc1 /usr/src/debug/kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/kernel/panic.c: 103 #3 [ffff8805ea1d5b68] lbug_with_loc at ffffffffa0334e5b [libcfs] #4 [ffff8805ea1d5b88] mdt_reint_rename at ffffffffa0c65c6b [mdt] #5 [ffff8805ea1d5cd8] mdt_reint_rec at ffffffffa0c60c51 [mdt] #6 [ffff8805ea1d5cf8] mdt_reint_internal at ffffffffa0c57ed4 [mdt] #7 [ffff8805ea1d5d48] mdt_reint at ffffffffa0c582b4 [mdt] #8 [ffff8805ea1d5d68] mdt_handle_common at ffffffffa0c4c772 [mdt] #9 [ffff8805ea1d5db8] mdt_regular_handle at ffffffffa0c4d665 [mdt] #10 [ffff8805ea1d5dc8] ptlrpc_main at ffffffffa05ddc5e [ptlrpc] #11 [ffff8805ea1d5f48] kernel_thread at ffffffff8100c14a /usr/src/debug///////////////////////////////////////////////////////////////////////////////////////////kernel-2.6.32-279.14.1.el6/linux-2.6.32-279.14.1.el6_lustre.x86_64/arch/x86/kernel/entry_64.S: 1213