Details
-
Bug
-
Resolution: Duplicate
-
Critical
-
None
-
None
-
MDS node, Lustre 2.4.2-14chaos, ZFS OBD
-
3
-
15383
Description
After upgrading to Lustre 2.4.2-14chaos (see github.com/chaos/lustre), we soon hit the following assertion on one of our MDS nodes:
mdt_handler.c:3652:mdt_intent_lock_replace()) ASSERTION( new_lock->l_readers + new_lock->l_writers == 0 ) failed
Perhaps most significantly, this tag of our lustre tree includes the patch entitled:
LU-4584 mdt: ensure orig lock is found in hash upon resend
James Simmons reported this assertion when he tested the LU-4584 patch, but Bruno's evaluation was that the assertion was unrelated to the patch.
Whether it is related or not, we need to fix the problem.
James, the "crash" tool claims "WARNING: kernel version inconsistency between vmlinux and dumpfile" and gives up due to further errors ...
Are you sure that at the time of the crash you were running the "2.6.32-431.17.1.el6.wc" kernel version for which you provided the debuginfo RPMs? BTW, strings extracted from vmcore and vmcore-dmesg.txt report "2.6.32-358.23.2.el6.atlas"...
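For anyone wanting to repeat the version check above before loading a dump into "crash", here is a minimal sketch in Python. It assumes the dump is uncompressed, so that the kernel banner ("Linux version ...") is present as plain text, which is what the strings output above suggests. The names kernel_releases and release_matches are hypothetical helpers written for this illustration and are not part of crash or Lustre.

```python
import mmap
import re

# Kernel banner text, e.g. b"Linux version 2.6.32-358.23.2.el6.atlas (...)".
# The capture group stops at the first character that cannot be part of a
# release string (normally the space before the build host).
BANNER = re.compile(rb"Linux version (\d+\.\d+\.\d+[\w.\-]*)")


def kernel_releases(path):
    """Return the set of kernel release strings found in banners in a file.

    Assumption: the file is non-empty and stores the banner as plain text
    (an uncompressed vmcore, or vmcore-dmesg.txt).
    """
    with open(path, "rb") as f:
        # Map the file read-only so a multi-gigabyte dump is not read into RAM.
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as data:
            return {m.group(1).decode("ascii") for m in BANNER.finditer(data)}


def release_matches(path, expected_release):
    """True if the release the debuginfo was built for appears in the dump."""
    return expected_release in kernel_releases(path)
```

Usage would look like release_matches("vmcore", "2.6.32-431.17.1.el6.wc"); a False result would be consistent with the inconsistency warning quoted above. A compressed dump would need decompressing first, since the banner would not be visible as text.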