[LU-4886] Kernel Panic "cl_lock_put" Created: 12/Apr/14 Updated: 14/Apr/14 Resolved: 12/Apr/14 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.5 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | Rustem Bikboulatov | Assignee: | WC Triage |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Linux 2.6.32-279.19.1.el6_lustre.x86_64 #1 SMP |
||
| Attachments: |
|
| Severity: | 3 |
| Rank (Obsolete): | 13522 |
| Description |
|
We have a kernel crash on Lustre Client 2.1.5 with the following log: [root@r01 ~]# crash /usr/lib/debug/lib/modules/2.6.32-279.19.1.el6_lustre.x86_64/vmlinux /var/crash/127.0.0.1-2014-04-11-17\:36\:14/vmcore crash 6.0.4-2.el6 GNU gdb (GDB) 7.3.1 KERNEL: /usr/lib/debug/lib/modules/2.6.32-279.19.1.el6_lustre.x86_64/vmlinux crash> log ... Pid: 28331, comm: ldlm_bl_00 Not tainted 2.6.32-279.19.1.el6_lustre.x86_64 #1 Supermicro X8DTH-i/6/iF/6F/X8DTH Cluster configuration: Lustre Server MGS/MDS - mmp-2 (refer to the diagram "20140113 - Hardware Diagram v0.1_R3.gif" in attachment) Environment: Mount points: MGS/MDS: Clients (r01, r02, r03, r04, mmp-1, vn-1): Clients (cln01, cln02, cln03, cln04): Stripe config: kdump config: We have a crash dump file, and if you need it for analysis, we are ready to upload it. |
| Comments |
| Comment by Jinshan Xiong (Inactive) [ 12/Apr/14 ] |
|
Can you please try patch in |
| Comment by Jinshan Xiong (Inactive) [ 12/Apr/14 ] |
| Comment by Rustem Bikboulatov [ 12/Apr/14 ] |
|
Yes, I have seen a patch 1) This patch changes the two procedures: cl_lock_delete0 Trace log in <4> [997.881412] [<ffffffffa05c55b5>] cl_lock_delete0 +0 xb5/0x1d0 [obdclass] In my case the trace log contains no procedure "cl_lock_delete0", and contains other procedures: [<ffffffffa04db225>] cl_lock_cancel0 +0 x75/0x160 [obdclass] Is patch 2) Can I install the patch |
| Comment by Jinshan Xiong (Inactive) [ 13/Apr/14 ] |
|
I think the problem you met was due to referring a freed lock, which is exactly the patch in You don't need this patch on server side, and yes, you can pick a few clients to apply the patch and upgrade all other clients only if it works. |
| Comment by Rustem Bikboulatov [ 13/Apr/14 ] |
|
When I try to compile lustre 2.1.5 with patch ====================================================== Making all in obdclass RPM build errors: =========================================================================== It seems that a "bool" type is not defined |
| Comment by Rustem Bikboulatov [ 14/Apr/14 ] |
|
In addition i want to say that we got the new kernel crash error (on server r04), which has a very similar trace log: |