Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.10.5
-
None
-
Client: CentOS 7.5 Lustre 2.10.5
Server (Oak): CentOS 7.4 Lustre 2.10.4
-
3
Description
Hi,
We have a type of job that keeps crashing the Lustre 2.10.5 client with the following trace. The job very likely has files open on Oak (Lustre 2.10.4). This looks like the old tickets LU-1414 and LU-118... fixed in Lustre 1.8! The issue happens on a bigmem node (1.5 TB of RAM) and doesn't seem to happen on nodes with less memory. I'll try to upload a crash dump file to your FTP.
[11497.465606] LustreError: 132407:0:(llite_lib.c:2047:ll_delete_inode()) ASSERTION( inode->i_data.nrpages == 0 ) failed: inode=[0x200018e83:0x1ba2c:0x0](ffff8aa85a298510) nrpages=1, see LU-118
[11497.487939] LustreError: 132407:0:(llite_lib.c:2047:ll_delete_inode()) LBUG
[11497.495730] Pid: 132407, comm: spades 3.10.0-862.14.4.el7.x86_64 #1 SMP Wed Sep 26 15:12:11 UTC 2018
[11497.505939] Call Trace:
[11497.508685] [<ffffffffc09947cc>] libcfs_call_trace+0x8c/0xc0 [libcfs]
[11497.516009] [<ffffffffc099487c>] lbug_with_loc+0x4c/0xa0 [libcfs]
[11497.522955] [<ffffffffc0f25c87>] ll_delete_inode+0x1b7/0x1c0 [lustre]
[11497.530291] [<ffffffff8d43c504>] evict+0xb4/0x180
[11497.535663] [<ffffffff8d43ce0c>] iput+0xfc/0x190
[11497.540940] [<ffffffff8d43126e>] do_unlinkat+0x1ae/0x2d0
[11497.546990] [<ffffffff8d432326>] SyS_unlink+0x16/0x20
[11497.552753] [<ffffffff8d92579b>] system_call_fastpath+0x22/0x27
[11497.559484] [<ffffffffffffffff>] 0xffffffffffffffff
[11497.565069] Kernel panic - not syncing: LBUG
[11497.569837] CPU: 7 PID: 132407 Comm: spades Kdump: loaded Tainted: G OE ------------ 3.10.0-862.14.4.el7.x86_64 #1
[11497.582928] Hardware name: Dell Inc. PowerEdge R630/02C2CP, BIOS 2.8.0 005/17/2018
[11497.591379] Call Trace:
[11497.594105] [<ffffffff8d913754>] dump_stack+0x19/0x1b
[11497.599845] [<ffffffff8d90d29f>] panic+0xe8/0x21f
[11497.605211] [<ffffffffc09948cb>] lbug_with_loc+0x9b/0xa0 [libcfs]
[11497.612131] [<ffffffffc0f25c87>] ll_delete_inode+0x1b7/0x1c0 [lustre]
[11497.619421] [<ffffffff8d43c504>] evict+0xb4/0x180
[11497.624775] [<ffffffff8d43ce0c>] iput+0xfc/0x190
[11497.630033] [<ffffffff8d43126e>] do_unlinkat+0x1ae/0x2d0
[11497.636064] [<ffffffff8d42175e>] ? ____fput+0xe/0x10
[11497.641709] [<ffffffff8d2bab90>] ? task_work_run+0xc0/0xe0
[11497.647935] [<ffffffff8d432326>] SyS_unlink+0x16/0x20
[11497.653679] [<ffffffff8d92579b>] system_call_fastpath+0x22/0x27
Thanks,
Stephane
Issue Links
- is related to LU-118 clear_inode: BUG_ON(inode->i_data.nrpages) (Resolved)