[LU-3721] kernel NULL pointer dereference at 0000000000000036 Created: 07/Aug/13 Updated: 14/Nov/13 Resolved: 14/Nov/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.4.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | Mahmoud Hanafi | Assignee: | Hongchao Zhang |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Severity: | 2 |
| Rank (Obsolete): | 9581 |
| Description |
|
System crash after reboot/remount of osts. We have crash dump and can upload if needed. ustre: nbp7-OST002b: deleting orphan objects from 0x0:131250 to 0x0:131297 Entering kdb (current=0xffff881f9c5d0040, pid 24479) on processor 30 Oops: (null) |
| Comments |
| Comment by Jay Lan (Inactive) [ 07/Aug/13 ] |
|
Our source is at https://github.com/jlan/lustre-nas |
| Comment by Peter Jones [ 07/Aug/13 ] |
|
Hongchao Could you please help with this one? Thanks Peter |
| Comment by Hongchao Zhang [ 08/Aug/13 ] |
|
could you please print the line # at osd_write_commit+0x2ff/0x610. |
| Comment by Andreas Dilger [ 08/Aug/13 ] |
|
The WARNING is that there was a page alloc request with order > 1, but GFP_NOWARN. It is possible that the page allocation failed and the error handling is not expecting this? |
| Comment by Jay Lan (Inactive) [ 13/Aug/13 ] |
|
It crashed at line 842 of lustre/osd-ldiskfs/osd_io.c: static int osd_write_commit(const struct lu_env ... else if (iobuf->dr_npages > 0) { rc = osd->od_fsops->fs_map_inode_pages(inode, iobuf->dr_pages, iobuf->dr_npages, iobuf->dr_blocks, 1, NULL); } else { It was this line: /usr/src/redhat/BUILD/lustre-2.4.0/lustre/osd-ldiskfs/osd_io.c:842 where the content of rax was fffffffffffffffe RIP: ffffffffa0c3af3f RSP: ffff881f99acb8e0 RFLAGS: 00010246 Andreas, there was plenty of memory at the system: TOTAL SWAP 500013 1.9 GB ---- |
| Comment by Hongchao Zhang [ 13/Aug/13 ] |
|
this should be fixed in |
| Comment by Jay Lan (Inactive) [ 13/Aug/13 ] |
|
Yep! Looks like it! Crash at the same place. |
| Comment by Hongchao Zhang [ 19/Aug/13 ] |
|
Hi Jay, |
| Comment by Mahmoud Hanafi [ 14/Nov/13 ] |
|
Patch applied and we haven't see this issue. Please close case |
| Comment by Peter Jones [ 14/Nov/13 ] |
|
ok - thanks Mahmoud. |