[LU-8007] Kernel: LustreError: 191208:0:(vvp_io.c:1086:vvp_io_commit_write()) Write page 82782 of inode ffff8803a8fd06b8 failed -28 Created: 11/Apr/16 Updated: 05/Aug/20 Resolved: 05/Aug/20 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.5.5 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Cameron Harr | Assignee: | Nathaniel Clark |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | llnl | ||
| Environment: |
TOSS 2.4-7 |
||
| Issue Links: |
|
||||||||||||
| Severity: | 3 | ||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
On April 6, 2016, we noticed one particular file system getting many of the following errors, which corresponded with user job failures: Kernel: LustreError: 191208:0:(vvp_io.c:1086:vvp_io_commit_write()) Write page 82782 of inode ffff8803a8fd06b8 failed -28 'lfs df' showed the filesystem was only 76% full; however, it also showed that 32 of the 80 OSTs were 89%-90% full. Since deactivating those 32 "near-full" OSTs on 4/6, we haven't seen the problem on that file system. Consequently, we are now seeing the issue on another file system where OSTs are ~ 90% full. |
| Comments |
| Comment by Cameron Harr [ 11/Apr/16 ] |
|
Looks very similar to what SNL was seeing in |
| Comment by Peter Jones [ 12/Apr/16 ] |
|
Nathaniel Could you please look into the fesaibility of porting the two mentioned patches to the 2.5 FE branch? Thanks Peter |
| Comment by Nathaniel Clark [ 12/Apr/16 ] |
|
The patches for |
| Comment by Christopher Morrone [ 12/Apr/16 ] |
|
What about the second |
| Comment by Olaf Faaland [ 07/Nov/17 ] |
|
Do you still intend to merge https://review.whamcloud.com/#/c/22567/ to b2_8_fe? Looks like it got the reviews you asked for. |