[LU-8007] Kernel: LustreError: 191208:0:(vvp_io.c:1086:vvp_io_commit_write()) Write page 82782 of inode ffff8803a8fd06b8 failed -28 Created: 11/Apr/16  Updated: 05/Aug/20  Resolved: 05/Aug/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.5.5
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Cameron Harr Assignee: Nathaniel Clark
Resolution: Duplicate Votes: 0
Labels: llnl
Environment:

TOSS 2.4-7
lustre-2.5.5-3chaos_2.6.32_573.18.1.1chaos.ch5.4.x86_64.x86_64
ZFS 0.6.5.4-1.ch5.4.x86_64


Issue Links:
Related
is related to LU-7510 (vvp_io.c:1088:vvp_io_commit_write())... Resolved
is related to LU-2049 add support for OBD_CONNECT_GRANT_PARAM Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

On April 6, 2016, we noticed one particular file system getting many of the following errors, which corresponded with user job failures:

Kernel: LustreError: 191208:0:(vvp_io.c:1086:vvp_io_commit_write()) Write page 82782 of inode ffff8803a8fd06b8 failed -28

'lfs df' showed the filesystem was only 76% full; however, it also showed that 32 of the 80 OSTs were 89%-90% full. Since deactivating those 32 "near-full" OSTs on 4/6, we haven't seen the problem on that file system.

Consequently, we are now seeing the issue on another file system where OSTs are ~ 90% full.



 Comments   
Comment by Cameron Harr [ 11/Apr/16 ]

Looks very similar to what SNL was seeing in LU-7510.

Comment by Peter Jones [ 12/Apr/16 ]

Nathaniel

Could you please look into the fesaibility of porting the two mentioned patches to the 2.5 FE branch?

Thanks

Peter

Comment by Nathaniel Clark [ 12/Apr/16 ]

The patches for LU-2049 touch a lot of ofd code, but the code was heavily re-factored in the run up to 2.6 (at a minimum would need LU-3467), and this is JUST the ofd directory. This would be a very complex port, and I would deem very risky.

Comment by Christopher Morrone [ 12/Apr/16 ]

What about the second LU-2049 patch?

Comment by Olaf Faaland [ 07/Nov/17 ]

Do you still intend to merge https://review.whamcloud.com/#/c/22567/ to b2_8_fe?  Looks like it got the reviews you asked for.

Generated at Sat Feb 10 02:13:49 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.