Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.10.0, Lustre 2.11.0
-
None
-
3
-
9223372036854775807
Description
During large IOR testing with network fails introduced, Cray found a data corruption issues.
first issue is related to the 4MB BRW patchset and exist for long time. Bulk will be marked as failed just with real network error, but if one parts of data was lost and request timeout will treat as transfer done.
second issue is related with cleanup landed as commit 49d8a7ccd73 where "rc" parameter of obd_commit function was replaced with local data, it horror any errors before it.
Alexandr Boyko (c17825@cray.com) uploaded a new patch: https://review.whamcloud.com/35571
Subject:
LU-11169ptlrpc: handle reply and resend reorderProject: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 88c2a9a0c840d01d50762a71f04319e58c9affef