[LU-6386] lower transno may overwrite the bigger one in client last_rcvd slot Created: 19/Mar/15 Updated: 19/Mar/19 Resolved: 14/Oct/15 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.8.0 |
| Type: | Bug | Priority: | Critical |
| Reporter: | Mikhail Pershin | Assignee: | Mikhail Pershin |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||||||||||
| Severity: | 3 | ||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
The tgt_last_rcvd_update() may update transaction number in client slot with smaller value. That may cause client to see wrong error message: LustreError: 3662:0:(import.c:985:ptlrpc_connect_interpret()) pfs2dat1-OST000a_UUID went back in time (transno 77492683762 was previously committed, server now claims 77492683761)! See https://bugzilla.lustre.org/show_bug.cgi?id=9646 This is possible on OST with several IO requests from one client and safe but it is better to keep transaction value in last_rcvd monotonically increased to avoid confusion on client side. |
| Comments |
| Comment by Gerrit Updater [ 19/Mar/15 ] |
|
Mike Pershin (mike.pershin@intel.com) uploaded a new patch: http://review.whamcloud.com/14113 |
| Comment by Andreas Dilger [ 24/Jun/15 ] |
|
It would also be good to change the error message not to reference bugzilla, but instead an LU ticket. |
| Comment by Supporto Lustre Jnet2000 (Inactive) [ 03/Jul/15 ] |
|
Please could you give us a link where we can download this patch RPM? Regards |
| Comment by Gerrit Updater [ 14/Oct/15 ] |
|
Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/14113/ |
| Comment by Joseph Gmitter (Inactive) [ 14/Oct/15 ] |
|
Landed for 2.8 |