[LU-10212] operation ost_read to node [*] failed: rc = -116 Created: 08/Nov/17  Updated: 25/Apr/19  Resolved: 06/Mar/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0
Fix Version/s: Lustre 2.11.0, Lustre 2.10.4

Type: Bug Priority: Critical
Reporter: Alexander Boyko Assignee: Alexander Boyko
Resolution: Fixed Votes: 0
Labels: patch

Issue Links:
Related
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

1504054839 2017 Aug 30 03:00:39 m10029 kern info kernel Lustre: lustre02-MDT0005-mdc-ffff880079922000: Connection restored to 10.50.33.92@o2ib (at 10.50.33.92@o2ib)
1504063447 2017 Aug 30 05:24:07 m10029 kern err kernel LustreError: 11-0: lustre01-OST0023-osc-ffff88085f97b000: operation ost_read to node 10.50.32.72@o2ib failed: rc = -116
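
For readers unfamiliar with the return code: rc values in these messages are negated errno numbers, and on Linux errno 116 is ESTALE. A minimal sketch for decoding such a value on the local platform follows; the helper name decode_rc is hypothetical and is not part of Lustre.

```python
import errno
import os


def decode_rc(rc):
    """Return (symbolic name, message) for a negative errno-style return code.

    decode_rc is a hypothetical helper for illustration. The mapping comes
    from the local platform's errno table, so numeric values are only
    meaningful on the same OS family that produced the log (Linux here).
    """
    code = -rc
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


if __name__ == "__main__":
    # rc taken from the log line above.
    print(decode_rc(-116))
```

Run on a Linux node, this reports the symbolic name and the system message for the code from the log.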

Based on the logs, the client held two identical locks. It canceled one of them as a duplicate and then sent I/O. The read request reached the server carrying the handle of the lock that had been canceled. The prolong function did not check the resource lock tree for this case, so the client got an ESTALE error.
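
The failure mode described above can be illustrated with a small model. This is a simplified sketch, not the Lustre implementation: the Lock and Resource classes, the prolong and serve_read functions, the scan_resource flag, and the 30-second extension are all assumptions made for illustration. It shows a read that names a canceled handle failing when only the handle is checked, and succeeding when the server also looks for another matching lock from the same client on the resource.

```python
import errno
from dataclasses import dataclass, field


@dataclass
class Lock:
    handle: int
    client: str
    start: int
    end: int
    timeout: int = 0


@dataclass
class Resource:
    # Hypothetical stand-in for the per-resource lock tree: handle -> Lock.
    locks: dict = field(default_factory=dict)

    def add(self, lock):
        self.locks[lock.handle] = lock

    def cancel(self, handle):
        self.locks.pop(handle, None)


def covers(lock, client, start, end):
    """True if the lock belongs to the client and covers [start, end]."""
    return lock.client == client and lock.start <= start and end <= lock.end


def prolong(res, client, handle, start, end, extra, scan_resource):
    """Extend matching locks and return how many were extended."""
    lock = res.locks.get(handle)
    if lock is not None and covers(lock, client, start, end):
        lock.timeout += extra
        return 1
    if not scan_resource:
        # Behaviour assumed for the bug: the handle is gone, nothing else is tried.
        return 0
    # Behaviour assumed for the fix: fall back to the other locks on the resource.
    count = 0
    for other in res.locks.values():
        if covers(other, client, start, end):
            other.timeout += extra
            count += 1
    return count


def serve_read(res, client, handle, start, end, scan_resource):
    """Return 0 on success, or -ESTALE if no lock could be prolonged."""
    if prolong(res, client, handle, start, end, 30, scan_resource) == 0:
        return -errno.ESTALE
    return 0


if __name__ == "__main__":
    res = Resource()
    res.add(Lock(1, "client-a", 0, 4095))
    res.add(Lock(2, "client-a", 0, 4095))  # identical duplicate
    res.cancel(1)                          # client drops one of the two
    print(serve_read(res, "client-a", 1, 0, 4095, scan_resource=False))
    print(serve_read(res, "client-a", 1, 0, 4095, scan_resource=True))
```

In this model the first call returns the negated ESTALE code because handle 1 no longer exists, while the second call returns 0 because the remaining identical lock (handle 2) is found on the resource and extended.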



 Comments   
Comment by Gerrit Updater [ 08/Nov/17 ]

Alexandr Boyko (c17825@cray.com) uploaded a new patch: https://review.whamcloud.com/29992
Subject: LU-10212 ldlm: fix prolong for destroyed lock
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 7cd90b6c90e20bdbb94c6b7da56f5af5b8674b69

Comment by Gerrit Updater [ 04/Jan/18 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/29992/
Subject: LU-10212 ldlm: fix prolong for destroyed lock
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 0716d51d082e3a81c10529627c895db008ae8e35

Comment by Peter Jones [ 04/Jan/18 ]

Landed for 2.11

Comment by Gerrit Updater [ 31/Jan/18 ]

Alexandr Boyko (c17825@cray.com) uploaded a new patch: https://review.whamcloud.com/31101
Subject: LU-10212 test: ESTALE read
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: f9b45ace6e8b7cac4cc16c9684f338a5bd593acb

Comment by Gerrit Updater [ 20/Feb/18 ]

Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/31354
Subject: LU-10212 ldlm: fix prolong for destroyed lock
Project: fs/lustre-release
Branch: b2_10
Current Patch Set: 1
Commit: a42df8813137fd2e62e5eb76fc7ea9f5cf4d5d6b

Comment by Andreas Dilger [ 20/Feb/18 ]

Reopened to track landing of test patch.

Comment by Gerrit Updater [ 06/Mar/18 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/31101/
Subject: LU-10212 test: ESTALE read
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 6e985684b822324143c686e919fe7a75bb16b440

Comment by Gerrit Updater [ 06/Mar/18 ]

Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/31554
Subject: LU-10212 test: ESTALE read
Project: fs/lustre-release
Branch: b2_10
Current Patch Set: 1
Commit: ae46771de44160ec00b753eaba59ab92d411b455

Comment by Peter Jones [ 06/Mar/18 ]

Test has now landed for 2.11 too

Comment by Gerrit Updater [ 05/Apr/18 ]

John L. Hammond (john.hammond@intel.com) merged in patch https://review.whamcloud.com/31354/
Subject: LU-10212 ldlm: fix prolong for destroyed lock
Project: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: 75010631ddbaef52739d8b5bd8dadf7051a2830f

Comment by Gerrit Updater [ 03/May/18 ]

John L. Hammond (john.hammond@intel.com) merged in patch https://review.whamcloud.com/31554/
Subject: LU-10212 test: ESTALE read
Project: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: f20e7938083a542d35f1e150c23022087d115a14
