Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10889

Inconsistent request deadline between client and server.

Details

    • Bug
    • Resolution: Fixed
    • Major
    • Lustre 2.12.0
    • None
    • 3
    • 9223372036854775807

    Description

      A client can resend requests and a server have ability to find a duplicate requests. In this case a server drops the request. As a result the client request deadline is different than the deadline for same request at server.
      For example

      00000100:00080000:11.1F:1468964870.796746:0:27071:0:(service.c:1556:ptlrpc_server_check_resend_in_progress()) @@@ Found duplicate req in processing  req@ffff880d9168b080 x1539226900119556/t0(0) o101->08998e08-887e-a620-6fb3-36cb6d9403ee@2145@gni1:-1/-1 lens 576/0 e 0 to 0 dl 1468965375 ref 1 fl New:/2/ffffffff rc 0/-1
      00000100:00080000:11.1:1468964870.796752:0:27071:0:(service.c:1557:ptlrpc_server_check_resend_in_progress()) @@@ Request being processed  req@ffff880cd7f5dcc0 x1539226900119556/t0(0) o101->08998e08-887e-a620-6fb3-36cb6d9403ee@2145@gni1:-1/-1 lens 576/0 e 0 to 0 dl 1468965030 ref 1 fl New:/0/ffffffff rc 0/-1
      

      At client request x1539226900119556 has deadline 1468965375, at server 1468965030. So, in the worst case the client will wait for 345sec long than server for the request.
       

      Attachments

        Activity

          [LU-10889] Inconsistent request deadline between client and server.
          pjones Peter Jones added a comment -

          Landed for 2.12

          pjones Peter Jones added a comment - Landed for 2.12

          Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/31910/
          Subject: LU-10889 ptlrpc: update req timeout if resending happened
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: b4603a9e81239b4e6021c640c1d24e4ed8f8fc4b

          gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/31910/ Subject: LU-10889 ptlrpc: update req timeout if resending happened Project: fs/lustre-release Branch: master Current Patch Set: Commit: b4603a9e81239b4e6021c640c1d24e4ed8f8fc4b

          This is only an issue if the server fails, and the client does not get IR notification of this and waits longer before a resend?

          On a related note (which is what I'd thought your patch was fixing), if you are working on this area of code you might consider to fix LU-8750.

          adilger Andreas Dilger added a comment - This is only an issue if the server fails, and the client does not get IR notification of this and waits longer before a resend? On a related note (which is what I'd thought your patch was fixing), if you are working on this area of code you might consider to fix LU-8750 .

          Alexandr Boyko (c17825@cray.com) uploaded a new patch: https://review.whamcloud.com/31910
          Subject: LU-10889 ptlrpc: update req timeout if resending happened
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: 745ac133737b43e88c8bdbaa250850a46de8116c

          gerrit Gerrit Updater added a comment - Alexandr Boyko (c17825@cray.com) uploaded a new patch: https://review.whamcloud.com/31910 Subject: LU-10889 ptlrpc: update req timeout if resending happened Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 745ac133737b43e88c8bdbaa250850a46de8116c

          People

            aboyko Alexander Boyko
            aboyko Alexander Boyko
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: