Details

    • Bug
    • Resolution: Cannot Reproduce
    • Minor
    • None
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      running sanity-benchmark I see this error in dbench subtest very often:

      [  137.575450] Lustre: 6926:0:(client.c:2340:ptlrpc_expire_one_request()) @@@ Request sent has failed due to network error: [sent 1712216405/real 1712216405]  req@ffff9d1ebb9d4980 x1795389298614272/t0(0) o104->lustre-MDT0001@0@lo:15/16 lens 328/224 e 0 to 1 dl 1712216421 ref 1 fl Rpc:ReXQU/0/ffffffff rc 0/-1 job:'' uid:4294967295 gid:4294967295
      

      bisection:

      COMMIT		TESTED	PASSED	FAILED		COMMIT DESCRIPTION
      3d635dd3f2	1	0	1	BAD	LU-17484 gss: reply error for SEC_CTX_INIT on wrong node
      02caf71707	1	0	1	BAD	LU-17258 socklnd: stop connecting on too many retries
      a6886dba0e	5	0	5	BAD	LU-17379 ptlrpc: fix check for callback discard
      adc60b8922	5	5	0	GOOD	LU-17000 utils: Use ssize_t to store return from sysconf()
      004af719b2	5	5	0	GOOD	LU-17507 build: Allow symlink in mofed default path
      099350d6e3	5	5	0	GOOD	LU-17505 socklnd: return NETWORK_TIMEOUT to LNet on ETIMEOUT
      dba4135556	5	5	0	GOOD	LU-17379 lnet: add LNetPeerDiscovered to LNet API
      

      Attachments

        Issue Links

          Activity

            [LU-17704] network error in sanity-benchmark
            pjones Peter Jones added a comment -

            Revert merged for 2.16

            pjones Peter Jones added a comment - Revert merged for 2.16

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/54686/
            Subject: LU-17704 revert: "LU-17379 ptlrpc: fix check for callback discard"
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 394de8d90668c0110257e5cfceca50d9838d606d

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/54686/ Subject: LU-17704 revert: " LU-17379 ptlrpc: fix check for callback discard" Project: fs/lustre-release Branch: master Current Patch Set: Commit: 394de8d90668c0110257e5cfceca50d9838d606d
            asmadeus Dominique Martinet added a comment - - edited

            I couldn't figure out how to link issues in jira so leaving a comment here as well (EDIT: found the button, done!) – I've given more details of the underlying disconnection in LU-17759 before noticing this LU (basically identified this through the same revert)

            Now details are out there's probably a better way of fixing this than the revert, but I'd favor a revert until a better solution is made.

            Thanks!

            asmadeus Dominique Martinet added a comment - - edited I couldn't figure out how to link issues in jira so leaving a comment here as well (EDIT: found the button, done!) – I've given more details of the underlying disconnection in LU-17759 before noticing this LU (basically identified this through the same revert) Now details are out there's probably a better way of fixing this than the revert, but I'd favor a revert until a better solution is made. Thanks!

            "Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/54686
            Subject: LU-17704 revert: "LU-17379 ptlrpc: fix check for callback discard"
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 9bd0d33f4cc159f10fcf9a0f2c884c0b70031a6e

            gerrit Gerrit Updater added a comment - "Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/54686 Subject: LU-17704 revert: " LU-17379 ptlrpc: fix check for callback discard" Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 9bd0d33f4cc159f10fcf9a0f2c884c0b70031a6e

            tappro please check.

            bzzz Alex Zhuravlev added a comment - tappro please check.

            People

              tappro Mikhail Pershin
              bzzz Alex Zhuravlev
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: