Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-18899

o2iblnd: graceful handling of RDMA_CM_EVENT_ESTABLISHED

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.17.0
    • Lustre 2.17.0
    • 3
    • 9223372036854775807

    Description

      When handling CM events, o2iblnd expects RDMA_CM_EVENT_ESTABLISHED to be received when the connection is either in IBLND_CONN_PASSIVE_WAIT or IBLND_CONN_ACTIVE_CONNECT state, otherwise LBUG occurs.

      Modify kiblnd_cm_callback() to handle this gracefully: report an error with relevant details and ignore the event rather than just crash.

      This has been reported in the field with RoCE v2 setup.

       

       

      Attachments

        Activity

          [LU-18899] o2iblnd: graceful handling of RDMA_CM_EVENT_ESTABLISHED
          pjones Peter Jones added a comment -

          Merged for 2.17

          pjones Peter Jones added a comment - Merged for 2.17

          "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/58711/
          Subject: LU-18899 o2iblnd: handle RDMA_CM_EVENT_ESTABLISHED gracefully
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: 64d7a7e42022d070cb15e79a0d1655e8df4eb33d

          gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/58711/ Subject: LU-18899 o2iblnd: handle RDMA_CM_EVENT_ESTABLISHED gracefully Project: fs/lustre-release Branch: master Current Patch Set: Commit: 64d7a7e42022d070cb15e79a0d1655e8df4eb33d

          "Serguei Smirnov <ssmirnov@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/58711
          Subject: LU-18899 o2iblnd: handle RDMA_CM_EVENT_ESTABLISHED gracefully
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: ca806d00aaee5c0594baaa1b099a7d6b39f23e5c

          gerrit Gerrit Updater added a comment - "Serguei Smirnov <ssmirnov@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/58711 Subject: LU-18899 o2iblnd: handle RDMA_CM_EVENT_ESTABLISHED gracefully Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: ca806d00aaee5c0594baaa1b099a7d6b39f23e5c

          People

            ssmirnov Serguei Smirnov
            ssmirnov Serguei Smirnov
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: