
o2iblnd: graceful handling of CM_EVENT_UNREACHABLE on established connection

    Description

      RoCE setups in the field have shown that CM_EVENT_UNREACHABLE may be received when the connection is already in the ESTABLISHED state.

      This causes the assertion in kiblnd_cm_callback() to fail:

       ASSERTION( conn->ibc_state != 3 && conn->ibc_state != 0 ) failed:

      The proposal is to handle this more gracefully: report the event as unexpected and allow the flow to continue. If the connection really does have problems, it is expected to report transaction errors and be cleaned up without crashing the whole system.
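
The proposed behaviour can be illustrated with a small user-space sketch. This is not the landed patch: the type, the function, the state names and the value 1 for a connecting state are all hypothetical, and only the value 3 for the established state is taken from the assertion message above. The error code returned for a connection that is still being set up is likewise an assumption.

```c
#include <errno.h>
#include <stdio.h>

/* Hypothetical stand-ins for the o2iblnd connection states. Only the
 * value 3 (established) comes from the assertion message; the names and
 * the value 1 are assumptions made for this sketch. */
enum sketch_conn_state {
	SKETCH_CONN_CONNECTING  = 1,
	SKETCH_CONN_ESTABLISHED = 3,
};

struct sketch_conn {
	enum sketch_conn_state state;
	int unexpected_events;	/* events reported as unexpected */
};

/* Returns 0 when the event is only reported and the connection is left
 * untouched, or a negative errno when the pending connect should fail. */
static int sketch_handle_unreachable(struct sketch_conn *conn)
{
	if (conn->state == SKETCH_CONN_ESTABLISHED) {
		/* This combination used to trip the assertion. Report it
		 * and carry on; a genuinely broken connection is expected
		 * to surface through later transaction errors. */
		fprintf(stderr, "UNREACHABLE in unexpected state %d\n",
			(int)conn->state);
		conn->unexpected_events++;
		return 0;
	}

	/* Still connecting: fail the connect attempt (assumed errno). */
	return -ENETDOWN;
}
```

Calling sketch_handle_unreachable() on a connection in the established state logs the event and returns 0 without changing the state, while a connection that is still being set up gets a negative error code back.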

          Activity

            [LU-17325] o2iblnd: graceful handling of CM_EVENT_UNREACHABLE on established connection

            "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/56520
            Subject: LU-17325 o2iblnd: CM_EVENT_UNREACHABLE on established conn
            Project: fs/lustre-release
            Branch: b2_15
            Current Patch Set: 1
            Commit: 2662418d67bf70025c55898e2bde2cb5fc2a3559

            pjones Peter Jones added a comment -

            Landed for 2.16


            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/53298/
            Subject: LU-17325 o2iblnd: CM_EVENT_UNREACHABLE on established conn
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: f7051f0092b19416ed86d7f4bbfe1cba7bb74c02


            "Serguei Smirnov <ssmirnov@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/53298
            Subject: LU-17325 o2iblnd: CM_EVENT_UNREACHABLE on established conn
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: cbde71bf893dba0de752a190c3b16d653ef75085


            People

              ssmirnov Serguei Smirnov