Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-15885

o2iblnd: RDMA_CM_EVENT_UNREACHABLE may be received after conn clean-up

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.16.0
    • None
    • 3
    • 9223372036854775807

    Description

      There's a scenario when IB port going down triggers the following assertion:

        case RDMA_CM_EVENT_UNREACHABLE:
                      conn = cmid->context;
                      LASSERT(conn->ibc_state == IBLND_CONN_ACTIVE_CONNECT ||
                              conn->ibc_state == IBLND_CONN_PASSIVE_WAIT);

      Because connection is already disconnected due to an earlier "RDMA Timeout".

      Since it appears to be possible to get RDMA_CM_EVENT_UNREACHABLE after having decided to close the connection, this code should be changed.

      Attachments

        Issue Links

          Activity

            People

              ssmirnov Serguei Smirnov
              ssmirnov Serguei Smirnov
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: