Details

    • 3
    • 9223372036854775807

    Description

      crash>kib_conn_t.ibc_peer,ibc_refcount,ibc_list,ibc_state,ibc_comms_error,ibc_tx_queue_nocred 0xffff8808c0612e00
        ibc_peer = 0xffff880f0e2a2780
        ibc_refcount = {
          counter = 1
        }
        ibc_list = {
          next = 0xdead000000100100, 
          prev = 0xdead000000200200
        }
        ibc_state = 5
        ibc_comms_error = -12
        ibc_tx_queue_nocred = {
          next = 0xffffc900200de3e0, 
          prev = 0xffffc900200de3e0
        }
      

      A tx was queued by a race into ibc_tx_queue_nocred while disconnecting connection. So the connection stays in unused state but it can't be destroyed because it is referenced by the tx.
      It results in 

      [18891797.073780] Lustre: 1572:0:(niobuf.c:292:ptlrpc_abort_bulk()) Unexpectedly long timeout: desc ffff880a4eaf1c00
      

      Attachments

        Activity

          [LU-11756] kib_conn leak

          Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/36347/
          Subject: LU-11756 o2iblnd: kib_conn leak
          Project: fs/lustre-release
          Branch: b2_12
          Current Patch Set:
          Commit: bb9644c360f4d54d2a2568f94ba8ae94489a873f

          gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/36347/ Subject: LU-11756 o2iblnd: kib_conn leak Project: fs/lustre-release Branch: b2_12 Current Patch Set: Commit: bb9644c360f4d54d2a2568f94ba8ae94489a873f

          Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/36347
          Subject: LU-11756 o2iblnd: kib_conn leak
          Project: fs/lustre-release
          Branch: b2_12
          Current Patch Set: 1
          Commit: d69377151754b5497e332fe3102dc581fd970336

          gerrit Gerrit Updater added a comment - Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/36347 Subject: LU-11756 o2iblnd: kib_conn leak Project: fs/lustre-release Branch: b2_12 Current Patch Set: 1 Commit: d69377151754b5497e332fe3102dc581fd970336
          pjones Peter Jones added a comment -

          Landed for 2.13

          pjones Peter Jones added a comment - Landed for 2.13

          Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33828/
          Subject: LU-11756 o2iblnd: kib_conn leak
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: a155c3fca38d2a3092f9b5d116ad7877d51d1db1

          gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/33828/ Subject: LU-11756 o2iblnd: kib_conn leak Project: fs/lustre-release Branch: master Current Patch Set: Commit: a155c3fca38d2a3092f9b5d116ad7877d51d1db1

          Its in master-next so it should be landing soon

          simmonsja James A Simmons added a comment - Its in master-next so it should be landing soon
          hornc Chris Horn added a comment -

          I'm hitting this pretty regularly with master

          hornc Chris Horn added a comment - I'm hitting this pretty regularly with master

          Andriy Skulysh (c17819@cray.com) uploaded a new patch: https://review.whamcloud.com/33828
          Subject: LU-11756 o2iblnd: kib_conn leak
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: 23da5af6f503bda38e70cbf8d82922676ff9bf0c

          gerrit Gerrit Updater added a comment - Andriy Skulysh (c17819@cray.com) uploaded a new patch: https://review.whamcloud.com/33828 Subject: LU-11756 o2iblnd: kib_conn leak Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 23da5af6f503bda38e70cbf8d82922676ff9bf0c

          People

            askulysh Andriy Skulysh
            askulysh Andriy Skulysh
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: