Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7236

OST connect and disconnect on demand

Details

    • Improvement
    • Resolution: Fixed
    • Minor
    • Lustre 2.12.0
    • None
    • 9223372036854775807

    Description

      currently Lustre client establishes connections to all targets at mount and tries to keep them alive all the time. at big scale this can lead to lots of extra work at mount, failover.

      Attachments

        Issue Links

          Activity

            [LU-7236] OST connect and disconnect on demand
            pjones Peter Jones added a comment -

            Landed for 2.12

            pjones Peter Jones added a comment - Landed for 2.12

            Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/16682/
            Subject: LU-7236 ptlrpc: idle connections can disconnect
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 5a6ceb664f07812c351786c1043da71ff5027f8c

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/16682/ Subject: LU-7236 ptlrpc: idle connections can disconnect Project: fs/lustre-release Branch: master Current Patch Set: Commit: 5a6ceb664f07812c351786c1043da71ff5027f8c

            there was a discussion at LAD. Developers from Fujitsu said that connections are consuming quite amount of memory. Then we recalled that there are other possible cases where less connection can help - like recovery, cluster startup.

            bzzz Alex Zhuravlev added a comment - there was a discussion at LAD. Developers from Fujitsu said that connections are consuming quite amount of memory. Then we recalled that there are other possible cases where less connection can help - like recovery, cluster startup.
            rread Robert Read added a comment -

            Interesting idea. I remember discussing this ages ago when we made the initial connection asynchronous, but don't recall exactly why we didn't do it. Still sounds like a good idea to me.

            rread Robert Read added a comment - Interesting idea. I remember discussing this ages ago when we made the initial connection asynchronous, but don't recall exactly why we didn't do it. Still sounds like a good idea to me.

            Alex Zhuravlev (alexey.zhuravlev@intel.com) uploaded a new patch: http://review.whamcloud.com/16682
            Subject: LU-7236 ptlrpc: idling connections can disconnect
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 83d9fc0a0a7914174a5373142b183a7bd99be70a

            gerrit Gerrit Updater added a comment - Alex Zhuravlev (alexey.zhuravlev@intel.com) uploaded a new patch: http://review.whamcloud.com/16682 Subject: LU-7236 ptlrpc: idling connections can disconnect Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 83d9fc0a0a7914174a5373142b183a7bd99be70a

            People

              bzzz Alex Zhuravlev
              bzzz Alex Zhuravlev
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: