Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8271

nodemap: retrying a large configuration transfer should have a delay

Details

    • Improvement
    • Resolution: Fixed
    • Major
    • Lustre 2.11.0
    • Lustre 2.9.0
    • None
    • 9223372036854775807

    Description

      In order to avoid thrashing the MGS during a bulk configuration update, the nodemap config clients should delay before retrying a config get.

      When a nodemap is larger than a single RPC, clients need to use multiple RPCs to get the nodemap config. If the config changes between RPCs, the client needs to drop the config using the previous RPCs and restart the transfer. If there are many configuration changes occurring, it's possible that a config get could be restarted multiple times, causing unnecessary load. The config get clients should wait some time before restarting the transfer, to allow the server to finish updating its config.

      It may be possible to re-enqueue the config lock to have the main MGC lock thread restart the transfer, which would add a random delay of between 5-10s.

      Attachments

        Activity

          [LU-8271] nodemap: retrying a large configuration transfer should have a delay
          pjones Peter Jones added a comment -

          Landed for 2.11

          pjones Peter Jones added a comment - Landed for 2.11

          Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/26781/
          Subject: LU-8271 nodemap: wait before getting large conf if changed
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: f75631655890260549b12233589ee4b2074f20ce

          gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/26781/ Subject: LU-8271 nodemap: wait before getting large conf if changed Project: fs/lustre-release Branch: master Current Patch Set: Commit: f75631655890260549b12233589ee4b2074f20ce
          pjones Peter Jones added a comment -

          Emoly

          Can you please follow up to get this patch landed?

          Thanks

          Peter

          pjones Peter Jones added a comment - Emoly Can you please follow up to get this patch landed? Thanks Peter

          Kit Westneat (kit.westneat@gmail.com) uploaded a new patch: https://review.whamcloud.com/26781
          Subject: LU-8271 nodemap: wait before getting large conf if changed
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: b453a08f9604bc5f47f022b94da52bbf77480cbb

          gerrit Gerrit Updater added a comment - Kit Westneat (kit.westneat@gmail.com) uploaded a new patch: https://review.whamcloud.com/26781 Subject: LU-8271 nodemap: wait before getting large conf if changed Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: b453a08f9604bc5f47f022b94da52bbf77480cbb

          Sure, I'll get a patch together.

          kit.westneat Kit Westneat (Inactive) added a comment - Sure, I'll get a patch together.

          Kit, do you have any cycles to look into this?

          adilger Andreas Dilger added a comment - Kit, do you have any cycles to look into this?

          People

            emoly.liu Emoly Liu
            kit.westneat Kit Westneat (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: