
limit number of items in HSM action queue

Details

    • Improvement
    • Resolution: Unresolved
    • Major

    Description

      Several presentations at RUG'16 mentioned that Lustre has poor performance when there are very large numbers of HSM actions outstanding on the coordinator.

      Firstly, having a /proc file that exposes the number of entries currently in the HSM action list would allow RBH and monitoring scripts to easily monitor the number of entries.
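
      As an illustration of the kind of monitoring script meant here, the following is a minimal sketch, not an actual Lustre or RBH tool. It assumes the counters are exposed as one integer per file; the directory holding them is an assumption the caller must supply, and the default file names "archive_count" and "restore_count" are taken from the discussion in this ticket. The helper names read_counters and over_limit are hypothetical.

```python
from pathlib import Path


def read_counters(hsm_dir, names=("archive_count", "restore_count")):
    """Read one integer per file from hsm_dir, skipping files that do not exist.

    hsm_dir is an assumption: pass whatever directory exposes the counters.
    """
    counts = {}
    for name in names:
        path = Path(hsm_dir) / name
        try:
            counts[name] = int(path.read_text().strip())
        except FileNotFoundError:
            continue  # this counter is not exposed on this system
    return counts


def over_limit(counts, limit):
    """Return the sorted names of counters whose value exceeds limit."""
    return sorted(name for name, value in counts.items() if value > limit)
```

      A monitoring job could call read_counters() periodically and raise an alert whenever over_limit() returns a non-empty list.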

          Activity

            [LU-8626] limit number of items in HSM action queue

            adilger Andreas Dilger added a comment -

            2.11 has already been released.

            tl-cea Thomas Leibovici added a comment -

            About the already-landed patch https://review.whamcloud.com/30336/:

            Given that the implemented counters refer to the contents of the "active_requests" list, they should be named "active_archive_count", "active_restore_count", ... rather than "archive_count", etc., to be more explicit and to avoid any confusion with the contents of hsm/actions, which contains all requested actions.

            This change should be made before releasing 2.11, to avoid renaming /proc entries after the feature is released.

            pjones Peter Jones added a comment -

            ok


            bougetq Quentin Bouget (Inactive) added a comment -

            Hi Peter,

            I am not sure this issue should be marked as resolved yet.
            The patches that landed only provide information about how many requests the coordinator is currently handling; there are no built-in limits yet.

            pjones Peter Jones added a comment -

            Landed for 2.11


            gerrit Gerrit Updater added a comment -

            Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/30336/
            Subject: LU-8626 hsm: expose the number of active hsm requests per type
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 42e40555f250b83730d233dc5e22fd1f9396ccfe

            gerrit Gerrit Updater added a comment -

            Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/28677/
            Subject: LU-8626 hsm: count the number of started requests of each type
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 973759d1ff3bbcb217754bd9942fdf670dec2d96

            gerrit Gerrit Updater added a comment -

            Quentin Bouget (quentin.bouget@cea.fr) uploaded a new patch: https://review.whamcloud.com/30336
            Subject: LU-8626 hsm: expose the number of active hsm requests per type
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 294a0e4ff77fb09ac643b5e15af224027ded4aee

            gerrit Gerrit Updater added a comment -

            Quentin Bouget (quentin.bouget@cea.fr) uploaded a new patch: https://review.whamcloud.com/28677
            Subject: LU-8626 hsm: count the number of started requests of each type
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 4e84ba8aa669554b2d1b77459ebe79770aa4ad37

            adilger Andreas Dilger added a comment -

            I think we should stick with a single value per file, since this is required when moving stats into /sys/fs/lustre. That means something like action_archive_started_count, action_archive_succeed_count, action_restore_started_count, action_restore_succeed_count.

            People

              bougetq Quentin Bouget (Inactive)
              adilger Andreas Dilger
              Votes: 2
              Watchers: 18
