
Create a Lustre stat to indicate when there was a client eviction

Details

    • Improvement
    • Resolution: Duplicate
    • Minor

    Description

      A number of our customers use VM scale sets, AKS, or spot instances, and they complain that their workloads get stuck for some amount of time and then continue without any intervention.  Almost every time this happens, it is because a client was deprovisioned without properly unmounting.  This is a problem on the end user's side, but it would be nice not to have to walk logs to identify that evictions are occurring.  Alerts and other useful actions can then be built around changes in this stat.

      This work item tracks creation of a Lustre stat that increments every time a client eviction occurs.
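
The alerting idea above could be sketched roughly as follows: sample the eviction counter periodically and act whenever it grows. This ticket does not give the stat's name or location, so the sketch works on integer samples that have already been read from wherever the stat ends up being exposed. The function name `eviction_deltas` and the sample values are hypothetical, chosen only for illustration.

```python
from typing import Iterable, Iterator, Tuple


def eviction_deltas(samples: Iterable[int]) -> Iterator[Tuple[int, int]]:
    """Yield (sample_index, new_evictions) each time the counter grows.

    `samples` is a sequence of readings of a monotonically increasing
    eviction counter (hypothetical; how it is read is out of scope here).
    """
    previous = None
    for index, current in enumerate(samples):
        # Only report growth relative to the previous reading.
        if previous is not None and current > previous:
            yield index, current - previous
        # If the counter went backwards, assume it was reset (for example
        # after a server restart) and simply use the new value as baseline.
        previous = current


if __name__ == "__main__":
    readings = [0, 0, 2, 2, 3]  # made-up samples for illustration
    for index, delta in eviction_deltas(readings):
        print(f"sample {index}: {delta} new eviction(s)")
```

An alerting hook would replace the `print` call. The first reading only sets the baseline, so evictions that happened before monitoring started are not reported.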

      Patch to be sent shortly.

          Activity

            [LU-17728] Create a Lustre stat to indicate when there was a client eviction
            Gerrit Updater added a comment (edited)

            "Ellis Wilson <elliswilson@microsoft.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/54756
            Subject: LU-17728 obdclass: Lustre stat to track client evictions
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: a70a22124edd6d3dd593a7abf405a2e927c63b70


            People

              Assignee: Ellis Wilson (elliswilson)
              Reporter: Ellis Wilson (elliswilson)
              Votes: 0
              Watchers: 4
