Lustre / LU-8017

All Nodes report NOT HEALTHY, system is healthy

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Critical
    • Fix Version/s: Lustre 2.9.0
    • Affects Version/s: Lustre 2.9.0
    • Severity: 3

    Description

      Current build installed: https://build.hpdd.intel.com/job/lustre-reviews/38245/
      This issue has persisted for the last two builds.
      After mounting the filesystem, all nodes report NOT HEALTHY in /proc/fs/lustre/health_check.

      # pdsh -g server 'lctl get_param health_check' | dshbak -c
        ----------------
        lola-[2-11]
        ----------------
        health_check=healthy
        NOT HEALTHY

      The filesystem otherwise operates normally: jobs run and results are created.
      We had been using health_check as part of our monitoring, but have stopped doing so because of this issue.
      We are uncertain of the cause, since every operation we can test works and no errors are reported.
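
      For readers who monitor health_check in a similar way, the following is a minimal sketch of how a check script might classify the output shown above. It is not the reporter's monitoring code. The function name classify_health is hypothetical, and treating any occurrence of "NOT HEALTHY" as decisive is an assumption; under that assumption the output above, which contains both "health_check=healthy" and "NOT HEALTHY", is flagged as unhealthy.

```shell
#!/bin/sh
# Hypothetical helper (not part of Lustre): classify the text printed by
# `lctl get_param health_check`, read from standard input.
classify_health() {
    # Read all of stdin once so it can be matched against several patterns.
    output=$(cat)
    case "$output" in
        *"NOT HEALTHY"*) echo "unhealthy" ;;  # assumption: NOT HEALTHY wins
        *healthy*)       echo "healthy" ;;
        *)               echo "unknown" ;;    # empty or unexpected output
    esac
}

# Example use on a server node (requires lctl, so left commented out):
# lctl get_param health_check | classify_health
```

      A monitor built this way would report every node in this issue as unhealthy even though the filesystem is working, which matches the behaviour described above.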

            People

              Assignee: James A Simmons (simmonsja)
              Reporter: Cliff White (cliffw, inactive)