Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8017

All Nodes report NOT HEALTHY, system is healthy

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Critical Critical
    • Lustre 2.9.0
    • Lustre 2.9.0
    • 3
    • 9223372036854775807

      Current build installed; https://build.hpdd.intel.com/job/lustre-reviews/38245/
      This issue has persisted for the last two builds.
      After mounting the filesystem, all nodes report NOT HEALTHY in /proc/fs/lustre/health_check.

      1. pdsh -g server 'lctl get_param health_check' |dshbak -c
        ----------------
        lola-[2-11]
        ----------------
        health_check=healthy
        NOT HEALTHY

      The filesystem otherwise operates normally, jobs run, results are created.
      We were using the health_check as part of our monitoring - this has been discontinued.
      We are uncertain as to the cause, as all operations we can test work fine, and no errors are reported.

            simmonsja James A Simmons
            cliffw Cliff White (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

              Created:
              Updated:
              Resolved: