  1. Lustre
  2. LU-8458

Pacemaker script to monitor Lustre servers status


Details

    • Type: New Feature
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: Lustre 2.10.0

    Description

      A new Pacemaker resource agent (RA) script that monitors the status of Lustre servers. It is compatible with both ZFS-based and LDISKFS-based Lustre server installations.

      The RA is deployed as a Pacemaker clone resource, so that an instance of it monitors each Lustre server.

      pcs resource create [Resource Name] ocf:pacemaker:healthLUSTRE \
      dampen=[seconds 5s] \
      --clone 
      

      where:

      • dampen: the time to wait (dampening) for further changes to occur

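      Install this script in /usr/lib/ocf/resource.d/heartbeat/ on both Lustre servers, with permission 755. Note that Pacemaker looks up an agent named ocf:<provider>:<type> in /usr/lib/ocf/resource.d/<provider>/, so the install directory must match the provider in the resource name: the ocf:pacemaker:healthLUSTRE name used in the commands here corresponds to /usr/lib/ocf/resource.d/pacemaker/.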
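      As a minimal sketch of the installation step, assuming the script has already been copied to each server (the `install_ra` helper name and the source path are illustrative and not part of the original issue), the following installs a file into an OCF provider directory with mode 755:

```shell
#!/bin/sh
# Hypothetical helper: copy a resource agent script into an OCF provider
# directory with mode 755. Both arguments are supplied by the caller.
install_ra() {
    src=$1        # path to the script on this server (assumed location)
    dest_dir=$2   # e.g. /usr/lib/ocf/resource.d/heartbeat
    install -d "$dest_dir" &&
    install -m 755 "$src" "$dest_dir/"
}

# Example (run as root on each Lustre server):
# install_ra ./healthLUSTRE /usr/lib/ocf/resource.d/heartbeat
```

      Using `install -m 755` sets the permission in the same step as the copy, which avoids a window where the file exists but is not executable.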

      Default values:

      • dampen 5s

      Default timeouts:

      • start: timeout 60s
      • stop: timeout 20s
      • monitor: timeout 60s, interval 10s
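      The operation defaults listed here can also be spelled out explicitly when the resource is created. The sketch below only builds and prints the command so it can be reviewed before it is run; the `build_create_cmd` helper is hypothetical, and the `op` keyword is the operation syntax of `pcs resource create`.

```shell
#!/bin/sh
# Hypothetical helper: print (do not run) a pcs command that creates the
# cloned resource with the default dampen value and operation timeouts.
build_create_cmd() {
    name=${1:-healthLUSTRE}   # resource name; defaults to the one used in this issue
    printf '%s\n' "pcs resource create $name ocf:pacemaker:healthLUSTRE dampen=5s op start timeout=60s stop timeout=20s monitor timeout=60s interval=10s --clone"
}

# Print the command for the default resource name.
build_create_cmd
```

      Printing the command first is a deliberate choice: it lets the values be checked against the defaults before anything changes in the cluster configuration.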

      Tested and compatible with:

      • pacemaker 1.1.13
      • corosync 2.3.4
      • pcs 0.9.143
      • RHEL/CentOS 7.2

      Example configuration procedure:

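      # Create the cloned health-check resource with a 5s dampening delay.
      pcs resource create healthLUSTRE ocf:pacemaker:healthLUSTRE dampen=5s --clone

      # Collect the names of the OST resources from a one-shot status report.
      targets=$(crm_mon -1 | awk '/OST/ {print $1}')

      # Keep each OST off any node where the lustred attribute is below 1 or not defined.
      for i in $targets; do pcs constraint location "$i" rule score=-INFINITY lustred lt 1 or not_defined lustred; done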
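      To review the constraints before applying them, the same procedure can be run as a dry run. The sketch below is illustrative only: the `extract_targets` and `print_constraints` helpers and the sample status lines are hypothetical, and real `crm_mon -1` output may be laid out differently on a given cluster.

```shell
#!/bin/sh
# Hypothetical helper: read status text on stdin and print the first field
# of every line that mentions an OST (the same filter as the procedure).
extract_targets() {
    awk '/OST/ {print $1}'
}

# Hypothetical helper: print, without running, one location constraint per
# resource name read from stdin.
print_constraints() {
    while read -r target; do
        printf 'pcs constraint location %s rule score=-INFINITY lustred lt 1 or not_defined lustred\n' "$target"
    done
}

# Made-up sample lines standing in for `crm_mon -1` output.
sample=' MGT    (ocf::heartbeat:Lustre):    Started mds01
 OST0000    (ocf::heartbeat:Lustre):    Started oss01
 OST0001    (ocf::heartbeat:Lustre):    Started oss02'

printf '%s\n' "$sample" | extract_targets | print_constraints
```

      On a live cluster, the sample would be replaced with `crm_mon -1 | extract_targets | print_constraints`, and the printed commands run once the list of targets looks right.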
      

            People

              Assignee: Gabriele Paciucci (Inactive)
              Reporter: Gabriele Paciucci (Inactive)