Details
-
New Feature
-
Resolution: Fixed
-
Minor
Description
A new script for use with Pacemaker that monitors the status of Lustre servers. It is compatible with both ZFS-based and LDISKFS-based Lustre server installations.
This resource agent (RA) monitors a Lustre server as a Pacemaker clone resource, so one instance runs on each server.
pcs resource create [Resource Name] ocf:pacemaker:healthLUSTRE \
    dampen=[seconds 5s] \
    --clone
where:
- dampen The time to wait (dampening) for further changes to occur
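This script should be installed on both Lustre servers with permission 755, in the directory that matches the provider named in the resource definition (/usr/lib/ocf/resource.d/pacemaker/ for ocf:pacemaker:healthLUSTRE).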
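A minimal sketch of the installation step, to be run on each Lustre server. The function name install_agent and both of its arguments are hypothetical; the destination is passed in explicitly, so the sketch makes no assumption about where the agent directory is.

```shell
#!/bin/sh
# install_agent SRC DEST_DIR
# Copies the agent script into DEST_DIR as "healthLUSTRE" with mode 755.
# Both arguments are illustrative; pass the resource agent directory in use.
install_agent() {
    src=$1
    dest_dir=$2
    # Refuse to continue if the source script or target directory is missing.
    [ -f "$src" ] || { echo "missing agent script: $src" >&2; return 1; }
    [ -d "$dest_dir" ] || { echo "missing directory: $dest_dir" >&2; return 1; }
    # install(1) copies the file and sets the permission bits in one step.
    install -m 755 "$src" "$dest_dir/healthLUSTRE"
}
```

For example, install_agent ./healthLUSTRE "$OCF_DIR", where OCF_DIR is an assumed variable holding the agent directory.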
Default values:
- dampen 5s
Default timeout:
- start timeout 60s
- stop timeout 20s
- monitor timeout 60s interval 10s
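If the operation timeouts should be visible in the cluster configuration rather than left implicit, they can be spelled out when the resource is created. The sketch below assumes the standard pcs "op" syntax and uses the default values listed above. The function create_health_resource and the DRY_RUN variable are hypothetical helpers; by default the command is only printed, not executed.

```shell
#!/bin/sh
# create_health_resource [NAME] [DAMPEN]
# Builds the pcs command with the default start/stop/monitor settings made explicit.
# DRY_RUN=1 (the default here) prints the command; DRY_RUN=0 runs it.
create_health_resource() {
    name=${1:-healthLUSTRE}
    dampen=${2:-5s}
    # Store the full command in the positional parameters so it can be
    # either printed or executed without re-quoting.
    set -- pcs resource create "$name" ocf:pacemaker:healthLUSTRE \
        "dampen=$dampen" \
        op start timeout=60s \
           stop timeout=20s \
           monitor timeout=60s interval=10s \
        --clone
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}
```

Calling create_health_resource with no arguments prints the command for review; setting DRY_RUN=0 in the environment runs it against the cluster.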
Compatible and tested with:
- pacemaker 1.1.13
- corosync 2.3.4
- pcs 0.9.143
- RHEL/CentOS 7.2
Example configuration procedure:
pcs resource create healthLUSTRE ocf:pacemaker:healthLUSTRE dampen=5s --clone
targets=`crm_mon -1 | grep 'OST' | awk '{print $1}'`
for i in $targets; do pcs constraint location $i rule score=-INFINITY lustred lt 1 or not_defined lustred; done
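The constraint loop in the procedure above changes the cluster as soon as it runs, so it may be worth previewing the commands first. The following is a sketch only: the helper names ost_resources and print_constraints are hypothetical, and the filter assumes, as the original grep/awk pair does, that OST resource names contain the string OST and appear in the first column of the crm_mon -1 output.

```shell
#!/bin/sh
# Print the first column of every input line that mentions OST
# (same selection as: grep 'OST' | awk '{print $1}').
ost_resources() {
    awk '/OST/ {print $1}'
}

# Read resource names from stdin and print, without running it, one
# location constraint per name. The rule keeps a resource off any node
# where the lustred attribute is below 1 or is not defined.
print_constraints() {
    while read -r res; do
        [ -n "$res" ] || continue
        echo "pcs constraint location $res rule score=-INFINITY lustred lt 1 or not_defined lustred"
    done
}
```

To review the commands, run crm_mon -1 | ost_resources | print_constraints. Once the list looks right, the same pipeline can be piped into sh to apply the constraints.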