[LU-8457] Pacemaker script to monitor LNet Created: 01/Aug/16 Updated: 26/Apr/17 Resolved: 26/Apr/17 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.10.0 |
| Type: | New Feature | Priority: | Minor |
| Reporter: | Gabriele Paciucci (Inactive) | Assignee: | Gabriele Paciucci (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | lnet, pacemaker | ||
| Issue Links: |
|
||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
A new script to be used in Pacemaker to monitor LNet compatible with ZFS and LDISKFS based Lustre server installations. This RA is able to monitor a single LNet device using the Pacemaker's clone technology. pcs resource create [Resource Name] ocf:pacemaker:healthLNET \ dampen=[seconds 5s] \ multiplier=[number 1000] \ lctl=[ true | false] \ device=[device name ib0] \ host_list=[ list of NIDs, space separated, if lctl is true otherwise list of IPs] \ --clone where:
This script should be located in /usr/lib/ocf/resource.d/heartbeat/ of both the Lustre servers with permission 755. Default values:
Default timeout:
Compatible and tested:
Example of procedure to configure: pcs resource create healthLNET ocf:pacemaker:healthLNET dampen=5s multiplier=1000 lctl=true device=eth1 host_list="10.10.130.1@tcp1 10.10.130.2@tcp1" --clone
targets=`crm_mon -1|grep 'OST'| awk '{print $1}'`
for i in $targets; do pcs constraint location $i rule score=-INFINITY pingd lt 1 or not_defined pingd; done
|
| Comments |
| Comment by Gerrit Updater [ 01/Sep/16 ] |
|
Gabriele Paciucci (gabriele.paciucci@intel.com) uploaded a new patch: http://review.whamcloud.com/22266 |
| Comment by Gerrit Updater [ 07/Feb/17 ] |
|
Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/22266/ |
| Comment by Peter Jones [ 07/Feb/17 ] |
|
Landed for 2.10 |
| Comment by Gerrit Updater [ 07/Feb/17 ] |
|
Nathaniel Clark (nathaniel.l.clark@intel.com) uploaded a new patch: https://review.whamcloud.com/25297 |
| Comment by Gerrit Updater [ 26/Apr/17 ] |
|
Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/25297/ |