[LUDOC-497] Snapshot tools may fail when #targets > MaxStartups in sshd_config Created: 04/Nov/21  Updated: 04/Nov/21

Status: Open
Project: Lustre Documentation
Component/s: None
Affects Version/s: Lustre 2.13.0 Manual
Fix Version/s: None

Type: Question/Request Priority: Minor
Reporter: Nathan Crawford Assignee: Lustre Manual Triage
Resolution: Unresolved Votes: 0
Labels: snapshots, zfs
Environment:

Lustre 2.12.7, CentOS 7.9, zfs 0.8.5


Rank (Obsolete): 9223372036854775807

 Description   

When using ssh as the remote shell for zfs-based lustre snapshots, servers with large numbers of targets may fail to create snapshots on all relevant zfs datasets. A new ssh connection is attempted for each target, and when the total reaches the default maximum number of concurrent unauthenticated connections (10), there is a chance the next connection will be killed.

The solution suggested for LU-9368 works for me: make sure that the MaxStartups value in /etc/ssh/sshd_config is at larger than the number of targets on the server.

Can a note about this appear in the documentation or man pages? It took me an embarrassingly long time to track it down.


Generated at Sat Feb 10 03:43:25 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.