[LU-14329] Interop: recovery-small test 140a fails with 'no clients with recovery disabled' Created: 13/Jan/21  Updated: 19/Mar/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.14.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Nunez (Inactive) Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: interop, tests

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

recovery-small test_140a has been failing in interop testing since 07 AUG 2020 when the Lustre server version is < 2.13.55.16 and the Lustre client version is >= 2.13.55.16. The failure does not occur with Lustre 2.12.5 and 2.12.6 servers, but it does occur with 2.13.0 servers.

Looking at the suite_log for the latest failure at https://testing.whamcloud.com/test_sets/8adef6a4-82c3-4286-811b-c3600c371395, we can see that both reading and setting the ‘local_recovery’ parameter fail because the parameter does not exist on the server:

== recovery-small test 140a: local mount is flagged properly ========================================= 17:08:55 (1608829735)
CMD: trevis-17vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.local_recovery
trevis-17vm4: error: get_param: param_path 'mdt/lustre-MDT0000/local_recovery': No such file or directory
pdsh@trevis-17vm1: trevis-17vm4: ssh exited with exit code 2
CMD: trevis-17vm4 /usr/sbin/lctl set_param mdt.*.local_recovery=0
trevis-17vm4: error: set_param: param_path 'mdt/*/local_recovery': No such file or directory
pdsh@trevis-17vm1: trevis-17vm4: ssh exited with exit code 2
mds1_HOST
…
CMD: trevis-17vm4 umount  /mnt/lustre2 2>&1
CMD: trevis-17vm4 rmdir /mnt/lustre2
 recovery-small test_140a: @@@@@@ FAIL: no clients with recovery disabled 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:6273:error()
  = /usr/lib64/lustre/tests/recovery-small.sh:2935:test_140a()
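
The log above shows the test going on to fail after 'lctl get_param' and 'lctl set_param' report that local_recovery does not exist. One possible way to guard against that, sketched here and not taken from any actual fix, is to probe for the parameter before relying on it. The helper names param_exists and check_local_recovery are hypothetical and are not part of test-framework.sh; the sketch also assumes it runs directly on the MDS node rather than through pdsh.

```shell
#!/bin/sh
# Sketch only: probe for a parameter before the test depends on it.
# LCTL can be overridden so the logic can be exercised without Lustre installed.
LCTL=${LCTL:-/usr/sbin/lctl}

# Hypothetical helper: succeeds only if "lctl get_param" can read the
# given parameter on the local node.
param_exists() {
    "$LCTL" get_param -n "$1" >/dev/null 2>&1
}

# Hypothetical wrapper: prints "run" when the MDT exposes local_recovery,
# otherwise prints a skip message instead of letting the test fail later.
check_local_recovery() {
    if param_exists "mdt.*.local_recovery"; then
        echo "run"
    else
        echo "skip: mdt.*.local_recovery not available on this server"
    fi
}
```

Calling check_local_recovery on a server that lacks the parameter would print the skip message. Whether skipping is the right behavior for this interop combination, or whether the test should be fixed another way, is left open here.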
