Details
-
Bug
-
Resolution: Fixed
-
Major
-
Lustre 2.15.0
-
None
-
3
-
9223372036854775807
Description
This issue was created by maloo for Elena <elena.gryaznova@hpe.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ec8cab41-6587-4369-815b-b53f9684d272
test_failover_mds failed with the following error:
Checking clients are in FULL|IDLE state before next failover CMD: onyx-33vm3,onyx-33vm4 PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/utils/gss:/usr/lib64/lustre/utils:/usr/lib64/openmpi/bin:/usr/share/Modules/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/usr/sbin:/sbin:/bin::/sbin:/bin:/usr/sbin: NAME=autotest_config bash rpc.sh wait_import_state_mount FULL|IDLE mdc.lustre-MDT0000-mdc-*.mds_server_uuid onyx-33vm4: sh: IDLE: command not found onyx-33vm3: sh: IDLE: command not found onyx-33vm4: CMD: onyx-64vm13 /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm3: CMD: onyx-64vm13 /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm4: CMD: onyx-64vm13 /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm3: CMD: onyx-64vm13 /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm4: CMD: onyx-24vm5 /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm3: CMD: onyx-24vm5 /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm3: CMD: onyx-33vm3.onyx.whamcloud.com /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm3: onyx-33vm3.onyx.whamcloud.com: executing wait_import_state_mount FULL onyx-33vm4: CMD: onyx-33vm4.onyx.whamcloud.com /usr/sbin/lctl get_param -n version 2>/dev/null onyx-33vm4: onyx-33vm4.onyx.whamcloud.com: executing wait_import_state_mount FULL onyx-33vm3: CMD: onyx-33vm3.onyx.whamcloud.com lctl get_param -n at_max pdsh@onyx-33vm1: onyx-33vm3: ssh exited with exit code 127 onyx-33vm4: CMD: onyx-33vm4.onyx.whamcloud.com lctl get_param -n at_max pdsh@onyx-33vm1: onyx-33vm4: ssh exited with exit code 127 recovery-mds-scale test_failover_mds: @@@@@@ FAIL: import is not in FULL|IDLE state
regression caused by:
commit af666bef058c5b7997527fc851a84a89375912fb Author: Andreas Dilger <adilger@whamcloud.com> AuthorDate: Wed Oct 20 19:47:25 2021 -0600 Commit: Oleg Drokin <green@whamcloud.com> CommitDate: Tue Nov 30 03:52:10 2021 +0000 LU-12857 tests: allow clients to be IDLE after recovery If clients are not connected to an OST when it fails (connection is IDLE), they do not need to be involved in recovery, so this should not be considered an error when checking the client state. Test-Parameters: trivial testlist=recovery-mds-scale env=SLOW=no Test-Parameters: testlist=conf-sanity Test-Parameters: testlist=replay-dual,replay-single Signed-off-by: Andreas Dilger <adilger@whamcloud.com> Change-Id: I6cfeb718acd233378ed1608f22061bc15c3ebbe5 Reviewed-on: https://review.whamcloud.com/45318 Tested-by: jenkins <devops@whamcloud.com> Tested-by: Maloo <maloo@whamcloud.com> Reviewed-by: Alex Zhuravlev <bzzz@whamcloud.com> Reviewed-by: James Nunez <jnunez@whamcloud.com> Reviewed-by: Oleg Drokin <green@whamcloud.com>
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
recovery-mds-scale test_failover_mds - import is not in FULL|IDLE state
Attachments
Issue Links
- is related to
-
LU-12857 recovery-mds-scale test_failover_ost fails with “import is not in FULL state”
- Resolved