[LU-16506] recovery-mds-scale test_failover_mds: recovery_status recovery not done in 1475 sec. status: RECOVERING Created: 26/Jan/23  Updated: 26/Jan/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Elena <elena.gryaznova@hpe.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/fbd79818-f68c-4b5c-ba38-ce51b1a81377

test_failover_mds failed with the following error:

Timeout occurred after 1448 minutes, last suite running was recovery-mds-scale

Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/91846 - 4.18.0-348.7.1.el8_5.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/91846 - 4.18.0-348.23.1.el8_lustre.x86_64

trevis-55vm8: Waiting 0 secs for *.lustre-MDT0001.recovery_status recovery done. status: RECOVERING
trevis-55vm8: *.lustre-MDT0001.recovery_status recovery not done in 1475 sec. status: RECOVERING
pdsh@trevis-55vm1: trevis-55vm8: ssh exited with exit code 1
mds2 recovery is not completed!

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
recovery-mds-scale test_failover_mds - Timeout occurred after 1448 minutes, last suite running was recovery-mds-scale


Generated at Sat Feb 10 03:27:36 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.