[LU-11231] recovery-small test_26a: timeouts are wrong Created: 09/Aug/18  Updated: 26/Jan/19

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.5
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ccb60282-98de-11e8-87f3-52540065bddc

test_26a failed with the following error:

'timeouts are wrong! mds: 100, client: 20, TIMEOUT=20'
== recovery-small test 26a: evict dead exports == 21:20:49 (1533417649)
CMD: trevis-8vm12 lctl get_param -n timeout
 recovery-small test_26a: @@@@@@ FAIL: timeouts are wrong! mds: 100, client: 20, TIMEOUT=20 

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
recovery-small test_26a - 'timeouts are wrong! mds: 100, client: 20, TIMEOUT=20'



 Comments   
Comment by Andreas Dilger [ 14/Aug/18 ]

This looks like it is test script fallout from test_23 failing to restart the MDS for some reason. It doesn't appear to be an actual Lustre bug.

Comment by James Nunez (Inactive) [ 14/Aug/18 ]

Whenever we see recovery-small test 26a to fail with the ''timeouts are wrong! mds: 100, client: 20, TIMEOUT=20'' error message, we also see that a previous test, recovery-small test 23, fails with a ping/pm issue (DCO-8171)

waiting ! ping -w 3 -c 1 trevis-8vm11, 4 secs left ...
waiting ! ping -w 3 -c 1 trevis-8vm11, 3 secs left ...
waiting ! ping -w 3 -c 1 trevis-8vm11, 2 secs left ...
waiting ! ping -w 3 -c 1 trevis-8vm11, 1 secs left ...
waiting for trevis-8vm11 to fail attempts=3
+ pm -h powerman --off trevis-8vm11
waiting ! ping -w 3 -c 1 trevis-8vm11, 4 secs left ...
waiting ! ping -w 3 -c 1 trevis-8vm11, 3 secs left ...
waiting ! ping -w 3 -c 1 trevis-8vm11, 2 secs left ...
waiting ! ping -w 3 -c 1 trevis-8vm11, 1 secs left ...
waiting for trevis-8vm11 to fail attempts=3
trevis-8vm11 still pingable after power down! attempts=3

The converse is not true; recovery-small test 23 fails some times and test 26a does not fail.

Generated at Sat Feb 10 02:42:05 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.