[LU-11231] recovery-small test_26a: timeouts are wrong Created: 09/Aug/18 Updated: 26/Jan/19 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.10.5 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Maloo | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
This issue was created by maloo for sarah <sarah@whamcloud.com> This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ccb60282-98de-11e8-87f3-52540065bddc test_26a failed with the following error: 'timeouts are wrong! mds: 100, client: 20, TIMEOUT=20' == recovery-small test 26a: evict dead exports == 21:20:49 (1533417649) CMD: trevis-8vm12 lctl get_param -n timeout recovery-small test_26a: @@@@@@ FAIL: timeouts are wrong! mds: 100, client: 20, TIMEOUT=20 VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV |
| Comments |
| Comment by Andreas Dilger [ 14/Aug/18 ] |
|
This looks like it is test script fallout from test_23 failing to restart the MDS for some reason. It doesn't appear to be an actual Lustre bug. |
| Comment by James Nunez (Inactive) [ 14/Aug/18 ] |
|
Whenever we see recovery-small test 26a to fail with the ''timeouts are wrong! mds: 100, client: 20, TIMEOUT=20'' error message, we also see that a previous test, recovery-small test 23, fails with a ping/pm issue (DCO-8171) waiting ! ping -w 3 -c 1 trevis-8vm11, 4 secs left ... waiting ! ping -w 3 -c 1 trevis-8vm11, 3 secs left ... waiting ! ping -w 3 -c 1 trevis-8vm11, 2 secs left ... waiting ! ping -w 3 -c 1 trevis-8vm11, 1 secs left ... waiting for trevis-8vm11 to fail attempts=3 + pm -h powerman --off trevis-8vm11 waiting ! ping -w 3 -c 1 trevis-8vm11, 4 secs left ... waiting ! ping -w 3 -c 1 trevis-8vm11, 3 secs left ... waiting ! ping -w 3 -c 1 trevis-8vm11, 2 secs left ... waiting ! ping -w 3 -c 1 trevis-8vm11, 1 secs left ... waiting for trevis-8vm11 to fail attempts=3 trevis-8vm11 still pingable after power down! attempts=3 The converse is not true; recovery-small test 23 fails some times and test 26a does not fail. |