[LU-13317] lustre-rsync-test test_3c: timeout
Created: 03/Mar/20
Updated: 03/Mar/20

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Unresolved
Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

Description

This issue was created by maloo for S Buisson <sbuisson@ddn.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/7c420730-9a2b-462f-a20e-024e08a6264c

test_3c failed with the following error:

Timeout occurred after 488 mins, last suite running was lustre-rsync-test

There are very few clues as to what happened; it seems the secondary MDS was unreachable:

[29493.811278] Lustre: 29576:0:(client.c:2228:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1583191310/real 1583191310]  req@ffff9430263bc900 x1660080395434304/t0(0) o400->lustre-MDT0001-mdc-ffff94303b57a000@10.9.4.245@tcp:12/10 lens 224/224 e 0 to 1 dl 1583191317 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/1:2.0'
[29493.814254] Lustre: 29576:0:(client.c:2228:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
[29493.815183] Lustre: lustre-MDT0001-mdc-ffff94303b57a000: Connection to lustre-MDT0001 (at 10.9.4.245@tcp) was lost; in progress operations using this service will wait for recovery to complete
[29493.816784] Lustre: Skipped 1 previous similar message
[29498.817136] LustreError: 166-1: MGC10.9.4.244@tcp: Connection to MGS (at 10.9.4.244@tcp) was lost; in progress operations using this service will fail
[29498.818478] LustreError: Skipped 1 previous similar message
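To make the excerpt above easier to read, here is a minimal sketch that pulls the key fields out of the ptlrpc_expire_one_request() line. The regular expression and the helper name parse_expired_request are illustrative assumptions, not part of Lustre or Maloo tooling, and the pattern is only checked against the single line quoted in this ticket. To my knowledge opcode 400 is OBD_PING in the Lustre opcode table, which would make the expired request a routine ping to lustre-MDT0001, but treat that mapping as an assumption to verify against the source tree.

```python
import re
from datetime import datetime, timezone

# The first log line from this ticket, copied verbatim.
LOG_LINE = (
    "[29493.811278] Lustre: 29576:0:(client.c:2228:ptlrpc_expire_one_request()) "
    "@@@ Request sent has timed out for slow reply: "
    "[sent 1583191310/real 1583191310]  req@ffff9430263bc900 "
    "x1660080395434304/t0(0) "
    "o400->lustre-MDT0001-mdc-ffff94303b57a000@10.9.4.245@tcp:12/10 "
    "lens 224/224 e 0 to 1 dl 1583191317 ref 1 fl Rpc:XNQr/0/ffffffff "
    "rc 0/-1 job:'kworker/1:2.0'"
)

# Hypothetical pattern written for this one message format.
PATTERN = re.compile(
    r"\[\s*(?P<uptime>\d+\.\d+)\]"                      # kernel uptime in seconds
    r".*?\[sent (?P<sent>\d+)/real (?P<real>\d+)\]"     # epoch seconds when sent
    r".*?\bo(?P<opcode>\d+)->(?P<target>[^@\s]+)@(?P<nid>\S+?):\d+/\d+"
    r".*?\bdl (?P<deadline>\d+)"                        # epoch-second deadline
)


def parse_expired_request(line):
    """Return the interesting fields of an expired-request line, or None."""
    match = PATTERN.search(line)
    if match is None:
        return None
    sent = int(match.group("sent"))
    deadline = int(match.group("deadline"))
    return {
        "uptime_s": float(match.group("uptime")),
        "opcode": int(match.group("opcode")),
        "target": match.group("target"),
        "nid": match.group("nid"),
        "sent_utc": datetime.fromtimestamp(sent, tz=timezone.utc),
        # Seconds between sending the request and its deadline.
        "window_s": deadline - sent,
    }


if __name__ == "__main__":
    info = parse_expired_request(LOG_LINE)
    for key, value in info.items():
        print(f"{key}: {value}")
```

For the quoted line this gives a request sent at 2020-03-02 23:21:50 UTC to 10.9.4.245@tcp with a 7 second window before it was expired. The line's kernel timestamp of about 29493 s is roughly 491 minutes of uptime, which is close to the 488 minute suite timeout, so it may be worth checking whether the connection loss preceded the hang or only followed the timeout teardown. Uptime and suite start are not necessarily aligned, so this is only a hint.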

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
lustre-rsync-test test_3c - Timeout occurred after 488 mins, last suite running was lustre-rsync-test

