Details
Type: Bug
Resolution: Unresolved
Priority: Minor
Description
This issue was created by maloo for S Buisson <sbuisson@ddn.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/7c420730-9a2b-462f-a20e-024e08a6264c
test_3c failed with the following error:
Timeout occurred after 488 mins, last suite running was lustre-rsync-test
There are very few clues as to what happened, but the secondary MDS appears to have been unreachable:
[29493.811278] Lustre: 29576:0:(client.c:2228:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1583191310/real 1583191310] req@ffff9430263bc900 x1660080395434304/t0(0) o400->lustre-MDT0001-mdc-ffff94303b57a000@10.9.4.245@tcp:12/10 lens 224/224 e 0 to 1 dl 1583191317 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/1:2.0'
[29493.814254] Lustre: 29576:0:(client.c:2228:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
[29493.815183] Lustre: lustre-MDT0001-mdc-ffff94303b57a000: Connection to lustre-MDT0001 (at 10.9.4.245@tcp) was lost; in progress operations using this service will wait for recovery to complete
[29493.816784] Lustre: Skipped 1 previous similar message
[29498.817136] LustreError: 166-1: MGC10.9.4.244@tcp: Connection to MGS (at 10.9.4.244@tcp) was lost; in progress operations using this service will fail
[29498.818478] LustreError: Skipped 1 previous similar message
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
lustre-rsync-test test_3c - Timeout occurred after 488 mins, last suite running was lustre-rsync-test