Details
Bug
Resolution: Fixed
Minor
Lustre 2.5.0
None
3
11434
Description
sanity test_120e seems to fail intermittently on some test runs.
The following appears on the MDT:
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.108@tcp (stopping)
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.108@tcp (stopping)
Lustre: Skipped 5 previous similar messages
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.109@tcp (stopping)
LustreError: 2966:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_CLOSED req@ffff8800686f7c00 x1450597855266416/t0(0) o13->lustre-OST0002-osc-MDT0000@10.10.16.108@tcp:7/4 lens 224/368 e 0 to 0 dl 0 ref 1 fl Rpc:/0/ffffffff rc 0/-1
LustreError: 2966:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 3 previous similar messages
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.110@tcp (stopping)
Lustre: 10030:0:(client.c:1897:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1383398053/real 1383398053] req@ffff880068291c00 x1450597855266444/t0(0) o251->MGC10.10.16.107@tcp@0@lo:26/25 lens 224/224 e 0 to 1 dl 1383398059 ref 2 fl Rpc:XN/0/ffffffff rc 0/-1
Lustre: server umount lustre-MDT0000 complete
It may be that the MDT is being unmounted while clients are still communicating with it.
On the client:
LustreError: 11-0: lustre-MDT0000-mdc-ffff880037d5a400: Communicating with 10.10.16.107@tcp, operation obd_ping failed with -107.
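
One way to check the unmount hypothesis against a captured console log is to count how many connection attempts each client NID had refused with "(stopping)" before the "server umount ... complete" message appeared. The following is a minimal triage sketch, not part of the Lustre test suite: the function name refused_before_umount and the two regular expressions are hypothetical and were written only against the message formats quoted in this report. The sample text reuses the MDT lines above with the long request dumps omitted.

```python
import re
from collections import Counter

# MDT console lines copied from this report (long request dumps omitted).
MDT_LOG = """\
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.108@tcp (stopping)
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.108@tcp (stopping)
Lustre: Skipped 5 previous similar messages
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.109@tcp (stopping)
Lustre: lustre-MDT0000: Not available for connect from 10.10.16.110@tcp (stopping)
Lustre: server umount lustre-MDT0000 complete
"""

# Patterns are assumptions based only on the two message formats seen above.
REFUSED = re.compile(
    r"^Lustre: (?P<target>\S+): Not available for connect "
    r"from (?P<nid>\S+) \(stopping\)$"
)
UMOUNT_DONE = re.compile(r"^Lustre: server umount (?P<target>\S+) complete$")


def refused_before_umount(log_text):
    """Count refused connects per (target, NID) that precede the target's umount."""
    pending = Counter()    # refusals seen, umount not yet confirmed
    confirmed = Counter()  # refusals followed by "umount ... complete"
    for line in log_text.splitlines():
        m = REFUSED.match(line)
        if m:
            pending[(m["target"], m["nid"])] += 1
            continue
        m = UMOUNT_DONE.match(line)
        if m:
            # Move every pending refusal for this target into the result.
            for key in [k for k in pending if k[0] == m["target"]]:
                confirmed[key] += pending.pop(key)
    return confirmed


if __name__ == "__main__":
    for (target, nid), count in sorted(refused_before_umount(MDT_LOG).items()):
        print(f"{target}: {nid} refused {count} time(s) before umount completed")
```

The counts are a lower bound, since the console rate-limits repeated messages ("Skipped 5 previous similar messages") and the sketch does not try to attribute skipped lines to a particular NID.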