With the prefer Fast Reg patch I'm seeing the follow error using older mlx4 FDR hardware with RHEL7.4.
10000000:01000000:5.0:1505423143.375531:0:6932:0:(mgc_request.c:1205:mgc_target_register()) register lustre-MDT0000
00000800:00000100:1.0:1505423143.375818:0:3469:0:(o2iblnd_cb.c:3464:kiblnd_complete()) FastReg failed: 6
00000800:00000100:1.0:1505423143.375821:0:3469:0:(o2iblnd_cb.c:3475:kiblnd_complete()) RDMA (tx: ffffc90006c03728) failed: 5
00000800:00000100:1.0:1505423143.375834:0:3469:0:(o2iblnd_cb.c:967:kiblnd_tx_complete()) Tx -> 10.37.248.196@o2ib1 cookie 0x1 sending 1 waiting 0: failed 5
00000800:00000100:1.0:1505423143.375838:0:3469:0:(o2iblnd_cb.c:1919:kiblnd_close_conn_locked()) Closing conn to 10.37.248.196@o2ib1: error -5(waiting)
00000100:00000400:5.0:1505423143.375874:0:6932:0:(client.c:2113:ptlrpc_expire_one_request()) @@@ Request sent has failed due to network error: [sent 1505423143/real 1505423143] req@ffff881011e5c300 x1578550577594400/t0(0) o253->MGC10.37.248.196@o2ib1@10.37.248.196@o2ib1:26/25 lens 4768/4768 e 0 to 1 dl 1505423150 ref 2 fl Rpc:eX/0/ffffffff rc 0/-1
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/34322/
Subject:
LU-9810lnet: fix build with M-OFED 4.1Project: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: b4c93d99c633003d90f478d999805c76ccd744f1