Details
-
Question/Request
-
Resolution: Unresolved
-
Critical
-
None
-
Lustre 2.12.8
-
None
-
9223372036854775807
Description
We have a critical Mellanox firmware issue causing HCA soft lockups. As a possible work around we are considering using tcp NIDs, until we have a fix from Mellanox.
We want to configure servers with tcp and o2ib nids, doing so will avoid future down time switching NIDS on the server.
Is there a way to select/prefer NIDs on the client per filesystem?
Here is an example config:
client_a: [o2ib(ib0) tcp(ib0)] (mount fs1 using tcp0) (mount fs2 using o2ib) (mount fs3 mount o2ib)
client_b: [o2ib414(ib0)] (mount fs1 using tcp0) (mount fs2 using o2ib) (mount fs3 mount o2ib)
lrouter: o2ib41(ib1) :: o2ib(ib0) tcp(ib0)
fs1-srv1: o2ib(ib0) tcp(ib0)
fs2-srv1: o2ib(ib0) tcp(ib0)
fs3-srv1: o2ib(ib0)