[LU-12289] Route with fault remote device selected on separated IB subnet Created: 13/May/19 Updated: 16/Oct/20 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.12.1 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Tatsushi Takamura | Assignee: | Tatsushi Takamura |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
| Epic/Theme: | lnet | ||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||
| Description |
|
LNet MultiRail selects routes from local to remote in order.
          Subnet1     Subnet2
          (DOWN)   |
REMOTE    IB(A)    |  IB(B)
            ↑      |
            x      |
            |      |
LOCAL     IB(A)    |  IB(B)
The local device is selected in round-robin fashion regardless of the status of the remote-side device, so a route to the failed remote device can still be selected.
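The reported behaviour can be illustrated with a small model. This is only a sketch under stated assumptions, not LNet code and not the modification proposed in this ticket: the interface names, the `PEER_OF` and `REMOTE_UP` tables, and both functions are hypothetical, and each local interface is assumed to reach only the remote interface on its own subnet.

```python
from itertools import cycle

# Hypothetical model of the topology above: each local interface reaches
# exactly one remote interface, the one on the same IB subnet.
PEER_OF = {"local-IB(A)": "remote-IB(A)", "local-IB(B)": "remote-IB(B)"}
# Remote IB(A) on Subnet1 is DOWN, remote IB(B) on Subnet2 is up.
REMOTE_UP = {"remote-IB(A)": False, "remote-IB(B)": True}


def round_robin(local_nis, count):
    """Pick local interfaces in turn, ignoring remote status (reported behaviour)."""
    picker = cycle(local_nis)
    return [next(picker) for _ in range(count)]


def round_robin_healthy(local_nis, count):
    """Same rotation, but skip interfaces whose remote peer is down (illustrative)."""
    usable = [ni for ni in local_nis if REMOTE_UP[PEER_OF[ni]]]
    if not usable:
        return []  # no healthy path at all
    picker = cycle(usable)
    return [next(picker) for _ in range(count)]


local_nis = list(PEER_OF)
naive = round_robin(local_nis, 4)
aware = round_robin_healthy(local_nis, 4)

# Count how many of the four selections target the failed remote device.
naive_to_down = sum(not REMOTE_UP[PEER_OF[ni]] for ni in naive)
aware_to_down = sum(not REMOTE_UP[PEER_OF[ni]] for ni in aware)
print(naive, naive_to_down)
print(aware, aware_to_down)
```

In this model, plain round-robin sends half of the selections through local IB(A) toward the failed remote IB(A), while the status-aware variant keeps all of them on Subnet2.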
We modify the algorithm for finding the best local device as follows,
|
| Comments |
| Comment by Amir Shehata (Inactive) [ 16/May/19 ] |
|
We've made some changes as part of the Multi-Rail Routing feature, which I believe should address the issues you mentioned here. Could you please look at this patch series? It is planned to be merged into master in the near future. The tip of the patch series starts here: https://review.whamcloud.com/#/c/34580/ |