[LU-14566] Skip discovery in LNetPrimaryNID when lnet_peer_discovery_disabled is set Created: 26/Mar/21 Updated: 15/Jul/21 Resolved: 28/Apr/21 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.15.0 |
| Type: | Improvement | Priority: | Minor |
| Reporter: | Chris Horn | Assignee: | Chris Horn |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None |
| Description |
|
If discovery is disabled locally then the discovery thread will not |
| Comments |
| Comment by Gerrit Updater [ 26/Mar/21 ] |
|
Chris Horn (chris.horn@hpe.com) uploaded a new patch: https://review.whamcloud.com/43141 |
| Comment by Amir Shehata (Inactive) [ 27/Mar/21 ] |
|
Do you see this issue even with: LU-13972 o2iblnd: Don't retry indefinitely ? |
| Comment by Chris Horn [ 27/Mar/21 ] |
|
That change will only impact local peers. Remote clients will still have to wait for the full LNet transaction timeout. |
| Comment by Gerrit Updater [ 28/Apr/21 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/43141/ |
| Comment by Peter Jones [ 28/Apr/21 ] |
|
Landed for 2.15 |
| Comment by Gerrit Updater [ 04/May/21 ] |
|
Etienne AUJAMES (eaujames@ddn.com) uploaded a new patch: https://review.whamcloud.com/43537 |
| Comment by Etienne Aujames [ 04/May/21 ] |
|
We encountered an issue where 2 LNet routes were missing on the server side (OSS): the clients could communicate with the servers, but the servers could not answer. The clients periodically tried to connect to the servers, which kept the missing peers in the discovery list (the_lnet.ln_dc_working). As a consequence, the ll_ostXX_XXX threads waited indefinitely for peer discovery, and the stall progressively spread to all the available threads (the clients kept sending connection requests). The servers became unavailable for all the clients. "LNet discovery" and "LNet health" are disabled on the clients and on the servers. |
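
For readers following the thread, the guard named in the ticket title can be pictured with a minimal, self-contained C model. This is only a sketch under stated assumptions, not the code from the Gerrit patches: `peer_model`, `discovery_disabled`, `wait_for_discovery` and `primary_nid_model` are hypothetical stand-ins invented for illustration and do not correspond to real LNet types or functions. The idea shown is simply that a caller asking for a peer's primary NID does not block on discovery when discovery is disabled locally.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a NID; NOT the real LNet type. */
typedef uint64_t nid_model_t;

/* Hypothetical, simplified peer record used only for this sketch. */
struct peer_model {
    nid_model_t primary_nid; /* primary NID currently recorded for the peer */
    bool discovered;         /* true once discovery has completed */
};

/* Models the local "discovery disabled" setting named in the ticket title. */
static bool discovery_disabled;

/* Counts how many times a caller would have blocked on discovery. */
static int discovery_waits;

/* Hypothetical placeholder for blocking until discovery of a peer finishes.
 * In this model it only records the wait and marks the peer discovered. */
static void wait_for_discovery(struct peer_model *peer)
{
    discovery_waits++;
    peer->discovered = true;
}

/* Return the recorded primary NID. When discovery is disabled locally,
 * skip the wait entirely so the calling thread cannot get stuck on it. */
static nid_model_t primary_nid_model(struct peer_model *peer)
{
    assert(peer != NULL);

    if (!discovery_disabled && !peer->discovered)
        wait_for_discovery(peer);

    return peer->primary_nid;
}
```

In this model, calling `primary_nid_model()` with `discovery_disabled` set returns the recorded NID immediately and leaves `discovery_waits` unchanged, which is the property that would keep service threads from piling up behind an undiscoverable peer.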