[LU-16149] LNet Discovery queue and deletion race Created: 12/Sep/22 Updated: 23/Feb/23 Resolved: 25/Oct/22 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.16.0, Lustre 2.15.2 |
| Fix Version/s: | Lustre 2.16.0, Lustre 2.15.3 |
| Type: | Bug | Priority: | Major |
| Reporter: | Chris Horn | Assignee: | Chris Horn |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||
| Severity: | 3 | ||||
| Rank (Obsolete): | 9223372036854775807 | ||||
| Description |
|
lnet_peer_deletion() can race with another thread calling
Discovery thread:
Another thread:
Discovery thread:
At this point, the peer is not on any discovery list, and it has
To solve this, modify lnet_peer_deletion() so that it waits to clear
Furthermore, do not bother deleting the peer from the ln_dc_working |
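The ordering problem described above can be illustrated with a small, deterministic sketch. It is not the LNet code. The description is cut off before it names what lnet_peer_deletion() waits to clear, so the sketch assumes it is a per-peer "discovery in progress" flag. The struct, its fields, and every function name below are hypothetical, locking appears only as comments, and the second thread is simulated by a hook called in the deleter's unlocked window.

```c
#include <stdbool.h>

/* Hypothetical stand-in for a peer; NOT the real struct lnet_peer. */
struct peer {
    bool discovering; /* assumed "discovery in progress" flag */
    bool on_list;     /* true while linked on a discovery list */
};

/* Simulates the other thread running in the deleter's unlocked window. */
typedef void (*window_hook)(struct peer *);

/* Other thread: mark the peer, and queue it only if it is on no list. */
void queue_for_discovery(struct peer *p)
{
    /* lock(peer) */
    p->discovering = true;
    /* unlock(peer) */
    if (!p->on_list)
        p->on_list = true; /* add to the request queue */
}

/* Racy ordering: flag cleared before the deletion work, list entry dropped. */
void delete_racy(struct peer *p, window_hook hook)
{
    /* lock(peer) */
    p->discovering = false;
    /* unlock(peer) */
    if (hook)
        hook(p);         /* other thread sets the flag, sees on_list, skips queueing */
    p->on_list = false;  /* peer removed from the working list */
    /* ... deletion work ... */
}

/* Deferred ordering: flag stays set during the work; the list is left alone. */
void delete_deferred(struct peer *p, window_hook hook)
{
    /* unlock(peer) */
    if (hook)
        hook(p);         /* other thread finds the flag already set */
    /* ... deletion work ... */
    /* lock(peer) */
    p->discovering = false;
    /* unlock(peer) */
}

/* Assumed later completion step that unlinks the peer from its list. */
void discovery_complete(struct peer *p)
{
    p->on_list = false;
}

/* A peer is stranded when it is flagged but sits on no discovery list. */
bool is_stranded(const struct peer *p)
{
    return p->discovering && !p->on_list;
}
```

Calling delete_racy() with queue_for_discovery as the hook leaves the peer flagged but on no list. Calling delete_deferred() with the same hook, followed by discovery_complete(), leaves it unflagged and unlinked, so it can be queued again later. The sketch assumes that clearing the flag and the completion step are not interleaved with another queueing attempt.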
| Comments |
| Comment by Gerrit Updater [ 12/Sep/22 ] |
|
"Chris Horn <chris.horn@hpe.com>" uploaded a new patch: https://review.whamcloud.com/48532 |
| Comment by Gerrit Updater [ 25/Oct/22 ] |
|
"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/48532/ |
| Comment by Peter Jones [ 25/Oct/22 ] |
|
Landed for 2.16 |
| Comment by Gerrit Updater [ 25/Jan/23 ] |
|
"Serguei Smirnov <ssmirnov@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49772 |
| Comment by Gerrit Updater [ 23/Feb/23 ] |
|
"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49772/ |