[LU-16850] socklnd: ksocknal_shutdown() ASSERTION( net->ksnn_interface.ksni_nroutes == 0 ) failed Created: 26/May/23 Updated: 17/Jun/23 Resolved: 14/Jun/23 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.16.0 |
| Type: | Bug | Priority: | Minor |
| Reporter: | Serguei Smirnov | Assignee: | Serguei Smirnov |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | lnet, socklnd | ||
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
This showed up in Janitor testing for The sanity-lnet test introduced in that patch causes Janitor to run tests on that specific test ~20 times exercising the touched test only. All of these runs failed on the same assert. Running the same test as part of full sanity-lnet suite execution doesn't trigger the assertion failure. Local testing also failed to reproduce the issue. The assertion implies incomplete clean-up of socklnd peer on NI shutdown or, less likely, memory corruption. It needs to be determined whether the issue causing the assertion to fail is introduced by LNet code changes |
| Comments |
| Comment by Gerrit Updater [ 26/May/23 ] |
|
"Serguei Smirnov <ssmirnov@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51148 |
| Comment by Gerrit Updater [ 14/Jun/23 ] |
|
"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/51148/ |
| Comment by Peter Jones [ 14/Jun/23 ] |
|
Landed for 2.16 |