[LU-10428] LNet events should generated without resource lock held Created: 22/Dec/17  Updated: 27/Mar/22  Resolved: 07/Sep/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.14.0

Type: Improvement Priority: Major
Reporter: Alexey Lyashkov Assignee: Alexey Lyashkov
Resolution: Fixed Votes: 0
Labels: patch
Environment:

any lustre


Issue Links:
Related
is related to LU-14425 BUG: unable to handle kernel paging r... Resolved
Rank (Obsolete): 9223372036854775807

 Description   

LNet execute an events with resource lock held, but event handler can allocate an memory, so it will block other callers to lockup over hash table or parallel event execution.
I will send a patch son.



 Comments   
Comment by Cory Spitz [ 09/Aug/18 ]

For some reason, a comment wasn't added here when https://review.whamcloud.com/#/c/30657 was uploaded for this issue.

Comment by Alexey Lyashkov [ 16/Jul/19 ]

It looks not a improvement now.

http://testing.linuxhacker.ru:3333/lustre-reports/1276/testresults/runtests-ssk-ldiskfs-SSK-centos7_x86_64-centos7_x86_64/oleg78-client-console.txt

Comment by Gerrit Updater [ 20/Dec/19 ]

Neil Brown (neilb@suse.de) uploaded a new patch: https://review.whamcloud.com/37068
Subject: LU-10428 lnet: call event handlers without res_lock
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 706e000cd325fb47f2e4b77e3f3afd17fc8b62b8

Comment by Gerrit Updater [ 01/Sep/20 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/37068/
Subject: LU-10428 lnet: call event handlers without res_lock
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: d05427a7856e8f89cf6ec47f2731e12c6fa22901

Comment by James A Simmons [ 07/Sep/20 ]

Neil's patch has landed. If it hasn't resolved the problem feel free to reopen.

Generated at Sat Feb 10 02:35:00 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.