[LU-16807] Resolve newer debug kernel warnings Created: 09/May/23  Updated: 03/Aug/23

Status: Reopened
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.16.0
Fix Version/s: Lustre 2.16.0

Type: Bug Priority: Minor
Reporter: James A Simmons Assignee: James A Simmons
Resolution: Unresolved Votes: 0
Labels: None
Environment:

Newer distro running a debug kernel.


Issue Links:
Related
is related to LU-15808 ptlrpc lockdep warning "do not call b... Open
is related to LU-16783 DEBUG_LOCKS_WARN_ON(!lockdep_enabled(... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Currently we use RHEL7 debug kernels for collecting potential kernel issues. This has two limitations. One is that for Lustre 2.17 we will be dropping RHEL7 support and second RHEL7 kernels has unique issues that are not present in newer kernels. The reason for the delay to using a newer debug kernel like RHEL8 is that it reports far more problems that needs to be resolved before we move to this target for general testing.



 Comments   
Comment by Gerrit Updater [ 09/May/23 ]

"James Simmons <jsimmons@infradead.org>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/50897
Subject: LU-16807 ksocklnd: remove LASSERT that could cause a deadlock
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: e281814e47f8d6d7ebcd742f84b14709440058f0

Comment by James A Simmons [ 09/May/23 ]

First of many patches

Comment by Gerrit Updater [ 15/May/23 ]

"Neil Brown <neilb@suse.de>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/50992
Subject: LU-16807 libcfs: give the tcd_lock types different classes.
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8c5c54d267d2e4ad61814cbadd1f627b8a9d51a5

Comment by Gerrit Updater [ 16/May/23 ]

"James Simmons <jsimmons@infradead.org>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51028
Subject: LU-16807 ksocklnd: ksocklnd_ni_get_eth_intf_speed() missing rcu lock
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 5898e6320ed8167e288e0ebf2122863fee12687a

Comment by James A Simmons [ 24/May/23 ]

More patches will be coming

Comment by Gerrit Updater [ 31/May/23 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/50992/
Subject: LU-16807 libcfs: give the tcd_lock types different classes.
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 7393c5a57876309863aebc7f87eba966ebbec320

Comment by Gerrit Updater [ 31/May/23 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/51028/
Subject: LU-16807 ksocklnd: ksocklnd_ni_get_eth_intf_speed() must use only rtnl lock
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 65d664bff40590366aa44ea6f4fc32be2a8aae37

Comment by Peter Jones [ 31/May/23 ]

Landed for 2.16

Comment by James A Simmons [ 01/Jun/23 ]

Several more patches are needed. 

Comment by Gerrit Updater [ 01/Jun/23 ]

"James Simmons <jsimmons@infradead.org>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51192
Subject: LU-16807 llite: make lsm_sem nested for ll_update_dir_depth
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 9de949e168d61c549b78ca0077af481feedea24b

Comment by Gerrit Updater [ 20/Jun/23 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/51192/
Subject: LU-16807 llite: make lsm_sem nested for ll_update_dir_depth
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: fe44198b7fb710fce0f3daf669952c11ea0f4189

Comment by James A Simmons [ 03/Aug/23 ]

ZFS lockdep issue is here: https://github.com/openzfs/zfs/issues/15111

Generated at Sat Feb 10 03:30:11 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.