[LU-11889] sanity test 156 fails on ZFS: roc_hit not safe to use Created: 26/Jan/19  Updated: 21/Mar/19

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.7, Lustre 2.12.1
Fix Version/s: None

Type: Bug Priority: Trivial
Reporter: Alex Zhuravlev Assignee: Alex Zhuravlev
Resolution: Unresolved Votes: 0
Labels: None
Environment:

zfs


Issue Links:
Related
is related to LU-2261 Add cache stats to zfs osd Resolved
is related to LU-11607 Reduce repeated function calls in Lus... Resolved
is related to LU-5375 Failure on test suite sanity test_151... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

because osd-zfs doesn't maintain cache hit/miss stats, there is just no regular API for this at the moment.



 Comments   
Comment by Gerrit Updater [ 26/Jan/19 ]

Alex Zhuravlev (bzzz@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/34114
Subject: LU-11889 tests: disable sanity/156 for ZFS OST
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: e138216b361324b2fb4da7384b07f6f086d29649

Comment by Andreas Dilger [ 27/Jan/19 ]

Are there examples of this failing in autotest?

Comment by Alex Zhuravlev [ 27/Jan/19 ]

yes, I found number of failures like this:
https://testing.whamcloud.com/test_sets/f337b3f4-20cb-11e9-af70-52540065bddc

Comment by Andreas Dilger [ 27/Jan/19 ]

It looks like this bug originated in LU-2261, which enabled sanity test_156 for ZFS when brw_stats was enabled for ZFS, even though the cache_hits tunable is not actually being tracked for ZFS. The error went unnoticed because there was a syntax error in the "skip" logic until patch "LU-11607 tests: create routine to get Lustre env" landed, which properly enabled the test for ZFS and proceeded to cause all the ZFS tests to fail.

Comment by Gerrit Updater [ 27/Jan/19 ]

Andreas Dilger (adilger@whamcloud.com) merged in patch https://review.whamcloud.com/34114/
Subject: LU-11889 tests: disable sanity/156 for ZFS OST
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 42c4dab3c817f9f03efe457fd33e946ed68fab14

Comment by Andreas Dilger [ 27/Jan/19 ]

Alex, is it possible to easily implement the cache_hits tracking for ZFS?

Comment by Alex Zhuravlev [ 04/Feb/19 ]

adilger I checked ZFS sources and it looks that there is no easy way to get stats for Lustre-related dbufs - all stats ZFS collects are global. We could pre-lookup dbufs and check its status, but that would be +1 lookup for every dbuf which isn't free. not sure we want to pay that price.

Generated at Sat Feb 10 02:47:49 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.