[LU-16020] sanity test_244a: FAIL: sendfile+grouplock failed Created: 15/Jul/22  Updated: 22/Nov/22  Resolved: 21/Oct/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.15.1
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: rhel9, ubuntu22
Environment:

RHEL 9.0 client


Issue Links:
Related
is related to LU-15781 Ubuntu 22.04 LTS release support Open
is related to LU-15874 RHEL 9.0 support Resolved
is related to LU-16258 Explicitly support .splice_write Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for jianyu <yujian@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/3a8eb542-e2e6-4223-8a43-3f37b87a4636

test_244a failed with the following error:

== sanity test 244a: sendfile with group lock tests ====== 20:30:20 (1657917020)
35+0 records in
35+0 records out
36700160 bytes (37 MB, 35 MiB) copied, 0.149954 s, 245 MB/s
Starting test test10 at 1657917021
sendfile_grouplock: sendfile_grouplock.c:259: sendfile_copy: assertion 'sret > 0' failed: sendfile failed: Invalid argument
 sanity test_244a: @@@@@@ FAIL: sendfile+grouplock failed

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_244a - sendfile+grouplock failed



 Comments   
Comment by Andreas Dilger [ 26/Sep/22 ]

This is mostly failing on Ubuntu 22.04, but there was also a failure on RHEL 9.0, so it looks like a "new kernel" issue related to sendfile rather than a distro-specific one.

Comment by Peter Jones [ 13/Oct/22 ]

Could we add this test to ALWAYS_EXCEPT while we're working on a fix, and then enforce testing on Ubuntu 22.04 immediately to stop any other regressions creeping in?

Comment by Andreas Dilger [ 13/Oct/22 ]

Yes, that is definitely a way forward, and will at least avoid new regressions. We just need to remember to actually fix the problem and remove the subtest from ALWAYS_EXCEPT, as we normally do with similar issues...

I think the check needs to be for the kernel version, and not the distro, since the same issue was also hit on el9.
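
A minimal sketch of what a kernel-version-based exclusion could look like, as opposed to a distro check. The helper names kver_code and kernel_needs_except and the variable MIN_BROKEN_KERNEL are hypothetical and are not taken from the Lustre test framework. The 5.14.0 threshold is only an assumption for illustration (it is the RHEL 9.0 base kernel); the first affected kernel version is not stated in this ticket.

```shell
#!/bin/bash
# Sketch only: kver_code, kernel_needs_except and MIN_BROKEN_KERNEL are
# illustrative names, not part of the Lustre test framework.

# Convert "major.minor.patch[-extra]" into one comparable integer.
kver_code() {
	local ver=${1%%-*}	# drop suffixes such as "-25-generic"
	local major minor patch rest
	IFS=. read -r major minor patch rest <<< "$ver"
	# keep only the leading digits of each field
	major=${major%%[!0-9]*}
	minor=${minor%%[!0-9]*}
	patch=${patch%%[!0-9]*}
	# 10# forces base 10 so that a field like "08" is not read as octal
	echo $(( 10#${major:-0} * 1000000 + 10#${minor:-0} * 1000 + 10#${patch:-0} ))
}

# ASSUMPTION: 5.14.0 is a placeholder threshold, not a value from this ticket.
MIN_BROKEN_KERNEL=${MIN_BROKEN_KERNEL:-5.14.0}

# Succeed if the given release (default: running kernel) is at or above
# the threshold.
kernel_needs_except() {
	local release=${1:-$(uname -r)}
	(( $(kver_code "$release") >= $(kver_code "$MIN_BROKEN_KERNEL") ))
}

if kernel_needs_except; then
	# bug number: LU-16020
	ALWAYS_EXCEPT+=" 244a"
fi
```

Keying the check on the running kernel's release string means a single condition would cover both Ubuntu 22.04 and el9 clients, and it would not need updating for other distros that ship a similarly new kernel.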

Comment by Gerrit Updater [ 18/Oct/22 ]

"Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/48912
Subject: LU-16020 tests: add sanity/244a to always_except
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8790cba5ee0facefd24a64862cd8619f432912c8

Comment by Andreas Dilger [ 21/Oct/22 ]

It looks like patch https://review.whamcloud.com/48928 "LU-16258 llite: Explicitly support .splice_write" has fixed this problem for Ubuntu 22.04, since that patch and the one based on top of it have both passed:

https://testing.whamcloud.com/sub_tests/46663bdd-3ad0-4af3-8147-5057ac38898a
https://testing.whamcloud.com/sub_tests/5cda616c-c670-4721-9eb4-0b03b733aa94

No other Ubuntu 22.04 test runs have passed in the past 4 weeks.

Comment by Andreas Dilger [ 21/Oct/22 ]

Fixed by LU-16258.
