[LU-14623] sanity test_210: FAIL: multiop failed / still running Created: 20/Apr/21  Updated: 10/Oct/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Related
is related to LU-13693 lfs getstripe should avoid opening re... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Artem Blagodarenko <artem.blagodarenko@hpe.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/b84b44cf-e922-41a9-8743-c0ce927351cb

test_210 failed with the following error:
== sanity test 210: lfs getstripe does not break leases ============================================== 20:16:43 (1618863403)
error: getstripe failed for /mnt/lustre/f210.sanity.
/usr/lib64/lustre/tests/sanity.sh: line 17423: 441020 User defined signal 1 $MULTIOP $DIR/$tfile oO_CREAT:O_RDWR:eW_E+eUc
sanity test_210: @@@@@@ FAIL: multiop failed
Trace dump:
= /usr/lib64/lustre/tests/test-framework.sh:6273:error()
= /usr/lib64/lustre/tests/sanity.sh:17423:test_210()
= /usr/lib64/lustre/tests/test-framework.sh:6576:run_one()
= /usr/lib64/lustre/tests/test-framework.sh:6623:run_one_logged()
= /usr/lib64/lustre/tests/test-framework.sh:6465:run_test()
= /usr/lib64/lustre/tests/sanity.sh:17433:main()
Dumping lctl log to /autotest/autotest-2/2021-04-19/lustre-reviews_review-ldiskfs-ubuntu_80230_1_17_af660c11-2f1e-40bd-9f1f-e440cab47d96/sanity.test_210.*.1618863406.log
CMD: trevis-6vm6.trevis.whamcloud.com,trevis-6vm7,trevis-6vm8,trevis-6vm9 /usr/sbin/lctl dk > /autotest/autotest-2/2021-04-19/lustre-reviews_review-ldiskfs-ubuntu_80230_1_17_af660c11-2f1e-40bd-9f1f-e440cab47d96/sanity.test_210.debug_log.\$(hostname -s).1618863406.log;
dmesg > /autotest/autotest-2/2021-04-19/lustre-reviews_review-ldiskfs-ubuntu_80230_1_17_af660c11-2f1e-40bd-9f1f-e440cab47d96/sanity.test_210.dmesg.\$(hostname -s).1618863406.log
Resetting fail_loc on all nodes...CMD: trevis-6vm6.trevis.whamcloud.com,trevis-6vm7,trevis-6vm8,trevis-6vm9 lctl set_param -n fail_loc=0 fail_val=0 2>/dev/null
done.
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_210 - multiop failed



 Comments   
Comment by Andreas Dilger [ 04/Dec/21 ]

This subtest is also failing intermittently with "multiop still running", since 2020-06-22 (several times) when the test was first added in patch https://review.whamcloud.com/39139 "LU-13693 lfs: avoid opening regular files for getstripe", so I suspect the test is just not very robust.

For 2021-11 there were 5 failures in 2764 runs, so the actual failure rate is very low v

Generated at Sat Feb 10 03:11:21 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.