[LU-13514] conf-sanity test_32a: Timeout occurred after 143 mins Created: 04/May/20 Updated: 07/Dec/23 Resolved: 29/Jan/22 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.14.0, Lustre 2.12.6 |
| Fix Version/s: | Lustre 2.12.6, Lustre 2.15.0 |
| Type: | Bug | Priority: | Major |
| Reporter: | Maloo | Assignee: | Yang Sheng |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||||||||||
| Severity: | 3 | ||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
This issue was created by maloo for Chris Horn <hornc@cray.com> This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/4960ea6e-2914-4b3d-a77d-e0e5a0a4c9a6 test_32a failed with the following error: Timeout occurred after 143 mins, last suite running was conf-sanity VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV |
| Comments |
| Comment by Chris Horn [ 05/May/20 ] |
|
+1 on master https://testing.whamcloud.com/test_sets/c0c4e2b5-ac3b-4739-b5fe-e317575a774a |
| Comment by Sebastien Buisson [ 14/May/20 ] |
|
I got multiple occurrences of this problem after rebase of my patches on top of master branch, eg: |
| Comment by Chris Horn [ 14/May/20 ] |
|
+1 on master https://testing.whamcloud.com/test_sessions/cc02ea29-8465-4511-ac15-7690c947d11e |
| Comment by Sebastien Buisson [ 14/May/20 ] |
|
It seems that no recent testing managed to pass conf-sanity because of this problem. |
| Comment by Gerrit Updater [ 15/May/20 ] |
|
Sebastien Buisson (sbuisson@ddn.com) uploaded a new patch: https://review.whamcloud.com/38615 |
| Comment by Gerrit Updater [ 15/May/20 ] |
|
Sebastien Buisson (sbuisson@ddn.com) uploaded a new patch: https://review.whamcloud.com/38616 |
| Comment by Gerrit Updater [ 15/May/20 ] |
|
Sebastien Buisson (sbuisson@ddn.com) uploaded a new patch: https://review.whamcloud.com/38617 |
| Comment by Sebastien Buisson [ 15/May/20 ] |
|
The patches above show that conf-sanity test_32a fails with tip of master branch (results of https://review.whamcloud.com/38615), and passes when we revert commit 6b979daaff " So I think commit 6b979daaff " |
| Comment by Andreas Dilger [ 15/May/20 ] |
|
Sebastien, do you know what is broken in the tests, and why/how that patch passed testing before it landed? Is it just because the testing is now slower and timing out, or is there a code defect (hang)? |
| Comment by Sebastien Buisson [ 15/May/20 ] |
|
Well, commit 6b979daaff " The explanation I see for the test failure in master now is that patch https://review.whamcloud.com/35049 (6b979daaff " |
| Comment by Andreas Dilger [ 19/May/20 ] |
|
I see that conf-sanity test_32a is still failing with this same error even for a patch based on the latest master commit v2_13_53-165-gebaf3b1b9980 " |
| Comment by Arshad Hussain [ 30/May/20 ] |
|
+1 on Master: https://testing.whamcloud.com/sub_tests/e3575182-057a-4057-965e-fd0c293e939b |
| Comment by Emoly Liu [ 01/Jun/20 ] |
|
more on master: |
| Comment by Chris Horn [ 01/Jun/20 ] |
|
+1 on master: https://testing.whamcloud.com/test_sessions/807df025-8fd7-45d1-8961-b7fbaf84cdc2 |
| Comment by Sebastien Buisson [ 02/Jun/20 ] |
|
It seems that almost no patch managed to pass conf-sanity test_32a in the last couple of days. |
| Comment by Andreas Dilger [ 17/Jun/20 ] |
|
Looking at the test results, it seems that review-dne-part-3 (ldiskfs) is the only session that is timing out, and never review-dne-zfs-part-3, so the failure must be related to one of the ldiskfs test images. |
| Comment by Gerrit Updater [ 19/Jun/20 ] |
|
James Nunez (jnunez@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39108 |
| Comment by Gerrit Updater [ 19/Jun/20 ] |
|
James Nunez (jnunez@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39109 |
| Comment by Gerrit Updater [ 10/Jul/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/39109/ |
| Comment by Gerrit Updater [ 30/Oct/20 ] |
|
Yang Sheng (ys@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/40492 |
| Comment by Gerrit Updater [ 07/Nov/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/40492/ |
| Comment by Peter Jones [ 07/Nov/20 ] |
|
Is turning off this test a solution or the equivalent of adding something to the always accept list? |
| Comment by James Nunez (Inactive) [ 16/Nov/20 ] |
|
For master, future 2.14.0, for the past 4 weeks, the only conf-sanity test 32a hangs/timeouts are for interop testing: For the b2_12 branch, we see this test hang in non-interop testing: |
| Comment by Gerrit Updater [ 20/Nov/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/40537/ |
| Comment by Andreas Dilger [ 02/Mar/21 ] |
|
Yang Sheng, is patch https://review.whamcloud.com/40537 " |
| Comment by Gerrit Updater [ 28/Jan/22 ] |
|
"Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/46354 |
| Comment by Gerrit Updater [ 29/Jan/22 ] |
|
"Andreas Dilger <adilger@whamcloud.com>" merged in patch https://review.whamcloud.com/46354/ |