[LU-13156] sanity-selinux test_21a test_21b: c0.sepol not updated on mds1 after 20 secs Created: 20/Jan/20  Updated: 24/Mar/23  Resolved: 14/Apr/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: Lustre 2.14.0

Type: Bug Priority: Minor
Reporter: Maloo Assignee: Sebastien Buisson
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
is related to LU-13133 sanity-selinux test_21a and test_21b:... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/73f9289e-3af0-11ea-b0f4-52540065bddc

test_21b failed with the following error:

c0.sepol not updated on mds1 after 20 secs

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity-selinux test_21a - c0.sepol not updated on mds1 after 20 secs
sanity-selinux test_21b - c0.sepol not updated on mds1 after 20 secs



 Comments   
Comment by Andreas Dilger [ 11/Mar/20 ]

After the landing of LU-13133, test_21a and test_21b are still failing regularly, but with a different error message. This is still happening on the latest master: 58/60 failures in the past 3 days. It looks like both review-dne-selinux and review-dne-selinux-ssk are failing most of the time, though there are one or two passes per day for some reason.

Comment by Andreas Dilger [ 11/Mar/20 ]

It looks like these are the only two subtest failures remaining for sanity-selinux in the past few days, so if they are excluded or fixed then we might finally be able to start enforcing this test session so that it stops getting new regressions.

Comment by Gerrit Updater [ 11/Mar/20 ]

Andreas Dilger (adilger@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/37899
Subject: LU-13156 tests: skip sanity-selinux test_21a+21b
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8351af09c42d6959677c2d15351c50859a615c88

Comment by Sebastien Buisson [ 12/Mar/20 ]

Patch https://review.whamcloud.com/37224 was landed to fix sanity-selinux test_21a and test_21b, so they should not be skipped.

Comment by Andreas Dilger [ 12/Mar/20 ]

The 37224 patch was landed on 2020-02-24, but these subtests are still failing regularly, so there appears to be a different problem (maybe exposed once the configuration issue was resolved?). For example, patch https://review.whamcloud.com/37620 commit 1071a24b57a34dc87e704ac9cc99132700d0d0a5 is based on the current tip of master but still failed both of these tests.

https://testing.whamcloud.com/test_sessions/461de6f9-30fe-4357-8088-489cd8295f29

Comment by Sebastien Buisson [ 12/Mar/20 ]

Oops, my bad. I totally misread this ticket.

You are right, this issue with sanity-selinux test_21a and test_21b is different from the one fixed by patch https://review.whamcloud.com/37224. Before patch https://review.whamcloud.com/37224, the client connection was working without sepol, which was not ok. Now that this patch has landed, we realize that sepol never gets updated, which is the source of the problem.

I think it is ok to skip those 2 tests for now, as sepol is not widely used.

Comment by Gerrit Updater [ 22/Mar/20 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/37899/
Subject: LU-13156 tests: skip sanity-selinux test_21a+21b
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 8c1b62d09484161a40071c0aeecd1f1753d56870

Comment by Gerrit Updater [ 23/Mar/20 ]

Sebastien Buisson (sbuisson@ddn.com) uploaded a new patch: https://review.whamcloud.com/38034
Subject: LU-13156 tests: wait for nodemap update in sanity-selinux
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 164b0381dddbd3a81e614493d30b811f6297348a

Comment by Sebastien Buisson [ 24/Mar/20 ]

Thanks to the testing carried out under https://review.whamcloud.com/37902, we can see that the problem hit with sanity-selinux test_21a and test_21b is caused by the primary MDS being rebooted or restarted by earlier tests launched in recovery-small, such as test_23 or test_141.

Because of this primary MDS reboot/restart, we need to make sure that nodemaps are properly updated on all nodes before using them.
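
For illustration only, the idea of waiting for a nodemap update before using it can be sketched as a generic polling loop. This is not the code from patch 38034: the helper name wait_for_value and its arguments are hypothetical, and the parameter name in the usage comment is an assumption inferred from the error message above.

```shell
#!/bin/bash
# Hypothetical helper (not taken from the Lustre test framework):
# poll a command until its output equals the expected value,
# or give up after max_wait seconds.
#   $1     expected value
#   $2     maximum time to wait, in seconds
#   $3...  command to run on each poll
wait_for_value() {
	local expected="$1"
	local max_wait="$2"
	shift 2
	local elapsed=0
	local actual

	while true; do
		# Run the command; a failing command yields empty output.
		actual=$("$@" 2>/dev/null)
		if [ "$actual" = "$expected" ]; then
			return 0
		fi
		if [ "$elapsed" -ge "$max_wait" ]; then
			echo "value not updated after $max_wait secs: got '$actual'" >&2
			return 1
		fi
		sleep 1
		elapsed=$((elapsed + 1))
	done
}

# Assumed usage, to be run on every server node before the nodemap is used
# (the parameter name nodemap.c0.sepol is an assumption):
#   wait_for_value "$expected_sepol" 20 lctl get_param -n nodemap.c0.sepol
```

The check has to succeed on every node, not only on the node where the value was set, since a restarted MDS may still hold a stale copy of the nodemap.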

Comment by Gerrit Updater [ 14/Apr/20 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/38034/
Subject: LU-13156 tests: wait for nodemap update in sanity-selinux
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: f1761cbe6b1243edd7a69c68c401d7285f7f3b38

Comment by Peter Jones [ 14/Apr/20 ]

Landed for 2.14

Comment by Gerrit Updater [ 24/Mar/23 ]

"Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/50400
Subject: LU-13156 tests: wait for nodemap update in sanity-selinux
Project: fs/lustre-release
Branch: b2_12
Current Patch Set: 1
Commit: dc79dedfea4dbd46f1f9fcf2ac0c1e9f9ff25a6b
