[LU-15506] conf-sanity test_32b failed with 2: sha1sum verification failed Created: 31/Jan/22  Updated: 18/Apr/22  Resolved: 18/Mar/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.15.0
Fix Version/s: Lustre 2.15.0

Type: Bug Priority: Major
Reporter: Maloo Assignee: Andreas Dilger
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
is related to LU-11643 create disk images for Lustre 2.10 an... Resolved
is related to LU-15751 Interop conf-sanity test_32b: Fails l... In Progress
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ccde588e-154c-43d4-918b-f108a20e7259

test_32b failed for the new disk2_10.ldiskfs and disk2_12.ldiskfs with the following error:

/usr/lib64/lustre/tests
--- /tmp/t32/sha1sums.orig	2022-01-28 21:56:39.118805795 +0000
+++ /tmp/t32/sha1sums	2022-01-28 21:56:39.122806020 +0000
@@ -1,10 +0,0 @@
-59ced6686342e5fdff70a29277632622ad271168  ./init.d/functions
-ff4f8d1bcd9ab4a9edcf77496e23963e5c6f6a2c  ./init.d/lsvcgss
-f8f634b92b75af4112634a6f14464e562cd82454  ./init.d/lustre
-dff7d87de75271f0714c3b82921d40c96598f67a  ./init.d/netconsole
-21414c2b3c89f95d3eab00dafc954d3f6cf3ba9f  ./init.d/network
-f87a11aceaf7dc0e1614ea074fda14d6896ac66f  ./init.d/README
-92624163580750ca250a2c1cc8bd531d0609702a  ./init.d/rhnsd
-a17ecaeb91c0218092c8b01308a132698da9b81f  ./pfl_dir/pfl_file
-da39a3ee5e6b4b0d3255bfef95601890afd80709  ./project_quota_dir/pj_quota_file_old
-2c72448b440f16c9fae18e287ca827c25d29a7cb  ./rc.local
 conf-sanity test_32b: @@@@@@ FAIL: sha1sum verification failed 

This failed

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
conf-sanity test_32b - test_32b failed with 2



 Comments   
Comment by Andreas Dilger [ 01/Feb/22 ]

The init.d directory is a "remote" directory It looks like the problem might be that the mdt2 image is not being mounted when test_32b run itself is using MDSCOUNT=1?

Comment by Andreas Dilger [ 01/Feb/22 ]

I think all of the failures with disk2_10-ldiskfs and disk2_12-ldiskfs are because these images have 2 MDTs and/or 2 OSTs. All of the passing cases are with 1 MDT/1 OST (ldiskfs and zfs). For some reason I don't see results for disk2_5-ldiskfs, but not sure why.

Comment by Gerrit Updater [ 01/Feb/22 ]

"Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/46403
Subject: LU-15506 tests: skip conf-sanity test_32b until fixed
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: caa54a5a64703ab691a4b51c6e6468b35e521d2f

Comment by Gerrit Updater [ 01/Feb/22 ]

"Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/46404
Subject: LU-15506 tests: add better conf-sanity state messages
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: ffc5e39dea1725a04e30a258ec6548ffc28162c9

Comment by Gerrit Updater [ 01/Feb/22 ]

"Andreas Dilger <adilger@whamcloud.com>" merged in patch https://review.whamcloud.com/46403/
Subject: LU-15506 tests: skip conf-sanity test_32b until fixed
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 19e6f6f234101bdc0666bcc8be2c1afb396925e8

Comment by Peter Jones [ 03/Feb/22 ]

Now failing tests off this is no longer a blocker. We will still work on trying to fix the tests in place.

Comment by Andreas Dilger [ 03/Feb/22 ]

I think all of the failures with disk2_10-ldiskfs and disk2_12-ldiskfs are because these images have 2 MDTs and/or 2 OSTs. All of the passing cases are with 1 MDT/1 OST (ldiskfs and zfs).

It looks like the test script is checking the wrong directory when running the list and sha1sum tests, which is why it is failing.

For some reason I don't see results for disk2_5-ldiskfs, but not sure why.

This is because disk2_5-ldiskfs.tar.bz2 is not listed in the lustre/tests/Makefile.am, so it is not included in the lustre-tests RPM installed on autotest clients for testing. This DNE image is also broken in a similar manner (sha1sum of the wrong directory).

Comment by Gerrit Updater [ 04/Feb/22 ]

"Wei Liu <sarah@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/46464
Subject: LU-15506 tests: fix sha1sum error for disk images with multiple mdts
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 9536612cdd0ba5fc0a1acf63ee2608db3c9c1e17

Comment by Gerrit Updater [ 18/Mar/22 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/46404/
Subject: LU-15506 tests: fix sha1sum error for disk images with multiple mdts
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 4be1f7995f06d00362d2845a6ba2c81edd5ffc8a

Comment by Peter Jones [ 18/Mar/22 ]

Landed for 2.15

Generated at Sat Feb 10 03:18:53 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.