Details
-
Bug
-
Resolution: Unresolved
-
Minor
-
None
-
Lustre 2.13.0, Lustre 2.14.0, Lustre 2.12.4
-
PPC clients
-
3
-
9223372036854775807
Description
Looking at a recent failure, https://testing.whamcloud.com/test_sets/52073ba0-4715-11ea-b69a-52540065bddc, sanity-pfl test 16a fails with the following in the client test_log
== sanity-pfl test 16a: Verify setstripe/getstripe with YAML config file ============================= 05:25:02 (1580793902) CMD: trevis-4vm12 dumpe2fs -h /dev/mapper/mds1_flakey 2>&1 | grep -E -q '(ea_inode|large_xattr)' 1. PFL file getstripe --yaml /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl setstripe --yaml=/mnt/lustre/d16a.sanity-pfl/template /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl.copy Set stripe size 4096 failed: Invalid argument lfs setstripe: cannot build layout from YAML file /mnt/lustre/d16a.sanity-pfl/template. error: setstripe: can't create composite layout from template file /mnt/lustre/d16a.sanity-pfl/template sanity-pfl test_16a: @@@@@@ FAIL: setstripe /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl.copy failed Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:6121:error() = /usr/lib64/lustre/tests/test-framework.sh:9710:verify_yaml_layout() = /usr/lib64/lustre/tests/sanity-pfl.sh:761:test_16a()
The only hint of a problem in the console logs is for client 2 (vm9) where we see
[ 1212.789487] LustreError: 1758:0:(pack_generic.c:2447:lustre_swab_lov_comp_md_v1()) Invalid magic 0x1
We’ve seen this same error message in sanity-pfl test 14 failures in LU-13186.
Patch https://review.whamcloud.com/#/c/28425/ for LU-9846 landed to Lustre 2.12.54 on 01 JUNE 2019 and was not back ported to b2_12. This patch moved test_16 to test_16a and created a new test 16b.
sanity-pfl test 16a started failing with this error message on 30 JULY 2019 with Lustre 2.12.56.72 and test 16 started failing for b2_12 on 13 AUG 2019 for Lustre 2.12.2.115.
sanity-pfl test 16a fails only for PPC clients and fails 100% of the time for PPC.
Logs for recent failures are at
https://testing.whamcloud.com/test_sets/fd407770-4706-11ea-a1c8-52540065bddc
https://testing.whamcloud.com/test_sets/07588ab8-2592-11ea-80b4-52540065bddc
https://testing.whamcloud.com/test_sets/a2a89ace-1fdb-11ea-adca-52540065bddc
Attachments
Issue Links
- is duplicated by
-
LU-13207 sanity-pfl test 16b crashes in “Oops: Kernel access of bad area”
- Open
- is related to
-
LU-13215 sanity-pfl test 17 hangs with “incorrect message magic”
- Open
-
LU-13186 sanity-pfl test 14 fails with '/mnt/lustre/d14.sanity-pfl/f14.sanity-pfl: component 4 doesn't have poolname pool2'
- Reopened
- is related to
-
LU-10100 sanity test_27a: setstripe failed with "error on ioctl 0x8008669a for '*' (3): Invalid argument"
- Resolved
- mentioned in
-
Page Loading...