Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-13205

sanity-pfl test 16a fails with “setstripe /mnt/lustre/d16.sanity-pfl/f16.sanity-pfl.copy failed“

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.13.0, Lustre 2.14.0, Lustre 2.12.4
    • PPC clients
    • 3
    • 9223372036854775807

    Description

      Looking at a recent failure, https://testing.whamcloud.com/test_sets/52073ba0-4715-11ea-b69a-52540065bddc, sanity-pfl test 16a fails with the following in the client test_log

      == sanity-pfl test 16a: Verify setstripe/getstripe with YAML config file ============================= 05:25:02 (1580793902)
      CMD: trevis-4vm12 dumpe2fs -h /dev/mapper/mds1_flakey 2>&1 |
      		grep -E -q '(ea_inode|large_xattr)'
      1. PFL file
      getstripe --yaml /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl
      setstripe --yaml=/mnt/lustre/d16a.sanity-pfl/template /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl.copy
      Set stripe size 4096 failed: Invalid argument
      lfs setstripe: cannot build layout from YAML file /mnt/lustre/d16a.sanity-pfl/template.
      error: setstripe: can't create composite layout from template file /mnt/lustre/d16a.sanity-pfl/template
       sanity-pfl test_16a: @@@@@@ FAIL: setstripe /mnt/lustre/d16a.sanity-pfl/f16a.sanity-pfl.copy failed 
        Trace dump:
        = /usr/lib64/lustre/tests/test-framework.sh:6121:error()
        = /usr/lib64/lustre/tests/test-framework.sh:9710:verify_yaml_layout()
        = /usr/lib64/lustre/tests/sanity-pfl.sh:761:test_16a()
      

      The only hint of a problem in the console logs is for client 2 (vm9) where we see

      [ 1212.789487] LustreError: 1758:0:(pack_generic.c:2447:lustre_swab_lov_comp_md_v1()) Invalid magic 0x1
      

      We’ve seen this same error message in sanity-pfl test 14 failures in LU-13186.

      Patch https://review.whamcloud.com/#/c/28425/ for LU-9846 landed to Lustre 2.12.54 on 01 JUNE 2019 and was not back ported to b2_12. This patch moved test_16 to test_16a and created a new test 16b.

      sanity-pfl test 16a started failing with this error message on 30 JULY 2019 with Lustre 2.12.56.72 and test 16 started failing for b2_12 on 13 AUG 2019 for Lustre 2.12.2.115.

      sanity-pfl test 16a fails only for PPC clients and fails 100% of the time for PPC.

      Logs for recent failures are at
      https://testing.whamcloud.com/test_sets/fd407770-4706-11ea-a1c8-52540065bddc
      https://testing.whamcloud.com/test_sets/07588ab8-2592-11ea-80b4-52540065bddc
      https://testing.whamcloud.com/test_sets/a2a89ace-1fdb-11ea-adca-52540065bddc

      Attachments

        Issue Links

          Activity

            People

              wc-triage WC Triage
              jamesanunez James Nunez (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: