sanity: interop test failures with master+2.15

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.16.0

    Description

      This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/dc77145c-b7d3-4010-a7a2-f8435f9353ff

      Test session details:
      clients: https://build.whamcloud.com/job/lustre-reviews/101478 - 4.18.0-477.27.1.el8_8.x86_64
      servers: https://build.whamcloud.com/job/lustre-b2_15/81 - 4.18.0-513.9.1.el8_lustre.x86_64

      There are a number of sanity interop test failures with 2.15.4 servers.

      Please review the test failures to determine for each one:

      • when did the failure first start happening?
      • is this a new test added since 2.15.50 was forked from b2_15?
      • should the test be skipped because of an older MDS or OSS version?
      • is this a legitimate regression?
      • if not a clear case of new test with old server, ensure an LU ticket is open for it with details and add to always_except
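
As a rough illustration of the checklist above, the sketch below shows one way a subtest could be gated on the server version, and how subtests with an open ticket could be collected into an exclusion list. The Lustre test scripts have their own helpers for this. Every name here (`version_code_sketch`, `server_at_least`, `add_except`, `MDS_VERSION_SKETCH`, `EXCEPT_SKETCH`) is a hypothetical stand-in, and the 2.15.50 threshold is taken from the fork point mentioned above purely as an example. It assumes plain numeric version strings such as `2.15.4`.

```bash
#!/bin/bash
# Sketch only: these helpers are hypothetical stand-ins, not the Lustre
# test framework's own functions.

# Pack "major.minor.patch[.fix]" into one integer so that versions can be
# compared numerically. Assumes each component is a plain number below 256.
version_code_sketch() {
    local major minor patch fix
    IFS=. read -r major minor patch fix <<< "$1"
    echo $(( (${major:-0} << 24) | (${minor:-0} << 16) | (${patch:-0} << 8) | ${fix:-0} ))
}

# Succeed (exit status 0) when the server version is at least the required one.
server_at_least() {
    local server=$1 required=$2
    (( $(version_code_sketch "$server") >= $(version_code_sketch "$required") ))
}

# Collect subtests to exclude, and remember which ticket tracks each one.
EXCEPT_SKETCH=""
EXCEPT_REASONS=""
add_except() {
    local ticket=$1 subtest
    shift
    for subtest in "$@"; do
        EXCEPT_SKETCH+="${EXCEPT_SKETCH:+ }${subtest}"
        EXCEPT_REASONS+="${EXCEPT_REASONS:+ }${subtest}=${ticket}"
    done
}

# Assumed value, matching the 2.15.4 servers in this test session.
MDS_VERSION_SKETCH="2.15.4"

# A test added after the fork would be skipped against the older server.
if server_at_least "$MDS_VERSION_SKETCH" "2.15.50"; then
    echo "run new subtest"
else
    echo "skip: need MDS >= 2.15.50"
fi

# Old tests that fail for another reason get a ticket and are excluded.
add_except LU-17450 56x 56xa 56xb
echo "excluded: $EXCEPT_SKETCH"
```

The version gate fits the "new test with old server" case, while the exclusion list fits failures that need a tracked ticket before they can be re-enabled.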

          Activity

            pjones Peter Jones added a comment -

            Merged for 2.16

            gerrit Gerrit Updater added a comment -

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/54962/
            Subject: LU-17450 test: disable test 56x 56xa 56xb in sanity
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: bfcab56c5f35ae33c42325326f9141d9c48412b1

            gerrit Gerrit Updater added a comment -

            "Hongchao Zhang <hongchao@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/54962
            Subject: LU-17450 test: disable test 56x 56xa 56xb in sanity
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: d763aa7873e9160f3e406a4066dbc114bd0320d8

            hongchao.zhang Hongchao Zhang added a comment - - edited

            The interop failures of test_56x, test_56xa, and test_56xb in sanity could be fixed by the patch
            https://review.whamcloud.com/53997/ in LU-17525.

            pjones Peter Jones added a comment -

            Hongchao

            Can you please investigate what needs to be done to avoid this test failure?

            Thanks

            Peter


            adilger Andreas Dilger added a comment -

            I've requested patches to add interop checks for the identified source patches, but I'm not sure why test_56x is failing. Those tests look related to lfs_migrate, but more investigation is needed into why they are failing.

            adilger Andreas Dilger added a comment -

            test_56x, test_56xa, test_56xc - not sure (they are old tests)
            test_65p - from LU-16194 lod: define negative extent offset as invalid
            test_70a - from LU-17216 ofd: make enable_health_write tunable, requested interop check there
            test_119h, test_119i, test_398d, test_398o - from LU-13805 clio: bounce buffer for unaligned DIO
            test_123g, test_123h, test_123i - from LU-14361 statahead: Add test for statahead advise

            adilger Andreas Dilger added a comment -

            Make this a blocker for 2.16.0, since we can't release it until we are sure it is not introducing any interop issues, which would be much harder to fix afterward.

            At a minimum, it looks like there is something wrong with file migrate, but I haven't looked at all of the failures in this interop session yet. There is a small chance that one or two subtest failures relate to the patch that was being tested (which is why I requested interop testing in the first place), but many of the failures have been present for weeks and are not isolated to a single subtest.

            People

              hongchao.zhang Hongchao Zhang
              maloo Maloo