[LU-17927] sanity test_119e: FAIL: trivial unaligned dio failed

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Critical
    • None
    • Lustre 2.16.0
    • None
    • RHEL 9.4 client
    • 3

    Description

      This issue was created by maloo for jianyu <yujian@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/36e444c9-8e5c-48de-a4a2-bb198ffc9fe6

      test_119e failed with the following error:

      == sanity test 119e: Basic tests of dio read and write at various sizes ========================================================== 23:19:26 (1718061566)
      1+0 records in
      1+0 records out
      26214400 bytes (26 MB, 25 MiB) copied, 0.681171 s, 38.5 MB/s
      dd: error reading '/mnt/lustre/f119e.sanity.1': Invalid argument
      0+0 records in
      0+0 records out
      0 bytes copied, 0.00488452 s, 0.0 kB/s
       sanity test_119e: @@@@@@ FAIL: trivial unaligned dio failed
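For context on the "Invalid argument" in the log above: on Linux, direct I/O (O_DIRECT) commonly requires the file offset and transfer size to be multiples of a block-size alignment, and filesystems that do not handle unaligned requests typically reject them with EINVAL. The sketch below only illustrates that alignment check. The function name is_dio_aligned, the 4096-byte alignment, and the sample request sizes other than the 26214400-byte file size are assumptions for illustration, not values taken from the test script.

```python
def is_dio_aligned(offset: int, length: int, alignment: int = 4096) -> bool:
    """Return True if both offset and length are multiples of `alignment`.

    The 4096-byte default is an assumed, illustrative alignment; the real
    requirement depends on the filesystem and device.
    """
    return offset % alignment == 0 and length % alignment == 0


if __name__ == "__main__":
    # 26214400 is the file size from the log; 4096 and 4095 are hypothetical
    # request sizes chosen to show one aligned and one unaligned case.
    for length in (26214400, 4096, 4095):
        state = "aligned" if is_dio_aligned(0, length) else "unaligned"
        print(f"offset=0 length={length}: {state}")
```

A request that this check reports as unaligned is the kind that needs unaligned DIO (UDIO) support to succeed with O_DIRECT.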
      

      Test session details:
      clients: https://build.whamcloud.com/job/lustre-reviews/105356 - 5.14.0-427.16.1.el9_4.x86_64
      servers: https://build.whamcloud.com/job/lustre-reviews/105356 - 5.14.0-362.24.1_lustre.el9.x86_64

      VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
      sanity test_119e - trivial unaligned dio failed

          Activity

            stancheff Shaun Tancheff added a comment -

            This is resolved now; these tests no longer fail now that the UDIO fixes have landed.

            pjones Peter Jones added a comment -

            As per discussion on the LWG call today, moving tickets that do not appear to be essential to fix version 2.17. If the fix lands before code freeze we will update the fix version to reflect that but we want to focus on activities on the critical path. Please speak up if you think that this issue definitely needs to be fixed before we could issue a 2.16 release.

            pjones Peter Jones added a comment -

            According to a comment from Patrick in the above changeset it was rolled into the RHEL 9.4 server support patch which has been merged. So is there anything still remaining to be tracked by this ticket? 

            gerrit Gerrit Updater added a comment -

            "Patrick Farrell <patrick.farrell@oracle.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/55650
            Subject: LU-17927 tests: remove dio always_except for el9.4
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 1c61d064db37b31bb0e1f7409435efc885b7f992

            adilger Andreas Dilger added a comment -

            Depending on which patch lands first, the always_except for test_119 should be removed either by https://review.whamcloud.com/54712 or by one of the UDIO patches.

            stancheff Shaun Tancheff added a comment - - edited

            fwiw:
            https://review.whamcloud.com/c/fs/lustre-release/+/53997
            and
            https://review.whamcloud.com/c/fs/lustre-release/+/52266
            are probably both needed for el9.4; the latter needed an el9.4-specific fix.

            Without the above patches el9.4 crashes; with the patches, all the 119 series tests pass.

            adilger Andreas Dilger added a comment - - edited

            paf, stancheff, this seems like a blocker for UDIO in 2.16.0 if this is failing consistently with el9.4.

            Is this addressed by Shaun's in-flight UDIO fix patch in LU-17525, or is there something new in the el9.4 kernel that is causing problems for UDIO?

            yujian Jian Yu added a comment -

            sanity tests 119f, 119g, and 119h hit a similar failure.

            People

              Assignee: stancheff Shaun Tancheff
              Reporter: maloo Maloo