sanity.sh test_81b failed with "dd should fail for -ENOSPC, but succeed"

Details

    • Bug
    • Resolution: Duplicate
    • Major
    • None
    • Lustre 2.2.0, Lustre 2.1.1, Lustre 2.1.2
    • None
    • 3
    • 4249

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/d3d44298-114c-11e1-ad46-52540025f9af.
      A similar error was found in ORI-403.

      Attachments

        1. 869.tar.gz
          1.25 MB
        2. LU-869-debug.patch
          0.6 kB

          Activity

            [LU-869] sanity.sh test_81b failed with "dd should fail for -ENOSPC, but succeed"

            adilger Andreas Dilger added a comment -

            Closing as a duplicate of LU-2131.

            egryaznova Elena Gryaznova added a comment -

            This looks like a duplicate of LU-2131.

            yujian Jian Yu added a comment -

            Lustre Tag: v2_1_2_RC2
            Lustre Build: http://build.whamcloud.com/job/lustre-b2_1/87/
            Distro/Arch: RHEL6.2/x86_64(server), SLES11SP1/x86_64(client)
            Network: TCP (1GigE)
            ENABLE_QUOTA=yes

            The same issue occurred: https://maloo.whamcloud.com/test_sets/a3243c4a-ae52-11e1-ae0d-52540035b04c

            sarah Sarah Liu added a comment -

            tag-2.2.54 RHEL6 server/SLES client hit this issue again: https://maloo.whamcloud.com/test_sets/f0c1b98c-ad58-11e1-8152-52540035b04c

            jay Jinshan Xiong (Inactive) added a comment -

            It looks like this issue can only be seen on SUSE clients. I don't know what the difference is between the RHEL and SUSE clients.

            sarah Sarah Liu added a comment -

            Got this issue on 2.2-RC1 test, RHEL6 server/SUSE client:
            https://maloo.whamcloud.com/test_sets/5c52111e-70f6-11e1-a89e-5254004bbbd3

            bobijam Zhenyu Xu added a comment -

            another hit for 2.2 https://maloo.whamcloud.com/test_sets/9d70bce8-6ed7-11e1-b737-5254004bbbd3

            == sanity test 81b: OST should return -ENOSPC when retry still fails ================================= 21:47:12 (1331786832)
            fail_loc=0x228
            write: Input/output error
             sanity test_81b: @@@@@@ FAIL: dd should fail for -ENOSPC, but succeed. 
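
Editorial sketch (not from the thread, and not the code sanity.sh runs): the check behind "dd should fail for -ENOSPC" amounts to writing one block, flushing it, and looking at the error that comes back. The helper names `write_then_sync` and `write_one_block` are hypothetical, and the 4 KiB block size is an assumption chosen for illustration. On Linux the sketch can be exercised against `/dev/full`, which fails every write with ENOSPC.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <stddef.h>
#include <unistd.h>

/*
 * Write len bytes to fd and flush them. Returns 0 on success or a
 * negative errno value (for example -ENOSPC) on the first failure.
 */
static int write_then_sync(int fd, const void *buf, size_t len)
{
    const char *p = buf;

    assert(buf != NULL);
    while (len > 0) {
        ssize_t n = write(fd, p, len);

        if (n < 0) {
            if (errno == EINTR)
                continue;           /* interrupted, retry the write */
            return -errno;
        }
        if (n == 0)
            return -EIO;            /* no progress, avoid looping forever */
        p += n;
        len -= (size_t)n;
    }
    /* Errors from delayed writeback may only surface at sync time. */
    if (fsync(fd) < 0)
        return -errno;
    return 0;
}

/* Open an existing path and push one zero-filled 4 KiB block through it. */
static int write_one_block(const char *path)
{
    char block[4096] = { 0 };
    int fd = open(path, O_WRONLY);
    int rc;

    if (fd < 0)
        return -errno;
    rc = write_then_sync(fd, block, sizeof(block));
    close(fd);
    return rc;
}
```

A caller in the spirit of the test would treat any return value other than -ENOSPC as a failure, including 0 (the write succeeded) and -EIO (a different error was reported).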

            niu Niu Yawei (Inactive) added a comment -

            Hi Andreas, did your tests pass or not?

            I ran the tests locally and they all passed. I think the EIO error messages are expected: these tests try to create a file after exhausting the precreated objects, and that should return an EIO error from the server (see qos_prep_create()->alloc_idx_array(): when we fail to create any stripe, -EIO is returned to the client).

            adilger Andreas Dilger added a comment -

            Niu, could you please run this test locally to try to debug it? I'm repeatedly seeing IO errors in sanity.sh test_27[opqv] during file object creation on a RHEL6 kernel.

            niu Niu Yawei (Inactive) added a comment -

            Thanks, Sarah. The log shows that AS_ENOSPC was set on the proper i_mapping, and I can't tell from the kernel code why it is not translated to -ENOSPC in wait_on_page_writeback_range().
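
Editorial sketch (not from the thread): the translation mentioned in the comment above can be modelled in userspace as follows. It assumes the behaviour of 2.6.32-era kernels, where a writeback failure is recorded as a flag bit on the mapping and the wait path later clears that bit and turns it into an errno. Every `model_*` name is a hypothetical stand-in, not a kernel symbol, and this is an illustration of the mechanism rather than a diagnosis of the bug.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins for the kernel's AS_EIO / AS_ENOSPC flag bits. */
enum { MODEL_AS_EIO = 0, MODEL_AS_ENOSPC = 1 };

/* Hypothetical, much-reduced stand-in for struct address_space. */
struct model_mapping {
    unsigned long flags;
};

/* Record a writeback error on the mapping: -ENOSPC gets its own bit,
 * every other error is folded into the EIO bit. */
static void model_set_error(struct model_mapping *m, int err)
{
    assert(m != NULL);
    if (err == 0)
        return;
    if (err == -ENOSPC)
        m->flags |= 1UL << MODEL_AS_ENOSPC;
    else
        m->flags |= 1UL << MODEL_AS_EIO;
}

/* Non-atomic model of test_and_clear_bit(): report and clear one bit. */
static bool model_test_and_clear(struct model_mapping *m, int bit)
{
    unsigned long mask = 1UL << bit;
    bool was_set = (m->flags & mask) != 0;

    m->flags &= ~mask;
    return was_set;
}

/* Convert pending error bits into an errno. The bits are consumed by
 * the check, and EIO overrides ENOSPC when both are pending. */
static int model_wait_check_errors(struct model_mapping *m)
{
    int ret = 0;

    if (model_test_and_clear(m, MODEL_AS_ENOSPC))
        ret = -ENOSPC;
    if (model_test_and_clear(m, MODEL_AS_EIO))
        ret = -EIO;
    return ret;
}
```

Two properties of this model are worth keeping in mind when reading the logs: the error bits are cleared by whichever caller checks them first, so a later check on the same mapping sees no error, and a pending EIO masks a pending ENOSPC.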

            People

              Assignee: niu Niu Yawei (Inactive)
              Reporter: maloo Maloo
              Votes: 0
              Watchers: 10
