Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-3522

sanity-benchmark test_iozone: "no space left on device" on ZFS

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.10.0
    • Lustre 2.4.0, Lustre 2.4.1, Lustre 2.5.0, Lustre 2.8.0
    • None
    • server and client : lustre-master build# 1536 zfs
    • 3
    • 8858

    Description

      This issue was created by maloo for sarah <sarah@whamcloud.com>

      This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/903643c8-ddca-11e2-85a3-52540035b04c.

      The sub-test test_iozone failed with the following error:

      iozone (2) failed

      test log shows:

      Error writing block 7269, fd= 3
      write: No space left on device
      
      iozone: interrupted
      
      exiting iozone
      
       sanity-benchmark test_iozone: @@@@@@ FAIL: iozone (2) failed 
      

      Attachments

        Issue Links

          Activity

            [LU-3522] sanity-benchmark test_iozone: "no space left on device" on ZFS
            yujian Jian Yu added a comment -

            Lustre build: http://build.whamcloud.com/job/lustre-b2_4/47/
            Distro/Arch: RHEL6.4/x86_64
            FSTYPE=zfs

            The same failure occurred:
            https://maloo.whamcloud.com/test_sets/5d2a22d4-43a9-11e3-942a-52540035b04c

            yujian Jian Yu added a comment - Lustre build: http://build.whamcloud.com/job/lustre-b2_4/47/ Distro/Arch: RHEL6.4/x86_64 FSTYPE=zfs The same failure occurred: https://maloo.whamcloud.com/test_sets/5d2a22d4-43a9-11e3-942a-52540035b04c
            yujian Jian Yu added a comment -

            Lustre build: http://build.whamcloud.com/job/lustre-b2_4/47/
            Distro/Arch: RHEL6.4/x86_64(server), SLES11SP2/x86_64(client)
            FSTYPE=ldiskfs

            The same failure occurred:
            https://maloo.whamcloud.com/test_sets/e5731cd8-432f-11e3-8676-52540035b04c

            yujian Jian Yu added a comment - Lustre build: http://build.whamcloud.com/job/lustre-b2_4/47/ Distro/Arch: RHEL6.4/x86_64(server), SLES11SP2/x86_64(client) FSTYPE=ldiskfs The same failure occurred: https://maloo.whamcloud.com/test_sets/e5731cd8-432f-11e3-8676-52540035b04c
            pjones Peter Jones added a comment -

            Lai

            Are you able to assist with this one?

            Peter

            pjones Peter Jones added a comment - Lai Are you able to assist with this one? Peter
            utopiabound Nathaniel Clark added a comment - - edited

            The current state of this bug seems to be: Behaving as expected, wait for OBD_CONNECT_GRANT_PARAM to be supported on the client (LU-2049).

            utopiabound Nathaniel Clark added a comment - - edited The current state of this bug seems to be: Behaving as expected, wait for OBD_CONNECT_GRANT_PARAM to be supported on the client ( LU-2049 ).
            yujian Jian Yu added a comment -

            Lustre build: http://build.whamcloud.com/job/lustre-b2_4/45/ (2.4.1 RC2)
            Distro/Arch: RHEL6.4/x86_64
            FSTYPE=ldiskfs

            sanity-benchmark test iozone also hit the same failure:
            https://maloo.whamcloud.com/test_sets/497a00b4-182b-11e3-b39a-52540035b04c

            yujian Jian Yu added a comment - Lustre build: http://build.whamcloud.com/job/lustre-b2_4/45/ (2.4.1 RC2) Distro/Arch: RHEL6.4/x86_64 FSTYPE=ldiskfs sanity-benchmark test iozone also hit the same failure: https://maloo.whamcloud.com/test_sets/497a00b4-182b-11e3-b39a-52540035b04c
            sarah Sarah Liu added a comment -

            lustre-master build #1652 also hit this issue on ldiskfs:

            https://maloo.whamcloud.com/test_sets/c30260ee-15b4-11e3-8938-52540035b04c

            sarah Sarah Liu added a comment - lustre-master build #1652 also hit this issue on ldiskfs: https://maloo.whamcloud.com/test_sets/c30260ee-15b4-11e3-8938-52540035b04c
            yujian Jian Yu added a comment -

            Lustre build: http://build.whamcloud.com/job/lustre-b2_4/44/ (2.4.1 RC1)
            Distro/Arch: RHEL6.4/x86_64
            FSTYPE=zfs

            sanity-benchmark test iozone also hit the same failure:
            https://maloo.whamcloud.com/test_sets/e004601a-1556-11e3-8938-52540035b04c

            yujian Jian Yu added a comment - Lustre build: http://build.whamcloud.com/job/lustre-b2_4/44/ (2.4.1 RC1) Distro/Arch: RHEL6.4/x86_64 FSTYPE=zfs sanity-benchmark test iozone also hit the same failure: https://maloo.whamcloud.com/test_sets/e004601a-1556-11e3-8938-52540035b04c
            yujian Jian Yu added a comment -

            Lustre build: http://build.whamcloud.com/job/lustre-b2_4/44/ (2.4.1 RC1)
            Distro/Arch: RHEL6.4/x86_64 + FC18/x86_64 (Server + Client)
            FSTYPE=ldiskfs

            sanity-benchmark test iozone hit the same failure:
            https://maloo.whamcloud.com/test_sets/becb9218-14ef-11e3-ac48-52540035b04c

            yujian Jian Yu added a comment - Lustre build: http://build.whamcloud.com/job/lustre-b2_4/44/ (2.4.1 RC1) Distro/Arch: RHEL6.4/x86_64 + FC18/x86_64 (Server + Client) FSTYPE=ldiskfs sanity-benchmark test iozone hit the same failure: https://maloo.whamcloud.com/test_sets/becb9218-14ef-11e3-ac48-52540035b04c

            I put a brief description of what needs to be done into LU-2049. I thought that Jinshan and Johann had worked up a more detailed design for this, but I can't find it.

            adilger Andreas Dilger added a comment - I put a brief description of what needs to be done into LU-2049 . I thought that Jinshan and Johann had worked up a more detailed design for this, but I can't find it.
            utopiabound Nathaniel Clark added a comment - http://review.whamcloud.com/7402

            The difference between ldiskfs and zfs seems to be in the calculated grant request space, namely ofd_grant_from_cli()'s conversion factor for ldiskfs is 1, whereas for zfs it's 32. This is because the ofd_grant_compat() assumes that if the ofd_blockbits (set from statfs which gives the largest available block size in zfs) is greater than 4KB (12 bits) that it should assume the worst case, and shift out by that amount (which in this case is 5 bits).

            utopiabound Nathaniel Clark added a comment - The difference between ldiskfs and zfs seems to be in the calculated grant request space, namely ofd_grant_from_cli()'s conversion factor for ldiskfs is 1, whereas for zfs it's 32. This is because the ofd_grant_compat() assumes that if the ofd_blockbits (set from statfs which gives the largest available block size in zfs) is greater than 4KB (12 bits) that it should assume the worst case, and shift out by that amount (which in this case is 5 bits).

            People

              utopiabound Nathaniel Clark
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: