Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • Lustre 2.10.1
    • None
    • Lustre 2.10.1 RHEL 7.4
    • 3
    • 9223372036854775807

    Description

      While doing some bench marking with FIO is seems libaio is not working with Lustre.

      FIO can be found here: https://github.com/axboe/fio/

      When running with libaio it quickly fails with error messages. A sample FIO configuration file is below:

      fsync=1
      directory=/mnt/lustre 
      direct=1
      numjobs=32
      ioengine=libaio
      blocksize=64k
      readwrite=readwrite
      size=65536m
      group_reporting
      
      

      Sample error messages:

      .....
      lustre: Laying out IO file(s) (1 file(s) / 65536MB)
      fio: io_u error on file /mnt/lustre/lustre.21.0: Unknown error -131072: read offset=0, buflen=65536
      fio: io_u error on file /mnt/lustre/lustre.21.0: Unknown error -131072: read offset=65536, buflen=65536
      fio: pid=51260, err=-131072/file:io_u.c:1708, func=io_u error, error=Unknown error -131072
      ...
      
      

      Removing libaio options allows FIO to run.  

      Fsync and Direct really slow Lustre down but they run. 

      Other tests  work as expected. 

      Attachments

        Issue Links

          Activity

            [LU-10367] FIO Fails to run with libaio

            The problem is still present in 2.10 LTS. The provided patch seems to fix it.

            Could be interesting for 2.10.7?

            degremoa Aurelien Degremont (Inactive) added a comment - The problem is still present in 2.10 LTS. The provided patch seems to fix it. Could be interesting for 2.10.7?
            pjones Peter Jones added a comment -

            ok - thanks

            pjones Peter Jones added a comment - ok - thanks

            The patch isn't needed after LU-11825

            askulysh Andriy Skulysh added a comment - The patch isn't needed after LU-11825

            Now that the PIO code has been removed is this a problem still?

            simmonsja James A Simmons added a comment - Now that the PIO code has been removed is this a problem still?

            Andriy Skulysh (c17819@cray.com) uploaded a new patch: https://review.whamcloud.com/33716
            Subject: LU-10367 llite: aio read returns doubled bytes count
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 823b81a1b911ce92612e5953ca4d3135bd01b2a7

            gerrit Gerrit Updater added a comment - Andriy Skulysh (c17819@cray.com) uploaded a new patch: https://review.whamcloud.com/33716 Subject: LU-10367 llite: aio read returns doubled bytes count Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 823b81a1b911ce92612e5953ca4d3135bd01b2a7

            Hello,

            I'm having the same problem with 2.10.3.

            Async IO simply won't work (libaio), directIO are much slower than what we get with 2.7 (same hardware).

            Any idea where this could come from please?

            Thank you,

            tfb-ddn Thomas Favre-Bulle (Inactive) added a comment - Hello, I'm having the same problem with 2.10.3. Async IO simply won't work (libaio), directIO are much slower than what we get with 2.7 (same hardware). Any idea where this could come from please? Thank you,
            ieelusername Homer Li (Inactive) added a comment - - edited

            Yes, I have the same problem.

            It 's lustre 2.10.2 +  zfs 0.7.3

            ieelusername Homer Li (Inactive) added a comment - - edited Yes, I have the same problem. It 's lustre 2.10.2 +  zfs 0.7.3

            People

              askulysh Andriy Skulysh
              keith Keith Mannthey (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: