Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-13365

spin_lock in after_reply() eat up most of cpu

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • Lustre 2.14.0
    • Lustre 2.14.0
    • None
    • master
    • 3
    • 9223372036854775807

    Description

      There are two clients, but one of client was 25% slower write than other client. According to an framegraph on CPU time of each client, client spent amount of CPU time with spin_lock at after_reply(). However, fast client didn't show that cpu time on after_reply().
      The workload is simple 1MB, FPP with IOR. (mpirun -np 16 ior -t 1m -b 16g -e -F -vv -o /fast/file -w)
      Here is two client's node information and performance results.

      fast client (2 x Platinum 8160 CPU, 192GB memory, 1 x IB-EDR)
      Max Write: 11219.97 MiB/sec (11765.00 MB/sec)
      slow client (1 x Gold 5218 CPU, 96GB memory, 1 x IB-HDR100)
      Max Write: 9278.14 MiB/sec (9728.84 MB/sec)
      

      Attachments

        Activity

          People

            wshilong Wang Shilong (Inactive)
            sihara Shuichi Ihara
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: