LU-9305: Running File System Aging creates write checksum errors

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Lustre 2.10.0, Lustre 2.11.0
    • Severity: 1

      Description

      My most recent reproduction of this was on:
      ZFS based on 0.7.0 RC4 fs/zfs:coral-rc1-combined
      Lustre tagged release 2.9.57 (2.9.58 fails as well)
      CentOS 7.3, kernel 3.10.0-514.16.1.el7.x86_64

      I have personally verified that this fails on Lustre 2.8, 2.9, and the latest tagged release; on ZFS from 0.6.5 to current ZOL master; and on the most recent CentOS 7.1, 7.2, and 7.3 kernels.

      This may well be a Lustre issue. I still need to try to reproduce it on raidz, without large RPCs, etc.

      While the file aging test is running, we see checksum errors on both the clients and the OSS nodes, such as:
      [ 9354.968454] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x254:0x0] object 0x0:292 extent [117440512-125698047]: client csum de357896, server csum 5cd77893

      [ 9394.315856] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x28c:0x0] object 0x0:320 extent [67108864-82968575]: client csum df6bd34a, server csum 8480d352
      [ 9404.371609] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x298:0x0] object 0x0:326 extent [67108864-74448895]: client csum 2ced4ec0, server csum 1f814ec4
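When triaging many of these messages, it can help to pull the fields out of each line and compare extents and checksums side by side. The following is a minimal sketch, not part of Lustre or of the attached scripts: the regular expression is modeled only on the three lines quoted above, the function name `parse_bad_write_checksum` is hypothetical, and the extent is assumed to be an inclusive byte range.

```python
import re

# Pattern modeled on the log lines quoted in this ticket (assumption:
# other Lustre versions may format the message differently).
BAD_CSUM_RE = re.compile(
    r"BAD WRITE CHECKSUM: (?P<target>\S+) from (?P<client>\S+) "
    r"inode \[(?P<fid>[^\]]+)\] object (?P<object>\S+) "
    r"extent \[(?P<start>\d+)-(?P<end>\d+)\]: "
    r"client csum (?P<client_csum>[0-9a-f]+), "
    r"server csum (?P<server_csum>[0-9a-f]+)"
)


def parse_bad_write_checksum(line):
    """Return a dict of fields from one log line, or None if it does not match."""
    m = BAD_CSUM_RE.search(line)
    if m is None:
        return None
    start = int(m.group("start"))
    end = int(m.group("end"))
    return {
        "target": m.group("target"),
        "client": m.group("client"),
        "fid": m.group("fid"),
        "object": m.group("object"),
        "start": start,
        "end": end,
        # Assumes the extent is an inclusive byte range [start, end].
        "length": end - start + 1,
        "client_csum": int(m.group("client_csum"), 16),
        "server_csum": int(m.group("server_csum"), 16),
    }
```

Feeding each line of a console log through `parse_bad_write_checksum` and skipping the `None` results gives one record per error, which can then be grouped by `target`, `client`, or `fid` to see whether the failures cluster on particular objects or offsets.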

        Attachments

        1. BasicLibs.py (6 kB)
        2. debug_info.20170406_143409_48420_wolf-3.wolf.hpdd.intel.com.tgz (3.45 MB)
        3. debug_vmalloc_lustre.patch (6 kB)
        4. debug_vmalloc_spl.patch (14 kB)
        5. debug_vmalloc.patch (22 kB)
        6. FileAger-wolf6.py (6 kB)
        7. FileAger-wolf7.py (6 kB)
        8. FileAger-wolf8.py (6 kB)
        9. FileAger-wolf9.py (6 kB)
        10. Linux_x64_Memory_Address_Mapping.pdf (224 kB)
        11. wolf-6_client.tgz (5.67 MB)

              People

              • Assignee: bzzz Alex Zhuravlev
              • Reporter: jsalians_intel John Salinas (Inactive)
              • Votes: 1
              • Watchers: 23