Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-9305

Running File System Aging create write checksum errors

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • Lustre 2.10.0, Lustre 2.11.0
    • None
    • 1
    • 9223372036854775807

    Description

      My most recent re-production of this was:
      ZFS based on 0.7.0 RC4 fs/zfs:coral-rc1-combined
      Lustre tagged release 2.9.57(but 2.9.58 fails as well)
      Centos 7.3 3.10.0-514.16.1.el7.x86_64

      I have personally verified this fails on Lustre 2.8, 2.9 and latest tagged release, zfs 0.6.5-current ZOL Master and the most recent Centos 7.1, 7.2, and 7.3 kernels.

      This may well be a Lustre issue I need to try to reproduce on raidz, with out large RPCs, etc.

      On both the clients and OSS nodes we see checksum errors while the file aging test is running such as:
      [ 9354.968454] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x254:0x0] object 0x0:292 extent [117440512-125698047]: client csum de357896, server csum 5cd77893

      [ 9394.315856] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x28c:0x0] object 0x0:320 extent [67108864-82968575]: client csum df6bd34a, server csum 8480d352
      [ 9404.371609] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x298:0x0] object 0x0:326 extent [67108864-74448895]: client csum 2ced4ec0, server csum 1f814ec4

      Attachments

        1. debug_info.20170406_143409_48420_wolf-3.wolf.hpdd.intel.com.tgz
          3.45 MB
        2. wolf-6_client.tgz
          5.67 MB
        3. BasicLibs.py
          6 kB
        4. FileAger-wolf6.py
          6 kB
        5. FileAger-wolf7.py
          6 kB
        6. FileAger-wolf8.py
          6 kB
        7. FileAger-wolf9.py
          6 kB
        8. debug_vmalloc.patch
          22 kB
        9. debug_vmalloc_lustre.patch
          6 kB
        10. debug_vmalloc_spl.patch
          14 kB
        11. Linux_x64_Memory_Address_Mapping.pdf
          224 kB

        Issue Links

          Activity

            People

              bzzz Alex Zhuravlev
              jsalians_intel John Salinas (Inactive)
              Votes:
              1 Vote for this issue
              Watchers:
              23 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: