Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-9234

replay-single test_70f: checksum doesn't match

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.10.0
    • Lustre 2.10.0, Lustre 2.11.0
    • None
    • onyx-35vm3 thru 6, Interop test,
      RHEL7.3/ldiskfs, branch master, v2.9.54, b3541, 2.9 Lustre,
      Client 2.10 Lustre
    • 3
    • 9223372036854775807

    Description

      https://testing.hpdd.intel.com/test_sessions/1a6bc6e8-0a05-11e7-9053-5254006e85c2

      After unmounting/mounting an OST, client detects a checksum mismatch:

      test_log:

      CMD: onyx-35vm4 umount -d /mnt/lustre-ost1
      CMD: onyx-35vm4 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
      reboot facets: ost1
      Failover ost1 to onyx-35vm4
      03:10:44 (1489572644) waiting for onyx-35vm4 network 900 secs ...
      03:10:44 (1489572644) network interface is UP
      CMD: onyx-35vm4 hostname
      mount facets: ost1
      CMD: onyx-35vm4 test -b /dev/lvm-Role_OSS/P1
      CMD: onyx-35vm4 e2label /dev/lvm-Role_OSS/P1
      Starting ost1:   /dev/lvm-Role_OSS/P1 /mnt/lustre-ost1
      CMD: onyx-35vm4 mkdir -p /mnt/lustre-ost1; mount -t lustre
      

      followed by:

      CMD: onyx-35vm6 md5sum /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-35vm5.onyx.hpdd.intel.com
      onyx-35vm5: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 3 sec
      onyx-35vm6: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 3 sec
       replay-single test_70f: @@@@@@ FAIL: /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-35vm5.onyx.hpdd.intel.com: checksum doesn't match on onyx-35vm6 
        Trace dump:
        = /usr/lib64/lustre/tests/test-framework.sh:4841:error()
        = /usr/lib64/lustre/tests/replay-single.sh:2334:test_70f_write_and_read()
        = /usr/lib64/lustre/tests/replay-single.sh:2350:test_70f_loop()
        = /usr/lib64/lustre/tests/replay-single.sh:2394:test_70f()
        = /usr/lib64/lustre/tests/test-framework.sh:5117:run_one()
        = /usr/lib64/lustre/tests/test-framework.sh:5156:run_one_logged()
        = /usr/lib64/lustre/tests/test-framework.sh:5003:run_test()
        = /usr/lib64/lustre/tests/replay-single.sh:2415:main()
      

      Attachments

        Issue Links

          Activity

            People

              sarah Sarah Liu
              jcasper James Casper (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: