Details
-
Bug
-
Resolution: Fixed
-
Critical
-
Lustre 2.10.0, Lustre 2.11.0
-
None
-
onyx-35vm3 thru 6, Interop test,
RHEL7.3/ldiskfs, branch master, v2.9.54, b3541, 2.9 Lustre,
Client 2.10 Lustre
-
3
-
9223372036854775807
Description
https://testing.hpdd.intel.com/test_sessions/1a6bc6e8-0a05-11e7-9053-5254006e85c2
After unmounting/mounting an OST, client detects a checksum mismatch:
test_log:
CMD: onyx-35vm4 umount -d /mnt/lustre-ost1 CMD: onyx-35vm4 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST ' reboot facets: ost1 Failover ost1 to onyx-35vm4 03:10:44 (1489572644) waiting for onyx-35vm4 network 900 secs ... 03:10:44 (1489572644) network interface is UP CMD: onyx-35vm4 hostname mount facets: ost1 CMD: onyx-35vm4 test -b /dev/lvm-Role_OSS/P1 CMD: onyx-35vm4 e2label /dev/lvm-Role_OSS/P1 Starting ost1: /dev/lvm-Role_OSS/P1 /mnt/lustre-ost1 CMD: onyx-35vm4 mkdir -p /mnt/lustre-ost1; mount -t lustre
followed by:
CMD: onyx-35vm6 md5sum /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-35vm5.onyx.hpdd.intel.com onyx-35vm5: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 3 sec onyx-35vm6: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 3 sec replay-single test_70f: @@@@@@ FAIL: /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-35vm5.onyx.hpdd.intel.com: checksum doesn't match on onyx-35vm6 Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:4841:error() = /usr/lib64/lustre/tests/replay-single.sh:2334:test_70f_write_and_read() = /usr/lib64/lustre/tests/replay-single.sh:2350:test_70f_loop() = /usr/lib64/lustre/tests/replay-single.sh:2394:test_70f() = /usr/lib64/lustre/tests/test-framework.sh:5117:run_one() = /usr/lib64/lustre/tests/test-framework.sh:5156:run_one_logged() = /usr/lib64/lustre/tests/test-framework.sh:5003:run_test() = /usr/lib64/lustre/tests/replay-single.sh:2415:main()