
LU-9234: replay-single test_70f: checksum doesn't match

Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Critical
    • Fix Version/s: Lustre 2.10.0
    • Affects Version/s: Lustre 2.10.0, Lustre 2.11.0
    • Labels: None
    • Environment: onyx-35vm3 through onyx-35vm6, interop test,
      RHEL7.3/ldiskfs, branch master, v2.9.54, b3541, 2.9 Lustre servers,
      2.10 Lustre clients
    • Severity: 3

    Description

      https://testing.hpdd.intel.com/test_sessions/1a6bc6e8-0a05-11e7-9053-5254006e85c2

      After unmounting and remounting an OST, the client detects a checksum mismatch:

      test_log:

      CMD: onyx-35vm4 umount -d /mnt/lustre-ost1
      CMD: onyx-35vm4 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
      reboot facets: ost1
      Failover ost1 to onyx-35vm4
      03:10:44 (1489572644) waiting for onyx-35vm4 network 900 secs ...
      03:10:44 (1489572644) network interface is UP
      CMD: onyx-35vm4 hostname
      mount facets: ost1
      CMD: onyx-35vm4 test -b /dev/lvm-Role_OSS/P1
      CMD: onyx-35vm4 e2label /dev/lvm-Role_OSS/P1
      Starting ost1:   /dev/lvm-Role_OSS/P1 /mnt/lustre-ost1
      CMD: onyx-35vm4 mkdir -p /mnt/lustre-ost1; mount -t lustre
      

      followed by:

      CMD: onyx-35vm6 md5sum /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-35vm5.onyx.hpdd.intel.com
      onyx-35vm5: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 3 sec
      onyx-35vm6: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 3 sec
       replay-single test_70f: @@@@@@ FAIL: /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-35vm5.onyx.hpdd.intel.com: checksum doesn't match on onyx-35vm6 
        Trace dump:
        = /usr/lib64/lustre/tests/test-framework.sh:4841:error()
        = /usr/lib64/lustre/tests/replay-single.sh:2334:test_70f_write_and_read()
        = /usr/lib64/lustre/tests/replay-single.sh:2350:test_70f_loop()
        = /usr/lib64/lustre/tests/replay-single.sh:2394:test_70f()
        = /usr/lib64/lustre/tests/test-framework.sh:5117:run_one()
        = /usr/lib64/lustre/tests/test-framework.sh:5156:run_one_logged()
        = /usr/lib64/lustre/tests/test-framework.sh:5003:run_test()
        = /usr/lib64/lustre/tests/replay-single.sh:2415:main()
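
      For context, test_70f is a simple write/fail-over/read round trip. A simplified sketch of the pattern (the real loop lives in replay-single.sh; fail and error are test-framework.sh helpers, and the real test writes on one client and verifies on another):

        # Simplified sketch of the test_70f write-and-read pattern; assumes
        # test-framework.sh is sourced so that fail/error are defined.
        tfile=/mnt/lustre/d70f.replay-single/f70f.replay-single.$(hostname)
        dd if=/dev/urandom of=$tfile bs=1M count=1     # write test data
        sum_before=$(md5sum $tfile | cut -d' ' -f1)    # record checksum
        fail ost1                                      # umount/remount the OST, wait for recovery
        sum_after=$(md5sum $tfile | cut -d' ' -f1)     # re-read after replay
        [ "$sum_before" = "$sum_after" ] ||
            error "$tfile: checksum doesn't match"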
      

    Activity


            jamesanunez James Nunez (Inactive) added a comment -

            There seems to still be an issue with checksums that shows up in replay-single test 87a, but I'll open a new ticket for that issue.
            jcasper James Casper (Inactive) added a comment -

            Seen again in 2.10.51 (b3620): https://testing.hpdd.intel.com/test_sessions/c4874eda-04c9-40c5-9e92-b8e7574bd5fe
            pjones Peter Jones added a comment -

            Landed for 2.10

            gerrit Gerrit Updater added a comment -

            Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/26739/
            Subject: LU-9234 test: Skip test_70f if OSS version is older than 2.9.53
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 00db1ffe72bc1f4504adfaba539a1ec4f0fde74b

            gerrit Gerrit Updater added a comment -

            Wei Liu (wei3.liu@intel.com) uploaded a new patch: https://review.whamcloud.com/26739
            Subject: LU-9234 test: Skip test_70f if OSS version is older than 2.9.53
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 70d77276b26a9a92621bc4aef4f3f04cb6da310f
            bobijam Zhenyu Xu added a comment -

            replay-single test 70f only works with OSS server versions after 2.9.52.60 (the issue is LU-1573, fixed by patch #16680, git commit 1d2fbade1b658db4386091e7938d9483f7aa4a05), so 2.8/2.9 servers do not contain this fix.

            As for the master server failure case, I checked the Maloo report; its OSS version is 2.9.52.54.gc6f5e81, which does not have this fix either.
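
            The landed patch (https://review.whamcloud.com/26739) gates the test on the OSS version. A minimal sketch of such a guard, assuming the standard test-framework.sh helpers lustre_version_code, version_code and skip (the exact code in the patch may differ):

                # Skip test_70f on OSS versions that predate the LU-1573 fix.
                # lustre_version_code, version_code and skip come from test-framework.sh.
                [ $(lustre_version_code ost1) -lt $(version_code 2.9.53) ] &&
                        skip "Need OSS version at least 2.9.53" && return 0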
            pjones Peter Jones added a comment -

            Bobijam

            Could you please advise on this one?

            Thanks

            Peter
            jamesanunez James Nunez (Inactive) added a comment - edited

            In the console for the OSTs, we see:

            03:11:42:[18142.619725] Lustre: lustre-OST0000: Will be in recovery for at least 1:00, or until 3 clients reconnect
            03:11:42:[18144.679276] LustreError: 32511:0:(ofd_grant.c:686:ofd_grant_check()) lustre-OST0000: cli df47be9e-3378-d13b-7555-c154ed48e9ba is replaying OST_WRITE while one rnb hasn't OBD_BRW_FROM_GRANT set (0x108)
            03:11:42:[18144.711883] LustreError: 168-f: BAD WRITE CHECKSUM: lustre-OST0000 from 12345-10.2.4.142@tcp inode [0x20001a213:0xb88d:0x0] object 0x0:9508 extent [0-1048575]: client csum 32a1611e, server csum 66771e72
            03:11:42:[18144.873216] Lustre: lustre-OST0000: Recovery over after 0:03, of 3 clients 3 recovered and 0 were evicted.
            
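            The 168-f line means the OST recomputed the checksum of the replayed OST_WRITE and got a value different from the one the client sent with the original request. The client-side wire-checksum settings involved can be inspected with the standard lctl parameters, for example:

                # Query wire-checksum settings on a Lustre client (output varies by setup):
                lctl get_param osc.*.checksums       # 1 = wire checksums enabled
                lctl get_param osc.*.checksum_type   # algorithm in use, e.g. adler or crc32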

            Looking through Maloo, replay-single test 70f started failing at the beginning of February 2017 and has failed 60 times for the full test group since then. All failures I've seen are during interop testing.

            Some early failure logs are at:
            2017-02-03 - (upstream client + master servers) https://testing.hpdd.intel.com/test_sets/766efa26-eaf4-11e6-af25-5254006e85c2
            2017-02-04 - (master clients + b2_8 servers) https://testing.hpdd.intel.com/test_sets/2122fcde-eba8-11e6-848c-5254006e85c2
            2017-02-04 - (master clients + b2_9 servers) https://testing.hpdd.intel.com/test_sets/3e6136de-eba9-11e6-9bb9-5254006e85c2
            2017-02-07 - (master clients + b2_8 servers) https://testing.hpdd.intel.com/test_sets/deae1082-ee27-11e6-bbfe-5254006e85c2
            2017-02-07 - (master clients + b2_8 servers) https://testing.hpdd.intel.com/test_sets/bf86630e-edf9-11e6-8f6d-5254006e85c2


            People

              Assignee: sarah Sarah Liu
              Reporter: jcasper James Casper (Inactive)
              Votes: 0
              Watchers: 5
