[LU-9321] replay-single test_70f: dd oflag=direct bs=1M . . . failed on onyx-42vm1.onyx.hpdd.intel.com, rc=1 Created: 11/Apr/17  Updated: 03/Dec/21  Resolved: 30/Aug/21

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.0
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: James Casper
Assignee: WC Triage
Resolution: Cannot Reproduce
Votes: 0
Labels: None
Environment: onyx-42, Failover, RHEL7.3, ZFS, master branch, v2.10.55, b3550


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.hpdd.intel.com/test_sessions/b8a0b1a6-9492-444d-aa73-c8f99e8ae0cd

From test_log:

 replay-single test_70f: @@@@@@ FAIL: dd oflag=direct bs=1M count=10 if=/tmp/f70f.replay-single  of=/mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-42vm1.onyx.hpdd.intel.com failed on onyx-42vm1.onyx.hpdd.intel.com, rc=1 
onyx-42vm1: osc.lustre-OST0000-osc-*.ost_server_uuid in FULL state after 28 sec
onyx-42vm5: osc.lustre-OST0002-osc-*.ost_server_uuid in FULL state after 0 sec
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:4893:error()
  = /usr/lib64/lustre/tests/replay-single.sh:2317:test_70f_write_and_read()
  = /usr/lib64/lustre/tests/replay-single.sh:2350:test_70f_loop()
  = /usr/lib64/lustre/tests/replay-single.sh:2394:test_70f()
  = /usr/lib64/lustre/tests/test-framework.sh:5169:run_one()
  = /usr/lib64/lustre/tests/test-framework.sh:5208:run_one_logged()
  = /usr/lib64/lustre/tests/test-framework.sh:5055:run_test()
  = /usr/lib64/lustre/tests/replay-single.sh:2415:main()
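
To check whether the write fails outside the test harness, the dd invocation from the log can be re-run by hand. The following is a minimal sketch, not code from replay-single.sh: the helper name direct_write_check is hypothetical, and it assumes a mounted Lustre client and an existing source file. It reuses the dd options shown in the failure message and reports the exit code.

```shell
# Hypothetical helper (not part of the Lustre test framework):
# re-run the direct-I/O write from the failure message and report dd's rc.
direct_write_check() {
    src=$1          # input file, e.g. /tmp/f70f.replay-single
    dst=$2          # output file on the Lustre mount
    count=${3:-10}  # number of 1 MiB blocks; the failing test used 10

    dd oflag=direct bs=1M count="$count" if="$src" of="$dst"
    rc=$?

    # Print the exit code in the same "rc=N" form the test log uses.
    echo "dd rc=$rc"
    return "$rc"
}
```

Usage, with the paths taken from the log above: direct_write_check /tmp/f70f.replay-single /mnt/lustre/d70f.replay-single/f70f.replay-single.onyx-42vm1.onyx.hpdd.intel.com. A nonzero rc here, with no failover in progress, would suggest the problem is not specific to the recovery path the test exercises.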


 Comments   
Comment by James Nunez (Inactive) [ 30/Aug/21 ]

We stopped seeing this failure for seven months, but it has started again.

I'm going to close this ticket and open a new one with current information.

Generated at Sat Feb 10 02:25:09 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.