[LU-14200] ost-pools test 23b fails with 'dd did not fail with ENOSPC' Created: 08/Dec/20  Updated: 11/Dec/20  Resolved: 11/Dec/20

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.14.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Nunez (Inactive) Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: rhel8.3
Environment:

RHEL8.3 servers/clients


Issue Links:
Duplicate
duplicates LU-14205 RHEL8.3: sanity test 398c fails with ... Resolved
Related
is related to LU-14202 sanity-sec test 16 fails with 'unable... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

ost-pools test_23b fails with RHEL 8.3 servers and clients.

Looking at the suite_log for the failure at https://testing.whamcloud.com/test_sets/8445d81a-2e35-493c-bcf7-311524a97aa5, this is the first time we’ve seen ost-pools test_23b fail with

[4 iteration] dd: closing output file '/mnt/lustre/d23b.ost-pools/dir/f23b.ost-pools-quota4': Input/output error
total written: 20971520
stime=1607360253, etime=1607360637, elapsed=384
 ost-pools test_23b: @@@@@@ FAIL: dd did not fail with ENOSPC 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:6257:error()
  = /usr/lib64/lustre/tests/ost-pools.sh:1360:test_23b()
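
The figures in the log excerpt above can be cross-checked with a little arithmetic. This is only a sketch: the variable names are illustrative and are not taken from ost-pools.sh. It confirms the reported elapsed time and converts the "total written" byte count to MiB.

```python
# Values copied from the suite_log excerpt.
stime = 1607360253         # start time, seconds since the epoch
etime = 1607360637         # end time, seconds since the epoch
total_written = 20971520   # bytes reported after "total written"

elapsed = etime - stime                        # should match "elapsed=384"
written_mib = total_written / (1024 * 1024)    # bytes -> MiB

print(f"elapsed={elapsed}s, written={written_mib:.0f} MiB")
```

So dd wrote exactly 20 MiB over 384 seconds before the close failed.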

We have seen this test fail with the same error message (see LU-10396) and with “write error ... Input/output error”, but this is the first time we have seen dd fail with “close ... Input/output error”. Judging by the value after "total written", dd did fail, just not with “No space left on device” as the test requires.
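
The distinction between the two failure modes can be sketched as below. This is a hypothetical illustration, not the actual check in ost-pools.sh: the function name `classify_dd_error` and the substring matching are assumptions. The two message strings are the ones quoted in this ticket.

```python
# Error messages as they appear in dd's output, keyed by errno name.
# (Assumption: simple substring matching is enough for illustration.)
MESSAGES = {
    "ENOSPC": "No space left on device",
    "EIO": "Input/output error",
}

def classify_dd_error(stderr_text):
    """Return the errno name whose message appears in stderr_text, or None."""
    for name, message in MESSAGES.items():
        if message in stderr_text:
            return name
    return None

# The line dd printed in the failing run.
observed = ("dd: closing output file "
            "'/mnt/lustre/d23b.ost-pools/dir/f23b.ost-pools-quota4': "
            "Input/output error")

result = classify_dd_error(observed)
print(result)  # the test wants ENOSPC; this run produced EIO
```

Under these assumptions the observed line classifies as EIO, which is why a check looking only for the ENOSPC message reports that dd "did not fail with ENOSPC" even though dd did fail.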

In the client1 (vm) dmesg log, we see

[ 5322.937853] Lustre: DEBUG MARKER: lctl get_param -n lov.lustre-*.pools.testpool | sort -u | tr '\n' ' ' 
[ 5342.709668] Lustre: lustre-OST0001-osc-ffff913ecdbb6000: disconnect after 20s idle
[ 5665.004748] LustreError: 7828:0:(osc_request.c:1947:osc_brw_fini_request()) lustre-OST0000-osc-ffff913ecdbb6000: unexpected positive size 1
[ 5713.926869] Lustre: DEBUG MARKER: /usr/sbin/lctl mark  ost-pools test_23b: @@@@@@ FAIL: dd did not fail with ENOSPC 
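
To relate the LustreError entry to the test failure, the dmesg excerpt above can be parsed for timestamps. This is a rough sketch: the regular expression and the helper name `parse_dmesg` are assumptions made for illustration, and they only handle lines shaped like the ones shown here.

```python
import re

# "[ seconds.micros] Tag: message" as seen in the excerpt (assumed format).
DMESG_LINE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(\w+):\s+(.*)$")

def parse_dmesg(text):
    """Return a list of (timestamp, tag, message) tuples for matching lines."""
    entries = []
    for line in text.splitlines():
        match = DMESG_LINE.match(line.strip())
        if match:
            entries.append((float(match.group(1)), match.group(2), match.group(3)))
    return entries

log = r"""[ 5322.937853] Lustre: DEBUG MARKER: lctl get_param -n lov.lustre-*.pools.testpool | sort -u | tr '\n' ' '
[ 5342.709668] Lustre: lustre-OST0001-osc-ffff913ecdbb6000: disconnect after 20s idle
[ 5665.004748] LustreError: 7828:0:(osc_request.c:1947:osc_brw_fini_request()) lustre-OST0000-osc-ffff913ecdbb6000: unexpected positive size 1
[ 5713.926869] Lustre: DEBUG MARKER: /usr/sbin/lctl mark  ost-pools test_23b: @@@@@@ FAIL: dd did not fail with ENOSPC
"""

entries = parse_dmesg(log)
errors = [e for e in entries if e[1] == "LustreError"]
fail_marker = next(e for e in entries if "FAIL" in e[2])

# Seconds between the osc_brw_fini_request() error and the FAIL marker.
gap = fail_marker[0] - errors[0][0]
print(f"{len(errors)} LustreError entry, {gap:.3f}s before the FAIL marker")
```

The only LustreError in the excerpt is the "unexpected positive size 1" message from osc_brw_fini_request(), logged roughly 49 seconds before the failure marker.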


 Comments   
Comment by James Nunez (Inactive) [ 10/Dec/20 ]

It looks like we are seeing the same error in sanity-quota test 9 in RHEL8.3 client/server testing:
https://testing.whamcloud.com/test_sets/79e432c6-29f8-4449-a43f-372fffb66c4d
https://testing.whamcloud.com/test_sets/7bd7e9d6-c722-41f4-8e4d-9619692b5da4

We also see sanity-flr tests 204e and 204f fail in a similar way; see https://testing.whamcloud.com/test_sets/fbde9358-e264-4548-95fb-236490a7135b

Generated at Sat Feb 10 03:07:42 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.