[LU-14086] sanity-hsm test 260c fails with 'request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1' Created: 28/Oct/20  Updated: 06/Nov/20  Resolved: 06/Nov/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.14.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Nunez (Inactive) Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None
Environment:

RHEL8.2 DNE


Issue Links:
Duplicate
duplicates LU-13543 hsm.actions file is broken on RHEL 8.2 Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

sanity-hsm test_260c fails with 'request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1' for el8.2 client/server configured with DNE testing. Looking at the suite_log from the failure at https://testing.whamcloud.com/test_sets/d0e6b0ec-1d1c-4dca-8b5b-73d3aacab59e, we see

CMD: trevis-6vm7 mkdir -p /tmp/arc1/sanity-hsm.test_260c/
Starting copytool agt1 on trevis-6vm7
CMD: trevis-6vm7 lhsmtool_posix  --daemon --hsm-root "/tmp/arc1/sanity-hsm.test_260c/" --archive 2 "/mnt/lustre2" < /dev/null > "/autotest/autotest-1/2020-10-27/lustre-reviews_review-dne-zfs-part-2_77301_1_108_c482d742-cea8-4774-bf62-e9684176b15d/sanity-hsm.test_260c.copytool2_log.trevis-6vm7.log" 2>&1
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 200s for 'SUCCEED'
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 190s for 'SUCCEED'
…
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 0s for 'SUCCEED'
Update not seen after 200s: want 'SUCCEED' got ''
 sanity-hsm test_260c: @@@@@@ FAIL: request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:6254:error()
  = /usr/lib64/lustre/tests/test-framework.sh:10392:wait_request_state()
  = /usr/lib64/lustre/tests/sanity-hsm.sh:4579:test_260c()

There are no obvious issues found in the console logs.

This failure looks like LU-11709.

Logs for other failures are at
https://testing.whamcloud.com/test_sets/39cda0dc-b495-4af9-b0ca-757042d6fd3a
https://testing.whamcloud.com/test_sets/97837d06-6c88-4167-8c22-2b01a1fb6832


Generated at Sat Feb 10 03:06:43 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.