Details
-
Bug
-
Resolution: Duplicate
-
Minor
-
None
-
Lustre 2.14.0
-
None
-
RHEL8.2 DNE
-
3
-
9223372036854775807
Description
sanity-hsm test_260c fails with 'request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1' for el8.2 client/server configured with DNE testing. Looking at the suite_log from the failure at https://testing.whamcloud.com/test_sets/d0e6b0ec-1d1c-4dca-8b5b-73d3aacab59e, we see
CMD: trevis-6vm7 mkdir -p /tmp/arc1/sanity-hsm.test_260c/ Starting copytool agt1 on trevis-6vm7 CMD: trevis-6vm7 lhsmtool_posix --daemon --hsm-root "/tmp/arc1/sanity-hsm.test_260c/" --archive 2 "/mnt/lustre2" < /dev/null > "/autotest/autotest-1/2020-10-27/lustre-reviews_review-dne-zfs-part-2_77301_1_108_c482d742-cea8-4774-bf62-e9684176b15d/sanity-hsm.test_260c.copytool2_log.trevis-6vm7.log" 2>&1 CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= Waiting 200s for 'SUCCEED' CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= Waiting 190s for 'SUCCEED' … CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d= Waiting 0s for 'SUCCEED' Update not seen after 200s: want 'SUCCEED' got '' sanity-hsm test_260c: @@@@@@ FAIL: request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1 Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:6254:error() = /usr/lib64/lustre/tests/test-framework.sh:10392:wait_request_state() = /usr/lib64/lustre/tests/sanity-hsm.sh:4579:test_260c()
There are no obvious issues found in the console logs.
This failure looks like LU-11709.
Logs for other failures are at
https://testing.whamcloud.com/test_sets/39cda0dc-b495-4af9-b0ca-757042d6fd3a
https://testing.whamcloud.com/test_sets/97837d06-6c88-4167-8c22-2b01a1fb6832
Attachments
Issue Links
- duplicates
-
LU-13543 hsm.actions file is broken on RHEL 8.2
- Resolved