[LU-14086] sanity-hsm test 260c fails with 'request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1' Created: 28/Oct/20 Updated: 06/Nov/20 Resolved: 06/Nov/20 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.14.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | James Nunez (Inactive) | Assignee: | WC Triage |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
RHEL8.2 DNE |
||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||
| Description |
|
sanity-hsm test_260c fails with 'request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1' for el8.2 client/server configured with DNE testing. Looking at the suite_log from the failure at https://testing.whamcloud.com/test_sets/d0e6b0ec-1d1c-4dca-8b5b-73d3aacab59e, we see CMD: trevis-6vm7 mkdir -p /tmp/arc1/sanity-hsm.test_260c/
Starting copytool agt1 on trevis-6vm7
CMD: trevis-6vm7 lhsmtool_posix --daemon --hsm-root "/tmp/arc1/sanity-hsm.test_260c/" --archive 2 "/mnt/lustre2" < /dev/null > "/autotest/autotest-1/2020-10-27/lustre-reviews_review-dne-zfs-part-2_77301_1_108_c482d742-cea8-4774-bf62-e9684176b15d/sanity-hsm.test_260c.copytool2_log.trevis-6vm7.log" 2>&1
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 200s for 'SUCCEED'
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 190s for 'SUCCEED'
…
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
CMD: trevis-6vm9 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200001b71:0x2b1:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 0s for 'SUCCEED'
Update not seen after 200s: want 'SUCCEED' got ''
sanity-hsm test_260c: @@@@@@ FAIL: request on 0x200001b71:0x2b1:0x0 is not SUCCEED on mds1
Trace dump:
= /usr/lib64/lustre/tests/test-framework.sh:6254:error()
= /usr/lib64/lustre/tests/test-framework.sh:10392:wait_request_state()
= /usr/lib64/lustre/tests/sanity-hsm.sh:4579:test_260c()
There are no obvious issues found in the console logs. This failure looks like Logs for other failures are at |