[LU-16496] sanity-hsm test_103a: Fail to archive files 0/8
Created: 20/Jan/23  Updated: 19/Dec/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Unresolved
Votes: 0
Labels: None

Issue Links:
Related
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for eaujames <eaujames@ddn.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/07bba847-cd03-4c22-984d-9588640bf28a

test_103a failed with the following error:

Fail to archive files 0/8

Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/91760 - 4.18.0-372.32.1.el8_6.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/91760 - 4.18.0-372.32.1.el8_lustre.x86_64

CMD: trevis-97vm2 mkdir -p /tmp/arc1/sanity-hsm.test_103a/
Starting copytool 'agt1' on 'trevis-97vm2' with cmdline 'lhsmtool_posix --archive-format=v2 --hsm-root=/tmp/arc1/sanity-hsm.test_103a/ --daemon --pid-file=/var/run/lhsmtool_posix.pid  "/mnt/lustre2"'
CMD: trevis-97vm2 lhsmtool_posix --archive-format=v2 --hsm-root=/tmp/arc1/sanity-hsm.test_103a/ --daemon --pid-file=/var/run/lhsmtool_posix.pid  "/mnt/lustre2" < /dev/null > "/autotest/autotest-1/2023-01-20/lustre-reviews_review-dne-part-4_91760_6_b840693a-3f75-48a2-807f-1b5befd2e381//sanity-hsm.test_103a.copytool_log.trevis-97vm2.log" 2>&1
(0x200007937:0x9b:0x0|0x200007937:0x9c:0x0|0x200007937:0x9d:0x0|0x200007937:0x9e:0x0|0x200007937:0x9f:0x0|0x200007937:0xa0:0x0|0x200007937:0xa1:0x0|0x200007937:0xa2:0x0).*action=ARCHIVE.*status=SUCCEED
CMD: trevis-97vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions |
			grep -c -E '(0x200007937:0x9b:0x0|0x200007937:0x9c:0x0|0x200007937:0x9d:0x0|0x200007937:0x9e:0x0|0x200007937:0x9f:0x0|0x200007937:0xa0:0x0|0x200007937:0xa1:0x0|0x200007937:0xa2:0x0).*action=ARCHIVE.*status=SUCCEED'
pdsh@trevis-97vm1: trevis-97vm4: ssh exited with exit code 1
CMD: trevis-97vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions |
			grep -c -E '(0x200007937:0x9b:0x0|0x200007937:0x9c:0x0|0x200007937:0x9d:0x0|0x200007937:0x9e:0x0|0x200007937:0x9f:0x0|0x200007937:0xa0:0x0|0x200007937:0xa1:0x0|0x200007937:0xa2:0x0).*action=ARCHIVE.*status=SUCCEED'
pdsh@trevis-97vm1: trevis-97vm4: ssh exited with exit code 1
CMD: trevis-97vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions |
			grep -c -E '(0x200007937:0x9b:0x0|0x200007937:0x9c:0x0|0x200007937:0x9d:0x0|0x200007937:0x9e:0x0|0x200007937:0x9f:0x0|0x200007937:0xa0:0x0|0x200007937:0xa1:0x0|0x200007937:0xa2:0x0).*action=ARCHIVE.*status=SUCCEED'
pdsh@trevis-97vm1: trevis-97vm4: ssh exited with exit code 1
CMD: trevis-97vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions |
			grep -c -E '(0x200007937:0x9b:0x0|0x200007937:0x9c:0x0|0x200007937:0x9d:0x0|0x200007937:0x9e:0x0|0x200007937:0x9f:0x0|0x200007937:0xa0:0x0|0x200007937:0xa1:0x0|0x200007937:0xa2:0x0).*action=ARCHIVE.*status=SUCCEED'
pdsh@trevis-97vm1: trevis-97vm4: ssh exited with exit code 1
CMD: trevis-97vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions |
			grep -c -E '(0x200007937:0x9b:0x0|0x200007937:0x9c:0x0|0x200007937:0x9d:0x0|0x200007937:0x9e:0x0|0x200007937:0x9f:0x0|0x200007937:0xa0:0x0|0x200007937:0xa1:0x0|0x200007937:0xa2:0x0).*action=ARCHIVE.*status=SUCCEED'
pdsh@trevis-97vm1: trevis-97vm4: ssh exited with exit code 1
 sanity-hsm test_103a: @@@@@@ FAIL: Fail to archive files 0/8 
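
Note on the polls above: `grep -c` prints the number of matching lines but exits with status 1 when that number is zero, which is most likely what pdsh reports as `ssh exited with exit code 1` on each attempt. The Python sketch below mirrors the check under stated assumptions: the FIDs and the regular expression are taken from the log, while the function name `count_succeeded` and the sample `hsm.actions` lines are hypothetical and heavily simplified (real entries carry more fields).

```python
import re

# FIDs of the eight test files, as they appear in the log above.
FIDS = [f"0x200007937:{oid:#x}:0x0" for oid in range(0x9b, 0xa3)]

# Same shape as the grep -E expression used by the test.
PATTERN = re.compile("(" + "|".join(FIDS) + ").*action=ARCHIVE.*status=SUCCEED")


def count_succeeded(actions_text: str) -> int:
    """Count the lines that grep -c -E would count for PATTERN."""
    return sum(1 for line in actions_text.splitlines() if PATTERN.search(line))


# Hypothetical, simplified hsm.actions content: all requests still in progress.
sample = "\n".join(f"fid=[{fid}] action=ARCHIVE status=STARTED" for fid in FIDS)

print(count_succeeded(sample))  # prints 0; grep would exit with status 1 here
```

A count of 8 is what the test waits for; any lower count after the last poll produces the `Fail to archive files N/8` message.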

Copytool:

1674184476.831846 lhsmtool_posix[277418]: xattr file for '/mnt/lustre2/.lustre/fid/0x200007937:0x9f:0x0' saved to archive '/tmp/arc1/sanity-hsm.test_103a//79a8/0x200007937:0x9f:0x0_tmp'
lhsmtool_posix: 1674184476.831829 lhsmtool_posix[277415]: symlink '/tmp/arc1/sanity-hsm.test_103a//shadow/d103a.sanity-hsm/f103a.sanity-hsm_1' to '../../79ab/0x200007937:0x9c:0x0' done
lhsmtool_posix: 1674184476.831954 lhsmtool_posix[277415]: Action completed, notifying coordinator cookie=0x63ca0716, FID=[0x200007937:0x9c:0x0], hp_flags=0 err=0
lhsmtool_posix: 1674184476.832199 lhsmtool_posix[277420]: symlink '/tmp/arc1/sanity-hsm.test_103a//shadow/d103a.sanity-hsm/f103a.sanity-hsm_6' to '../../7996/0x200007937:0xa1:0x0' done
lhsmtool_posix: 1674184476.832211 lhsmtool_posix[277418]: symlink '/tmp/arc1/sanity-hsm.test_103a//shadow/d103a.sanity-hsm/f103a.sanity-hsm_4' to '../../79a8/0x200007937:0x9f:0x0' done
lhsmtool_posix: 1674184476.832248 lhsmtool_posix[277418]: Action completed, notifying coordinator cookie=0x63ca0719, FID=[0x200007937:0x9f:0x0], hp_flags=0 err=0
lhsmtool_posix: 1674184476.832235 lhsmtool_posix[277420]: Action completed, notifying coordinator cookie=0x63ca071b, FID=[0x200007937:0xa1:0x0], hp_flags=0 err=0
lhsmtool_posix: 1674184476.832615 lhsmtool_posix[277417]: llapi_hsm_action_end() on '/mnt/lustre2/.lustre/fid/0x200007937:0x9e:0x0' ok (rc=0)
lhsmtool_posix: 1674184476.832646 lhsmtool_posix[277421]: llapi_hsm_action_end() on '/mnt/lustre2/.lustre/fid/0x200007937:0xa2:0x0' ok (rc=0)
lhsmtool_posix: 1674184476.833221 lhsmtool_posix[277418]: llapi_hsm_action_end() on '/mnt/lustre2/.lustre/fid/0x200007937:0x9f:0x0' ok (rc=0)
lhsmtool_posix: 1674184476.833282 lhsmtool_posix[277420]: llapi_hsm_action_end() on '/mnt/lustre2/.lustre/fid/0x200007937:0xa1:0x0' ok (rc=0)
lhsmtool_posix: 1674184476.833385 lhsmtool_posix[277415]: llapi_hsm_action_end() on '/mnt/lustre2/.lustre/fid/0x200007937:0x9c:0x0' ok (rc=0)
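
The copytool lines are stamped with epoch seconds. A minimal Python sketch, assuming those stamps are seconds since the Unix epoch, converts the last `llapi_hsm_action_end()` stamp in the excerpt to UTC wall-clock time so it can be compared with the time of the failure in the Maloo test logs. The helper name `to_utc` is hypothetical.

```python
from datetime import datetime, timezone


def to_utc(stamp: str) -> str:
    """Format a copytool timestamp (seconds.microseconds) as UTC."""
    seconds, micros = stamp.split(".")
    moment = datetime.fromtimestamp(int(seconds), tz=timezone.utc)
    return f"{moment:%Y-%m-%d %H:%M:%S}.{micros} UTC"


# Last llapi_hsm_action_end() line in the excerpt above.
print(to_utc("1674184476.833385"))  # prints 2023-01-20 03:14:36.833385 UTC
```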

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity-hsm test_103a - Fail to archive files 0/8



 Comments   
Comment by Etienne Aujames [ 20/Jan/23 ]

The archive requests seem to need more time: the copytool completes the last archive at about the moment the test fails.

Generated at Sat Feb 10 03:27:31 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.