[LU-14022] sanity-hsm test 1a hung at verifying released state
Created: 09/Oct/20  Updated: 23/Dec/20  Resolved: 12/Oct/20

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.14.0
Fix Version/s: Upstream

Type: Bug
Priority: Minor
Reporter: Jian Yu
Assignee: WC Triage
Resolution: Duplicate
Votes: 0
Labels: None
Environment: Ubuntu 20.04 client

Issue Links:
Related: is related to LU-13182 "MAP_POPULATE hangs with Linux 5.4" (Resolved)
Severity: 3

Description

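sanity-hsm test 1a hung at the "Verifying released state" step. The archive request completed, and the log stops right after the verification begins: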

== sanity-hsm test 1a: mmap & cat a HSM released file ================================================ 10:16:06 (1601633766)
1+0 records in
1+0 records out
1048576 bytes (1.0 MB, 1.0 MiB) copied, 0.22336 s, 4.7 MB/s
CMD: trevis-28vm2 mkdir -p /tmp/arc1/sanity-hsm.test_1a/
Starting copytool agt1 on trevis-28vm2
CMD: trevis-28vm2 lhsmtool_posix  --daemon --hsm-root "/tmp/arc1/sanity-hsm.test_1a/" "/mnt/lustre2"  "/autotest/autotest-1/2020-10-02/lustre-master_full_4098_1_17_9303ce79-8526-4737-9e0e-fcd3f0d27e76/sanity-hsm.test_1a.copytool_log.trevis-28vm2.log" 2>&1
CMD: trevis-28vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200000404:0x2:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Waiting 200s for 'SUCCEED'
CMD: trevis-28vm4 /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200000404:0x2:0x0'.*action='ARCHIVE'/ {print \$13}' | cut -f2 -d=
Verifying released state: 

https://testing.whamcloud.com/test_sets/56f2170d-4406-49de-842b-08470046b905
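
The "Waiting 200s for 'SUCCEED'" step in the log above polls `mdt.lustre-MDT0000.hsm.actions` until the ARCHIVE request for the FID reports success. A minimal Python sketch of that polling pattern follows. It is not the sanity-hsm implementation: the function names `extract_status` and `wait_for_status` are hypothetical, and it assumes the status appears as a `status=` token on the matching line, whereas the test selects awk field 13.

```python
import re
import time
from typing import Callable, Optional


def extract_status(actions_output: str, fid: str,
                   action: str = "ARCHIVE") -> Optional[str]:
    """Return the status= value of the first line matching fid and action.

    Mirrors the awk pattern /FID.*action=ARCHIVE/ from the log, but looks
    up the field by its "status=" prefix (an assumption) instead of by
    column position.
    """
    pattern = re.compile(re.escape(fid) + r".*action=" + re.escape(action))
    for line in actions_output.splitlines():
        if pattern.search(line):
            for field in line.split():
                if field.startswith("status="):
                    return field.split("=", 1)[1]
    return None


def wait_for_status(fetch: Callable[[], str], fid: str,
                    expected: str = "SUCCEED",
                    timeout: float = 200.0, interval: float = 1.0,
                    sleep: Callable[[float], None] = time.sleep,
                    clock: Callable[[], float] = time.monotonic) -> bool:
    """Poll fetch() until the action reaches `expected` or `timeout` elapses.

    `fetch` is a caller-supplied callable returning the current text of
    hsm.actions; this sketch does not run lctl itself.
    """
    deadline = clock() + timeout
    while True:
        if extract_status(fetch(), fid) == expected:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)
```

In a real run, `fetch` would wrap the `lctl get_param -n mdt.lustre-MDT0000.hsm.actions` command shown in the log. Note that in this ticket the poll itself succeeded; the hang occurred afterwards, during the released-state verification.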



Comments
Comment by James A Simmons [ 09/Oct/20 ]

I see this with the Linux client as well.

Comment by Oleg Drokin [ 12/Oct/20 ]

The fix is tracked under LU-13182.
