[LU-6559] sanity-hsm test_15: rebind list of files: test failed to respond and timed out Created: 04/May/15  Updated: 10/Aug/15  Resolved: 07/May/15

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: Lustre 2.8.0

Type: Bug Priority: Blocker
Reporter: Maloo Assignee: John Hammond
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-6555 sanity-hsm test_13, test_15 timeout Resolved
Related
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for liuying <emoly.liu@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/8cd9b77a-efbe-11e4-96a8-5254006e85c2.

The sub-test test_15 failed with the following error:

test failed to respond and timed out

Please provide additional information about the failure here.
Client 2 (shadow-20vm9) test_log:

rebind list of files
CMD: shadow-20vm6 lhsmtool_posix --archive 2 --hsm-root /home/autotest2/.autotest/shared_dir/2015-04-30/193557-69851878931860/arc1		 --rebind /home/autotest2/.autotest/shared_dir/2015-04-30/193557-69851878931860/tmp.20081 /mnt/lustre
shadow-20vm6: 1430444492.342460 lhsmtool_posix[22365]: action=2 src=/home/autotest2/.autotest/shared_dir/2015-04-30/193557-69851878931860/tmp.20081 dst=(null) mount_point=/mnt/lustre
shadow-20vm6: 1430444493.717411 lhsmtool_posix[22365]: rebind [0x400000401:0x13:0x0] to [0x400000400:0x35:0x0]

This issued has occurred since April 27.



 Comments   
Comment by Gerrit Updater [ 04/May/15 ]

John L. Hammond (john.hammond@intel.com) uploaded a new patch: http://review.whamcloud.com/14659
Subject: LU-6559 test: use local tmp for HSM archive
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 7146134576d753aa55d1c1d9b07639766fd0d681

Comment by John Hammond [ 05/May/15 ]

In the last two weeks sanity-hsm runs on shadow went from taking 1 hour to 4. I suspect that this and the timeouts are due to NFS performance degradation on shadow.

https://testing.hpdd.intel.com/test_sessions/e305acd0-f2b6-11e4-898d-5254006e85c2 would seem to confirm this.

Comment by Gerrit Updater [ 07/May/15 ]

Andreas Dilger (andreas.dilger@intel.com) merged in patch http://review.whamcloud.com/14659/
Subject: LU-6559 test: use local tmp for HSM archive
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: bd07c02c775d9a15fb1cdd34d556c660311c725a

Comment by Andreas Dilger [ 07/May/15 ]

Patch landed to master for 2.8.0

Generated at Sat Feb 10 02:01:16 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.