Details
-
Bug
-
Resolution: Fixed
-
Minor
-
Lustre 2.5.0
-
Luster master build # 1715
OpenSFS cluster with combined MGS/MDS, single OSS with two OSTs, three clients; one agent + client, one with robinhood/db running + client and one just running as Lustre clients
-
3
-
10887
Description
The test results are at: https://maloo.whamcloud.com/test_sets/8e9cca2c-2c8b-11e3-85ee-52540035b04c
From the client test_log:
== sanity-hsm test 12a: Restore an imported file explicitly == 14:02:01 (1380834121) pdsh@c15: c13: ssh exited with exit code 1 Purging archive on c13 Starting copytool agt1 on c13 c13: lhsmtool_posix[5634]: action=1 src=d0.sanity-hsm/d12/f.sanity-hsm.12a dst=/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a mount_point=/lustre/scratch c13: lhsmtool_posix[5634]: importing '/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a' from '/lustre/archive/d0.sanity-hsm/d12/f.sanity-hsm.12a' c13: lhsmtool_posix[5634]: imported '/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a' from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0'=='/lustre/archive/d0.sanity-hsm/d12/f.sanity-hsm.12a' c13: lhsmtool_posix[5634]: process finished, errs: 0 major, 0 minor, rc=0 (Success) Verifying released state: Verifying file state: c13: diff: /lustre/scratch2/d0.sanity-hsm/d12/f.sanity-hsm.12a: No such file or directory pdsh@c15: c13: ssh exited with exit code 2 sanity-hsm test_12a: @@@@@@ FAIL: Restored file differs Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:4264:error_noexit() = /usr/lib64/lustre/tests/test-framework.sh:4291:error() = /usr/lib64/lustre/tests/sanity-hsm.sh:853:test_12a()
From the copytool log, it looks like the copy tool is having problems finding the file:
lhsmtool_posix[5564]: action=0 src=(null) dst=(null) mount_point=/lustre/scratch lhsmtool_posix[5565]: waiting for message from kernel lhsmtool_posix[5565]: copytool fs=scratch archive#=2 item_count=1 lhsmtool_posix[5565]: waiting for message from kernel lhsmtool_posix[5635]: '[0x200000402:0x2:0x0]' action RESTORE reclen 72, cookie=0x524dd1eb lhsmtool_posix[5635]: processing file 'd0.sanity-hsm/d12/f.sanity-hsm.12a' lhsmtool_posix[5635]: reading stripe rules from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0.lov' for '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' lhsmtool_posix[5635]: cannot open '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0.lov': No such file or directory (2) lhsmtool_posix[5635]: cannot get stripe rules for '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' (No data available), use default lhsmtool_posix[5635]: restoring data from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]' lhsmtool_posix[5635]: going to copy data from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]' lhsmtool_posix[5635]: Going to copy 363 bytes /lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0 -> {VOLATILE}=[0x200000402:0x3:0x0] lhsmtool_posix[5635]: data restore from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]' done lhsmtool_posix[5635]: Action completed, notifying coordinator cookie=0x524dd1eb, FID=[0x200000402:0x2:0x0], hp_flags=0 err=0 lhsmtool_posix[5635]: llapi_hsm_action_end() on '/lustre/scratch/.lustre/fid/0x200000402:0x2:0x0' ok (rc=0) exiting: Interrupt
Looking at all the tests in sanity-hsm, very few tests use $DIR2 to access files on the file system:
local f=$DIR/$tdir/$tfile import_file $tdir/$tfile $f local f=$DIR2/$tdir/$tfile
Commenting out the last line above allows the test to complete successfully, but this may defeat what is being tested.