[LU-4064] sanity-hsm test_13 failure: 'not enough identical files (0 != 100)' Created: 04/Oct/13  Updated: 19/Dec/14  Resolved: 28/Oct/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.5.0
Fix Version/s: Lustre 2.6.0, Lustre 2.5.1

Type: Bug
Priority: Minor
Reporter: James Nunez (Inactive)
Assignee: John Hammond
Resolution: Fixed
Votes: 0
Labels: HSM
Environment:

Lustre master build #1715
OpenSFS cluster with a combined MGS/MDS, a single OSS with two OSTs, and three clients: one running the agent plus a client, one running robinhood/db plus a client, and one running only as a Lustre client


Severity: 3
Rank (Obsolete): 10888

 Description   

The test results are at: https://maloo.whamcloud.com/test_sets/8e9cca2c-2c8b-11e3-85ee-52540035b04c

From the client test_log:

== sanity-hsm test 13: Recursively import and restore a directory == 14:02:44 (1380834164)
pdsh@c15: c13: ssh exited with exit code 1
Purging archive on c13
Starting copytool agt1 on c13
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.1/f.sanity-hsm.13.1: No such file or directory
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.1/f.sanity-hsm.13.2: No such file or directory
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.1/f.sanity-hsm.13.3: No such file or directory
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.1/f.sanity-hsm.13.4: No such file or directory
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.1/f.sanity-hsm.13.5: No such file or directory
...
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.10/f.sanity-hsm.13.8: No such file or directory
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.10/f.sanity-hsm.13.9: No such file or directory
/usr/lib64/lustre/tests/sanity-hsm.sh: line 1084: /lustre/archive/import.orig/dir.10/f.sanity-hsm.13.10: No such file or directory
c13: lhsmtool_posix[8051]: action=1 src=import.orig dst=/lustre/scratch/d0.sanity-hsm/d13 mount_point=/lustre/scratch
c13: lhsmtool_posix[8051]: process finished, errs: 0 major, 0 minor, rc=0 (Success)
c13: diff: /lustre/scratch/d0.sanity-hsm/d13/import.orig: No such file or directory
pdsh@c15: c13: ssh exited with exit code 2
c13: diff: /lustre/scratch/d0.sanity-hsm/d13/import.orig: No such file or directory
pdsh@c15: c13: ssh exited with exit code 2
 sanity-hsm test_13: @@@@@@ FAIL: not enough identical files (0 != 100) 
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:4264:error_noexit()
  = /usr/lib64/lustre/tests/test-framework.sh:4291:error()
  = /usr/lib64/lustre/tests/sanity-hsm.sh:1101:test_13()

Nothing interesting in the copytool log.



 Comments   
Comment by John Hammond [ 04/Oct/13 ]

The issue is that

	do_facet $SINGLEAGT \
		echo "d=$d, f=$f, dir=$CURR_DIR, "\
		     "file=$CURR_FILE" > $CURR_FILE

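writes to $CURR_FILE on the local node, not on the agent: the '>' redirection is unquoted, so the local shell applies it before do_facet runs the command remotely.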
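
The quoting behaviour described above can be reproduced without a cluster. The following is a minimal sketch, not the change made in the actual patch: run_on_agent is a hypothetical stand-in for do_facet, and two temporary directories (LOCAL_ROOT and AGENT_ROOT, both assumptions of this example) play the roles of the local node and the agent.

```shell
#!/bin/bash
# Two scratch directories stand in for the local node and the agent node.
LOCAL_ROOT=$(mktemp -d)
AGENT_ROOT=$(mktemp -d)

# Hypothetical stand-in for do_facet: runs the given command string in a
# separate shell whose working directory represents the agent.
run_on_agent() {
    (cd "$AGENT_ROOT" && sh -c "$*")
}

cd "$LOCAL_ROOT" || exit 1

# Unquoted: the local shell processes '>' before run_on_agent is called,
# so the file is created under LOCAL_ROOT.
run_on_agent echo "unquoted" > f.unquoted

# Quoted: '>' is part of the command string, so the agent-side shell
# performs the redirection and the file is created under AGENT_ROOT.
run_on_agent "echo quoted > f.quoted"

ls "$LOCAL_ROOT"   # lists f.unquoted
ls "$AGENT_ROOT"   # lists f.quoted
```

Applied to the snippet above, the same idea would mean passing the redirection to the agent inside the quoted command string, so that the remote shell creates $CURR_FILE.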

Comment by John Hammond [ 08/Oct/13 ]

Please see http://review.whamcloud.com/7885.

Comment by John Hammond [ 28/Oct/13 ]

Patch landed to master.

Comment by Gerrit Updater [ 19/Dec/14 ]

Bobi Jam (bobijam@gmail.com) uploaded a new patch: http://review.whamcloud.com/13137
Subject: LU-4064 clio: update comments after cl_lock simplification
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: f68368ef573fa7cdcc3e36352a3df29d14eb7a32
