Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4063

sanity-hsm test_12a failure: 'Restored file differs'

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Minor Minor
    • Lustre 2.6.0, Lustre 2.5.2
    • Lustre 2.5.0
    • Luster master build # 1715
      OpenSFS cluster with combined MGS/MDS, single OSS with two OSTs, three clients; one agent + client, one with robinhood/db running + client and one just running as Lustre clients
    • 3
    • 10887

      The test results are at: https://maloo.whamcloud.com/test_sets/8e9cca2c-2c8b-11e3-85ee-52540035b04c

      From the client test_log:

      == sanity-hsm test 12a: Restore an imported file explicitly == 14:02:01 (1380834121)
      pdsh@c15: c13: ssh exited with exit code 1
      Purging archive on c13
      Starting copytool agt1 on c13
      c13: lhsmtool_posix[5634]: action=1 src=d0.sanity-hsm/d12/f.sanity-hsm.12a dst=/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a mount_point=/lustre/scratch
      c13: lhsmtool_posix[5634]: importing '/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a' from '/lustre/archive/d0.sanity-hsm/d12/f.sanity-hsm.12a'
      c13: lhsmtool_posix[5634]: imported '/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a' from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0'=='/lustre/archive/d0.sanity-hsm/d12/f.sanity-hsm.12a'
      c13: lhsmtool_posix[5634]: process finished, errs: 0 major, 0 minor, rc=0 (Success)
      Verifying released state: 
      Verifying file state: 
      c13: diff: /lustre/scratch2/d0.sanity-hsm/d12/f.sanity-hsm.12a: No such file or directory
      pdsh@c15: c13: ssh exited with exit code 2
       sanity-hsm test_12a: @@@@@@ FAIL: Restored file differs 
        Trace dump:
        = /usr/lib64/lustre/tests/test-framework.sh:4264:error_noexit()
        = /usr/lib64/lustre/tests/test-framework.sh:4291:error()
        = /usr/lib64/lustre/tests/sanity-hsm.sh:853:test_12a()
      

      From the copytool log, it looks like the copy tool is having problems finding the file:

      lhsmtool_posix[5564]: action=0 src=(null) dst=(null) mount_point=/lustre/scratch
      lhsmtool_posix[5565]: waiting for message from kernel
      lhsmtool_posix[5565]: copytool fs=scratch archive#=2 item_count=1
      lhsmtool_posix[5565]: waiting for message from kernel
      lhsmtool_posix[5635]: '[0x200000402:0x2:0x0]' action RESTORE reclen 72, cookie=0x524dd1eb
      lhsmtool_posix[5635]: processing file 'd0.sanity-hsm/d12/f.sanity-hsm.12a'
      lhsmtool_posix[5635]: reading stripe rules from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0.lov' for '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0'
      lhsmtool_posix[5635]: cannot open '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0.lov': No such file or directory (2)
      lhsmtool_posix[5635]: cannot get stripe rules for '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' (No data available), use default
      lhsmtool_posix[5635]: restoring data from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]'
      lhsmtool_posix[5635]: going to copy data from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]'
      lhsmtool_posix[5635]: Going to copy 363 bytes /lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0 -> {VOLATILE}=[0x200000402:0x3:0x0]
      
      lhsmtool_posix[5635]: data restore from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]' done
      lhsmtool_posix[5635]: Action completed, notifying coordinator cookie=0x524dd1eb, FID=[0x200000402:0x2:0x0], hp_flags=0 err=0
      lhsmtool_posix[5635]: llapi_hsm_action_end() on '/lustre/scratch/.lustre/fid/0x200000402:0x2:0x0' ok (rc=0)
      exiting: Interrupt
      

      Looking at all the tests in sanity-hsm, very few tests use $DIR2 to access files on the file system:

      	local f=$DIR/$tdir/$tfile
      	import_file $tdir/$tfile $f
      	local f=$DIR2/$tdir/$tfile
      

      Commenting out the last line above allows the test to complete successfully, but this may defeat what is being tested.

            jay Jinshan Xiong (Inactive)
            jamesanunez James Nunez (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

              Created:
              Updated:
              Resolved: