Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4063

sanity-hsm test_12a failure: 'Restored file differs'

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.6.0, Lustre 2.5.2
    • Lustre 2.5.0
    • Luster master build # 1715
      OpenSFS cluster with combined MGS/MDS, single OSS with two OSTs, three clients; one agent + client, one with robinhood/db running + client and one just running as Lustre clients
    • 3
    • 10887

    Description

      The test results are at: https://maloo.whamcloud.com/test_sets/8e9cca2c-2c8b-11e3-85ee-52540035b04c

      From the client test_log:

      == sanity-hsm test 12a: Restore an imported file explicitly == 14:02:01 (1380834121)
      pdsh@c15: c13: ssh exited with exit code 1
      Purging archive on c13
      Starting copytool agt1 on c13
      c13: lhsmtool_posix[5634]: action=1 src=d0.sanity-hsm/d12/f.sanity-hsm.12a dst=/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a mount_point=/lustre/scratch
      c13: lhsmtool_posix[5634]: importing '/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a' from '/lustre/archive/d0.sanity-hsm/d12/f.sanity-hsm.12a'
      c13: lhsmtool_posix[5634]: imported '/lustre/scratch/d0.sanity-hsm/d12/f.sanity-hsm.12a' from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0'=='/lustre/archive/d0.sanity-hsm/d12/f.sanity-hsm.12a'
      c13: lhsmtool_posix[5634]: process finished, errs: 0 major, 0 minor, rc=0 (Success)
      Verifying released state: 
      Verifying file state: 
      c13: diff: /lustre/scratch2/d0.sanity-hsm/d12/f.sanity-hsm.12a: No such file or directory
      pdsh@c15: c13: ssh exited with exit code 2
       sanity-hsm test_12a: @@@@@@ FAIL: Restored file differs 
        Trace dump:
        = /usr/lib64/lustre/tests/test-framework.sh:4264:error_noexit()
        = /usr/lib64/lustre/tests/test-framework.sh:4291:error()
        = /usr/lib64/lustre/tests/sanity-hsm.sh:853:test_12a()
      

      From the copytool log, it looks like the copy tool is having problems finding the file:

      lhsmtool_posix[5564]: action=0 src=(null) dst=(null) mount_point=/lustre/scratch
      lhsmtool_posix[5565]: waiting for message from kernel
      lhsmtool_posix[5565]: copytool fs=scratch archive#=2 item_count=1
      lhsmtool_posix[5565]: waiting for message from kernel
      lhsmtool_posix[5635]: '[0x200000402:0x2:0x0]' action RESTORE reclen 72, cookie=0x524dd1eb
      lhsmtool_posix[5635]: processing file 'd0.sanity-hsm/d12/f.sanity-hsm.12a'
      lhsmtool_posix[5635]: reading stripe rules from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0.lov' for '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0'
      lhsmtool_posix[5635]: cannot open '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0.lov': No such file or directory (2)
      lhsmtool_posix[5635]: cannot get stripe rules for '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' (No data available), use default
      lhsmtool_posix[5635]: restoring data from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]'
      lhsmtool_posix[5635]: going to copy data from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]'
      lhsmtool_posix[5635]: Going to copy 363 bytes /lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0 -> {VOLATILE}=[0x200000402:0x3:0x0]
      
      lhsmtool_posix[5635]: data restore from '/lustre/archive/0002/0000/0402/0000/0002/0000/0x200000402:0x2:0x0' to '{VOLATILE}=[0x200000402:0x3:0x0]' done
      lhsmtool_posix[5635]: Action completed, notifying coordinator cookie=0x524dd1eb, FID=[0x200000402:0x2:0x0], hp_flags=0 err=0
      lhsmtool_posix[5635]: llapi_hsm_action_end() on '/lustre/scratch/.lustre/fid/0x200000402:0x2:0x0' ok (rc=0)
      exiting: Interrupt
      

      Looking at all the tests in sanity-hsm, very few tests use $DIR2 to access files on the file system:

      	local f=$DIR/$tdir/$tfile
      	import_file $tdir/$tfile $f
      	local f=$DIR2/$tdir/$tfile
      

      Commenting out the last line above allows the test to complete successfully, but this may defeat what is being tested.

      Attachments

        Activity

          People

            jay Jinshan Xiong (Inactive)
            jamesanunez James Nunez (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: