Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-5231

HSM: incorrect progress reported by lfs hsm_action

Details

    • Bug
    • Resolution: Unresolved
    • Minor
    • None
    • Lustre 2.6.0
    • Centos 6.5
      Lustre 2.5.56
    • 3
    • 14572

    Description

      While archiving a 5GiB file on a client:

      # ls -l bigf
      -rw-r--r--  1 root root 5129887744 Jun 18 16:48 bigf
      # lfs hsm_archive bigf
      

      I repeatedly checked the progress with lfs hsm_action:

      bigf: ARCHIVE running (0xa00000 bytes moved)
      ...
      bigf: ARCHIVE running (0x3200000 bytes moved)
      ...
      bigf: ARCHIVE running (0x6400000 bytes moved)
      ...
      ...
      ...
      bigf: ARCHIVE running (0x1a0e500000 bytes moved)
      ...
      bigf: NOOP
      

      The last result should have been close to 5 GiB, instead it was 0x1a0e500000 = 111909273600 = 104GiB

      It appears the (correct) lengths sent by the copytool through llapi_hsm_action_progress() are being added somewhere in the kernel modules.

      Also, is there a reason for the reported size to be written in hex? It's not convenient.

      Attachments

        Activity

          [LU-5231] HSM: incorrect progress reported by lfs hsm_action
          pjones Peter Jones added a comment -

          My overzealousness I'm afraid - sorry. Reopening

          pjones Peter Jones added a comment - My overzealousness I'm afraid - sorry. Reopening
          spitzcor Cory Spitz added a comment -

          I'm not sure why this ticket was marked resolved when http://review.whamcloud.com/12020 is still pending.

          spitzcor Cory Spitz added a comment - I'm not sure why this ticket was marked resolved when http://review.whamcloud.com/12020 is still pending.
          pjones Peter Jones added a comment -

          Landed for 2.8

          pjones Peter Jones added a comment - Landed for 2.8

          Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/13678/
          Subject: LU-5231 hsm: display file size in decimal not hex
          Project: fs/lustre-release
          Branch: master
          Current Patch Set:
          Commit: 9e40a86a875475f3fe1991a48694ba5601033ebe

          gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/13678/ Subject: LU-5231 hsm: display file size in decimal not hex Project: fs/lustre-release Branch: master Current Patch Set: Commit: 9e40a86a875475f3fe1991a48694ba5601033ebe

          frank zago (fzago@cray.com) uploaded a new patch: http://review.whamcloud.com/13678
          Subject: LU-5231 hsm: display file size in decimal not hex
          Project: fs/lustre-release
          Branch: master
          Current Patch Set: 1
          Commit: 1f8c980d4cd2328936e9dd484909c62a4a5fd8cc

          gerrit Gerrit Updater added a comment - frank zago (fzago@cray.com) uploaded a new patch: http://review.whamcloud.com/13678 Subject: LU-5231 hsm: display file size in decimal not hex Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 1f8c980d4cd2328936e9dd484909c62a4a5fd8cc

          Please observe the comments and kernel logs below.
          Looking at these logs, I strongly feel that last extent alone will give proper amount of data moved.
          Logs are taken in steps. Please observe the data->done_size in each step (numbered as 1, 2, 3....).
          The values are repeating in each step which means whenever the ioctl is performed, the extents are
          getting stored in tree. I believe we should not add up all these extents and the last entry in the
          tree (last extent) will alone give the total amount of data moved.

          vinayak_clogeny Vinayak Hariharmath (Inactive) added a comment - Please observe the comments and kernel logs below. Looking at these logs, I strongly feel that last extent alone will give proper amount of data moved. Logs are taken in steps. Please observe the data->done_size in each step (numbered as 1, 2, 3....). The values are repeating in each step which means whenever the ioctl is performed, the extents are getting stored in tree. I believe we should not add up all these extents and the last entry in the tree (last extent) will alone give the total amount of data moved.

          Hi Peter,

          Pushed the changes to master branch.
          http://review.whamcloud.com/12020

          Thanks,
          Vinayak

          vinayak_clogeny Vinayak Hariharmath (Inactive) added a comment - Hi Peter, Pushed the changes to master branch. http://review.whamcloud.com/12020 Thanks, Vinayak
          pjones Peter Jones added a comment -

          Vinayak

          Could you please push your proposed patch against master?

          Thanks

          Peter

          pjones Peter Jones added a comment - Vinayak Could you please push your proposed patch against master? Thanks Peter

          Posted changes for fixing this issue on b2_6

          http://review.whamcloud.com/12006

          vinayak_clogeny Vinayak Hariharmath (Inactive) added a comment - Posted changes for fixing this issue on b2_6 http://review.whamcloud.com/12006

          copytool is standard posix copytool.

          fzago Frank Zago (Inactive) added a comment - copytool is standard posix copytool.

          People

            wc-triage WC Triage
            fzago Frank Zago (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            12 Start watching this issue

            Dates

              Created:
              Updated: