Details

    • Improvement
    • Resolution: Unresolved
    • Minor
    • None
    • None
    • 16922

    Description

      Currently file migration is done by "lfs migrate". This has a couple problems:

      • It doesn't scale well to large files and/or large number of files to copy.
      • If an external tools (such as Robinhood) wants to migrate a file, it has to spawn lfs to do the job, for every file, which has a big overhead.

      A solution to this issue is to move the copy operation into the copytool. The copytool already knows how to copy files, and Lustre has control over these files. This imply to re-use and extend the infrastructure created for HSM, and would solve the two issues mentioned above:

      • Migrating a file, from an application point of view, becomes more or less the same as archiving a file.
      • Since the file copy is done by the copytool, this copy can be optimized in one place.

      Attachments

        Issue Links

          Activity

            [LU-6081] hsm: add file migrate support

            generic enough to allow both migrate, mirror, and resync operations

            The mirror op is addressed in LU-12890, which in turn is blocked on this one. So iiuc we should land them both?

            nrutman Nathan Rutman added a comment - generic enough to allow both migrate, mirror, and resync operations The mirror op is addressed in LU-12890 , which in turn is blocked on this one. So iiuc we should land them both?

            I can't find the specific comment you are referring to, but I don't think we need to implement all migration via mirroring right now. I think my suggestion is that if we are implementing an interface to send "data movement" commands to HSM agent nodes, then it needs to be generic enough to allow both migrate, mirror, and resync operations.

            adilger Andreas Dilger added a comment - I can't find the specific comment you are referring to, but I don't think we need to implement all migration via mirroring right now. I think my suggestion is that if we are implementing an interface to send "data movement" commands to HSM agent nodes, then it needs to be generic enough to allow both migrate, mirror, and resync operations.

            @Cory Spitz, if we were to follow the suggestion by Andreas, I believe we would have to change migrate to use FLR mirrors and work on landing LU-12890. I don't have a clear understanding of what this would involve yet, but Andreas mentioned in https://review.whamcloud.com/#/c/13243/29/lustre/utils/lhsmtool_posix.c@1392 that this should be done via "mirror extend" and "mirror split -d".

            @Andreas Dilger, do you think we should work on integrating migrate with FLR mirrors at this point, or could we try to land the remaining patch for this ticket (http://review.whamcloud.com/13243) and work on integrating FLR in a separate patch?

            nangelinas Nikitas Angelinas added a comment - @Cory Spitz, if we were to follow the suggestion by Andreas, I believe we would have to change migrate to use FLR mirrors and work on landing LU-12890 . I don't have a clear understanding of what this would involve yet, but Andreas mentioned in https://review.whamcloud.com/#/c/13243/29/lustre/utils/lhsmtool_posix.c@1392 that this should be done via "mirror extend" and "mirror split -d". @Andreas Dilger, do you think we should work on integrating migrate with FLR mirrors at this point, or could we try to land the remaining patch for this ticket ( http://review.whamcloud.com/13243 ) and work on integrating FLR in a separate patch?
            spitzcor Cory Spitz added a comment -

            nangelinas, do you have an opinion about how to wrap this up? We now have the patches for FLR mirroring as with LU-12890.

            spitzcor Cory Spitz added a comment - nangelinas , do you have an opinion about how to wrap this up? We now have the patches for FLR mirroring as with LU-12890 .

            No has been put on the back burner. Andreas recommend that instead of using HSM as the backend for migration that we use FLR mirroring instead. This is being discussed on the lustreclient slack channel.

            simmonsja James A Simmons added a comment - No has been put on the back burner. Andreas recommend that instead of using HSM as the backend for migration that we use FLR mirroring instead. This is being discussed on the lustreclient slack channel.

            Marked as "merged" - is this landed? Fix version?

            nrutman Nathan Rutman added a comment - Marked as "merged" - is this landed? Fix version?

            People

              bevans Ben Evans (Inactive)
              fzago Frank Zago (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              29 Start watching this issue

              Dates

                Created:
                Updated: