Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.4.0
    • Lustre 2.4.0
    • None
    • 3
    • 5779

    Description

      Once the "layout swap" support in LU-2017 (http://review.whamcloud.com/4507) is landed, the client will have an API to atomically swap the layout between two files.

      In order for this functionality to be useful for users, the llapi_layout_swap() function needs to be wrapped in some additional code compared to lfs swap_layouts to ensure that the file isn't changing during the copy, and to do the actual copy.

      I propose an lfs migrate command that takes the same arguments as lfs setstripe (calling into lfs_setstripe() to create the target file, for simplicity and compatibility, though with some option to make it an open-unlinked file via LU-2441), gets an exclusive lock on the file (group lock or client-side layout lock), and then copies the file contents from the source to the target. By reading the whole source file, this will natually cause any clients with unwritten data to flush their caches when their write-extent locks are cancelled. Once the copy is completed (optionally verifying the data checksum after flushing the client cache?) the llapi_layout_swap() function is called, and the "new" open-unlinked file descriptor layout (which now points to the old objects) is closed, causing those objects to be destroyed. The exclusive lock can be dropped.

      If the MDS does not support MDS_SWAP_LAYOUTS then lfs migrate should return an -EOPNOTSUPP error, so that users are aware that atomic layout swap is not available.

      The lfs_migrate script should call the lfs migrate command to do the migrate/copy (instead of rsync + mv). but lfs_migrate probably needs to fall back to rsync+mv again. The lfs_migrate script has not previously guaranteed atomic migration, so it should continue to work using rsync+mv as it has in the past if "lfs migrate" returns EOPNOTSUPP, with a comment to the effect that this functionality should be removed after Lustre 2.10 or so.

      Attachments

        Issue Links

          Activity

            [LU-2445] add "lfs migrate" support

            JC, since the current behaviour of setstripe on an existing file is to return an error, it isn't clear to me if changing this to do internal migration and data copying would be "obvious" to users or not. It does have a certain appeal, however, so I think it should be discussed more.

            Could you please send and email (hpdd-discuss and lustre-discuss) to ask for done input on the user interface. I was thinking it makes more sense to require an explicit call to migrate the file data, since this may take a long time.

            If the setstripe approach is taken, it should definitely only migrate the file if a different layout is explicitly given.

            adilger Andreas Dilger added a comment - JC, since the current behaviour of setstripe on an existing file is to return an error, it isn't clear to me if changing this to do internal migration and data copying would be "obvious" to users or not. It does have a certain appeal, however, so I think it should be discussed more. Could you please send and email (hpdd-discuss and lustre-discuss) to ask for done input on the user interface. I was thinking it makes more sense to require an explicit call to migrate the file data, since this may take a long time. If the setstripe approach is taken, it should definitely only migrate the file if a different layout is explicitly given.

            OpenEx + dataversion is perfect I will implement this soon

            jcl jacques-charles lafoucriere added a comment - OpenEx + dataversion is perfect I will implement this soon

            Here is a flow chart for migration I proposed in Beijing. Please take a look.

            jay Jinshan Xiong (Inactive) added a comment - Here is a flow chart for migration I proposed in Beijing. Please take a look.

            I will initially implement it in lfs_setstripe() directly so any re-stripe call will do the migration. I think I have to take a group lock for force flush from others and get exclusive access during copy.

            jcl jacques-charles lafoucriere added a comment - I will initially implement it in lfs_setstripe() directly so any re-stripe call will do the migration. I think I have to take a group lock for force flush from others and get exclusive access during copy.

            Please assign me this LU (or to jody) , I will work on a patch

            jcl jacques-charles lafoucriere added a comment - Please assign me this LU (or to jody) , I will work on a patch
            rread Robert Read added a comment -

            If the new objects have earlier version numbers than the old objects (because they are on new OSTs perhaps), then will this have a effect on HSM? Will HSM still archive the data if the version looks older than what is currently in the archive?

            rread Robert Read added a comment - If the new objects have earlier version numbers than the old objects (because they are on new OSTs perhaps), then will this have a effect on HSM? Will HSM still archive the data if the version looks older than what is currently in the archive?

            Johann, can you please file a bug with the details of what still needs to be implemented for the full layout lock support, or if one already exists please link it to this bug.

            adilger Andreas Dilger added a comment - Johann, can you please file a bug with the details of what still needs to be implemented for the full layout lock support, or if one already exists please link it to this bug.

            Andreas, i am afraid that the current layout lock implementation is only suitable for HSM and more work is required to support layout revocation of opened files (for which there can be read/write request in flight).

            johann Johann Lombardi (Inactive) added a comment - Andreas, i am afraid that the current layout lock implementation is only suitable for HSM and more work is required to support layout revocation of opened files (for which there can be read/write request in flight).

            People

              jay Jinshan Xiong (Inactive)
              adilger Andreas Dilger
              Votes:
              0 Vote for this issue
              Watchers:
              16 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: