Details
-
Improvement
-
Resolution: Unresolved
-
Minor
-
None
-
None
-
None
-
9223372036854775807
Description
Currently, if a directory migration has been interrupted (e.g. MDS crash), nothing is done to resume the migration operation. This is precautionary, in case there was something with the migration operation itself that caused the problem.
However, to resume the migration operation, the exact same options for "lfs migrate -m" need to be specified by the user in order to resume/finish the migration. It would be much more convenient in this case if the migration could be resumed without having to specify the same options, and the MDS would just "know" these options if asked to migrate the directory again. That simplifies the user handling, and (AFAICS) does not add any risk since the migration request from the user will fail if the same options are not specified.
Attachments
Issue Links
- is related to
-
LU-14719 "lfs migrate -m" creates broken agent inodes when target MDT full
-
- Resolved
-
-
LU-14975 DNE3: directory migration in non-recursive mode
-
- Resolved
-
-
LU-14211 DNE3: mechanism to interrupt and resume migration
-
- Open
-
-
LU-11776 add "lfs find" support for directory hash flags
-
- Resolved
-
-
LU-15990 "lfs find" to scan for directory hash flags
-
- Resolved
-
- is related to
-
LU-13492 lfs migrate -m returns Operation not permitted
-
- Open
-
-
LU-14975 DNE3: directory migration in non-recursive mode
-
- Resolved
-
If letting the MDS continue the migration for that directory is easier, then that is fine with me. If it doesn't happen automatically, then there is less concern if there is a problem with the migration itself. I think in most cases of interrupted migration it is because the MDS rebooted for another reason, especially because recursive migrate of a large directory tree (possibly the whole filesystem) may take a very long time, and there is (AFAIK) no easy way to monitor if it is finished or how close it is to being finished.
I think the most important part is to make it easy to finish the migration (not require users to specify the options that the MDS already knows).
Separately, it probably makes sense for the migration to stop itself when it is close to doing something bad (eg. target MDT is almost full), and refuse to do "bad" migration requests (eg. migrate a whole directory tree to be striped), but those should be separate patches.