Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4640

Last unlink should trigger HSM remove by default

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.10.0
    • Lustre 2.5.0
    • 3
    • 12691

    Description

      In current code, when an archived file is removed from the file system, no HSM request is triggered. We rely on Changelogs and RobinHood reading them to have it sending corresponding hsm_remove requests to clean those orphans in HSM backend. This is done in purpose, to provide a way to implement "soft unlink". RobinHood will remove the file in the backend after a grace time. During this time, Admins could restore the file from the HSM if they want to (using import feature).

      Requiring a RobinHood setup to handle this cleaning is a too big limitation. We should consider modifying this behaviour.

      By default, Lustre should automatically add a HSM_REMOVE request for any last unlink. This way, no file will be leaked in the archive.
      A tunable should be added to disable this behaviour (should we add this to hsm_policy?) and go back to a mode where an external component is responsible for tracking UNLINK changelogs and add hsm_remove requests when needed (Robinhood) (current behaviour).

      Attachments

        Issue Links

          Activity

            [LU-4640] Last unlink should trigger HSM remove by default
            mdiep Minh Diep made changes -
            Link Original: This issue is related to JFC-19 [ JFC-19 ]
            mdiep Minh Diep made changes -
            Labels Original: HSM New: HSM patch
            lixi Li Xi (Inactive) made changes -
            Link New: This issue is related to LU-9654 [ LU-9654 ]
            pjones Peter Jones made changes -
            Resolution New: Fixed [ 1 ]
            Status Original: Reopened [ 4 ] New: Resolved [ 5 ]
            pjones Peter Jones added a comment -

            Landed for 2.10

            pjones Peter Jones added a comment - Landed for 2.10
            pjones Peter Jones made changes -
            Fix Version/s New: Lustre 2.10.0 [ 12204 ]
            Fix Version/s Original: Lustre 2.11.0 [ 13091 ]

            Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/18946/
            Subject: LU-4640 mdt: implement Remove Archive on Last Unlink policy
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: dd4b034540d7dda499ebbb8c465d3435ad46b82a

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/18946/ Subject: LU-4640 mdt: implement Remove Archive on Last Unlink policy Project: fs/lustre-release Branch: master Current Patch Set: Commit: dd4b034540d7dda499ebbb8c465d3435ad46b82a

            The patch #26980 was on the top of version 6 of path #18946. And version 8 of patch #18946 looks better than #26980 now.

            lixi Li Xi (Inactive) added a comment - The patch #26980 was on the top of version 6 of path #18946. And version 8 of patch #18946 looks better than #26980 now.

            I could have pushed a new patch using the the same change ID of #18946. But I am still not sure whether my thought is correct or not

            lixi Li Xi (Inactive) added a comment - I could have pushed a new patch using the the same change ID of #18946. But I am still not sure whether my thought is correct or not

            Hi Bruno,

            Thanks for helping. Yeah, I create a new patch based on your latest patch.

            I first improved the test 26c and 26d. And after that, the test results immediately shows that the mdt_attr_get_complex returns -2 because the object is deleted. That means, the MA_HSM and MA_INODE needs to be read before unlink happens. That is why I split mdt_handle_last_unlink to mdt_handle_last_unlink_prepare and mdt_handle_last_unlink_commit.

            Let's see what is the test result thi time.

            lixi Li Xi (Inactive) added a comment - Hi Bruno, Thanks for helping. Yeah, I create a new patch based on your latest patch. I first improved the test 26c and 26d. And after that, the test results immediately shows that the mdt_attr_get_complex returns -2 because the object is deleted. That means, the MA_HSM and MA_INODE needs to be read before unlink happens. That is why I split mdt_handle_last_unlink to mdt_handle_last_unlink_prepare and mdt_handle_last_unlink_commit. Let's see what is the test result thi time.

            People

              bfaccini Bruno Faccini (Inactive)
              adegremont Aurelien Degremont (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              13 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: