Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-16159

remove update llog files after recovery abort

Details

    • 2
    • 9223372036854775807

    Description

      Once recovery is aborted, the existing update logs should be removed, because they are used for recovery only, and if it's corrupt, or inaccessible, if they are kept there after recovery abort, the next recovery will meet with the same issue again, besides, the log file size may become large, retrieving them in recovery may lead to recovery timeout.

      Besides this, in LOD device initialization, it should do some sanity check for all update logs on it, if the log file is not accessible (e.g. OI mapping mismatch, which may lead to recovery timeout too), delete the log file FID from update catalog (do not delete this file at the moment because it's inaccessible) so that the log file is not visible to others.

      Attachments

        Issue Links

          Activity

            [LU-16159] remove update llog files after recovery abort
            pjones Peter Jones added a comment -

            Patches already landed are what will be in 2.16.

            pjones Peter Jones added a comment - Patches already landed are what will be in 2.16.

            "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51224
            Subject: LU-16159 osp: destroy should not overtake writes
            Project: fs/lustre-release
            Branch: b2_15
            Current Patch Set: 1
            Commit: acfee963c6999785afdd3b25c83015695099a3b9

            gerrit Gerrit Updater added a comment - "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51224 Subject: LU-16159 osp: destroy should not overtake writes Project: fs/lustre-release Branch: b2_15 Current Patch Set: 1 Commit: acfee963c6999785afdd3b25c83015695099a3b9

            "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51223
            Subject: LU-16159 lod: cancel update llogs upon recovery abort
            Project: fs/lustre-release
            Branch: b2_15
            Current Patch Set: 1
            Commit: a544f6c69fe4b2eb81f4b05581654325ecc96f93

            gerrit Gerrit Updater added a comment - "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/51223 Subject: LU-16159 lod: cancel update llogs upon recovery abort Project: fs/lustre-release Branch: b2_15 Current Patch Set: 1 Commit: a544f6c69fe4b2eb81f4b05581654325ecc96f93

            Is there anything left for this ticket, or can it be resolved?

            adilger Andreas Dilger added a comment - Is there anything left for this ticket, or can it be resolved?

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49787/
            Subject: LU-16159 osp: destroy should not overtake writes
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 5a5bd5b4dafaf252c641b8afd2cd809de7384f4f

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49787/ Subject: LU-16159 osp: destroy should not overtake writes Project: fs/lustre-release Branch: master Current Patch Set: Commit: 5a5bd5b4dafaf252c641b8afd2cd809de7384f4f
            hxing Xing Huang added a comment -

            "Alex Zhuravlev <bzzz@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49787
            Subject: LU-16159 osp: destroy should not overtake writes
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 0e0993837d0e13bd31d53daea04e7259be6c1c4c

            hxing Xing Huang added a comment - "Alex Zhuravlev <bzzz@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49787 Subject: LU-16159 osp: destroy should not overtake writes Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 0e0993837d0e13bd31d53daea04e7259be6c1c4c

            "Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49633
            Subject: LU-16159 tests: cleanup replay-single code style
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: cb4e810ddfb11aab30ea6ad6b40ff371c04ddac1

            gerrit Gerrit Updater added a comment - "Andreas Dilger <adilger@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49633 Subject: LU-16159 tests: cleanup replay-single code style Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: cb4e810ddfb11aab30ea6ad6b40ff371c04ddac1

            Reopening this so that it is being tracked for 2.16 due to latest patch.

            adilger Andreas Dilger added a comment - Reopening this so that it is being tracked for 2.16 due to latest patch.

            "Lai Siyao <lai.siyao@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49526
            Subject: LU-16159 target: race in update log cancel
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 983a53137bfd26e6baaba4575a9dbb379e358b76

            gerrit Gerrit Updater added a comment - "Lai Siyao <lai.siyao@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49526 Subject: LU-16159 target: race in update log cancel Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 983a53137bfd26e6baaba4575a9dbb379e358b76

            But won't that defeat the whole purpose of the fix? Could something else be done, like forcing rollover to new logs and then cancelling the old logs, so that in-use logs are not removed?

            adilger Andreas Dilger added a comment - But won't that defeat the whole purpose of the fix? Could something else be done, like forcing rollover to new logs and then cancelling the old logs, so that in-use logs are not removed?

            People

              laisiyao Lai Siyao
              laisiyao Lai Siyao
              Votes:
              0 Vote for this issue
              Watchers:
              13 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: