Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-7340

ChangeLogs catalog full condition should be handled more gracefully

Details

    • Improvement
    • Resolution: Fixed
    • Critical
    • Lustre 2.11.0, Lustre 2.10.3
    • Lustre 2.8.0, Lustre 2.5.4
    • None
    • 9223372036854775807

    Description

      Presently when a LLOG Catalog wraps and its latest assigned index collides with its oldest and still in use index, ENOSPC is returned and the caller just ignore the fact that LLOG record could not be written.

      For ChangeLogs specific usage, some actions could be attempted to recover space/records, some ideas have already been detailed in LU-6556, but it seems better to address them in this separate ticket.

      Input from Andreas :
      I think the other thing that is needed here is to automatically unregister ChangeLog watcher(s) if the changelog is full or the MDS runs out of space (by default), or block all MDS operations until the ChangeLog can be written (if /proc tunable is set to make ChangeLog updates mandatory). It should unregister starting with the oldest watcher on the assumption that the older watcher was forgotten and newer ones are still running, and that this will release the most space. The unregistration should cancel records up to the next watcher, or all remaining records if no other watchers are left.

      Input from Robert :
      I suggest going a step further and proactively remove stale watchers after a configurable period or when hitting a max watermark to try o avoid running out of space. Also, being unregistered is a reasonable notification to the application that they've lost their changelog feed and need to resync.

      Attachments

        Issue Links

          Activity

            [LU-7340] ChangeLogs catalog full condition should be handled more gracefully
            chunteraa Chris Hunter (Inactive) made changes -
            Link New: This issue is related to DDN-2154 [ DDN-2154 ]
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-12871 [ LU-12871 ]
            chunteraa Chris Hunter (Inactive) made changes -
            Link New: This issue is related to DDN-881 [ DDN-881 ]
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-10680 [ LU-10680 ]
            chunteraa Chris Hunter (Inactive) made changes -
            Link New: This issue is related to DDN-499 [ DDN-499 ]
            mdiep Minh Diep made changes -
            Labels Original: LTS
            bfaccini Bruno Faccini (Inactive) made changes -
            Link New: This issue is related to SEA-483 [ SEA-483 ]

            John L. Hammond (john.hammond@intel.com) merged in patch https://review.whamcloud.com/30573/
            Subject: LU-7340 mdd: changelogs garbage collection
            Project: fs/lustre-release
            Branch: b2_10
            Current Patch Set:
            Commit: b87511a0578a03447c51a8495966d60c90fcee61

            gerrit Gerrit Updater added a comment - John L. Hammond (john.hammond@intel.com) merged in patch https://review.whamcloud.com/30573/ Subject: LU-7340 mdd: changelogs garbage collection Project: fs/lustre-release Branch: b2_10 Current Patch Set: Commit: b87511a0578a03447c51a8495966d60c90fcee61
            mdiep Minh Diep made changes -
            Fix Version/s New: Lustre 2.10.3 [ 13591 ]

            Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/30573
            Subject: LU-7340 mdd: changelogs garbage collection
            Project: fs/lustre-release
            Branch: b2_10
            Current Patch Set: 1
            Commit: 12c8163c17149d2afed78e2ba84da624dd920b34

            gerrit Gerrit Updater added a comment - Minh Diep (minh.diep@intel.com) uploaded a new patch: https://review.whamcloud.com/30573 Subject: LU-7340 mdd: changelogs garbage collection Project: fs/lustre-release Branch: b2_10 Current Patch Set: 1 Commit: 12c8163c17149d2afed78e2ba84da624dd920b34

            People

              bfaccini Bruno Faccini (Inactive)
              bfaccini Bruno Faccini (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: