Details

    • Technical task
    • Resolution: Fixed
    • Critical
    • Lustre 2.6.0
    • Lustre 2.6.0
    • None
    • 12614

    Description

      LFSCK needs to log with D_LFSCK all fixes that it makes and any inconsistencies that it finds that it does not repair (e.g. unknown LOV magic layouts). Otherwise it will be making secret changes to the filesystem and when there are problems they will be impossible to debug.

      Also, if LFSCK is fining inconsistencies in the filesystem it is important for administrators and developers to know this in case they are being caused by bugs in the Lustre code or due to hardware or other problems.

      I think logging inconsistencies (whether repaired or not) via D_LFSCK is the best way to do this. It would be useful to have a common format for such messages, so they could be parsed later to generate pathnames from the FIDs in the log messages. We should avoid pruning D_LFSCK messages that are not related to problems in the filesystem, except for limited start and stop messages. It might be necessary to quiet messages about adding individual FIDs to the OI file if the file was removed, since that will be too many messages on a large filesystem.

      The D_LFSCK flag should be added to the default flags that are enabled all the time, since this should not normally generate any log messages on a correct filesystem (except limited start/stop messages).

      Attachments

        Issue Links

          Activity

            [LU-4610] LFSCK needs to log all changes and errors found
            doug Doug Oucharek (Inactive) made changes -
            Resolution New: Fixed [ 1 ]
            Status Original: Reopened [ 4 ] New: Closed [ 6 ]
            adilger Andreas Dilger made changes -
            Resolution Original: Fixed [ 1 ]
            Status Original: Resolved [ 5 ] New: Reopened [ 4 ]

            Still one patch to land to turn the LFSCK debugging on by default.

            adilger Andreas Dilger added a comment - Still one patch to land to turn the LFSCK debugging on by default.

            Patch to turn on LFSCK debug messages in our test infrastructure is at: http://review.whamcloud.com/#/c/10814

            jamesanunez James Nunez (Inactive) added a comment - Patch to turn on LFSCK debug messages in our test infrastructure is at: http://review.whamcloud.com/#/c/10814
            pjones Peter Jones made changes -
            Resolution New: Fixed [ 1 ]
            Status Original: Open [ 1 ] New: Resolved [ 5 ]
            pjones Peter Jones added a comment -

            Landed for 2.6

            pjones Peter Jones added a comment - Landed for 2.6
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-5202 [ LU-5202 ]

            I have enhanced the patch http://review.whamcloud.com/#/c/10579/ to add more D_LFSCK message for LFSCK check/repair, and enable D_LFSCK by default for tests.

            yong.fan nasf (Inactive) added a comment - I have enhanced the patch http://review.whamcloud.com/#/c/10579/ to add more D_LFSCK message for LFSCK check/repair, and enable D_LFSCK by default for tests.

            I think it also makes sense to enable D_LFSCK logging by default, once noisy messages are quieted (in patch 10579).

            adilger Andreas Dilger added a comment - I think it also makes sense to enable D_LFSCK logging by default, once noisy messages are quieted (in patch 10579).

            Since then, I prefer to split the work into two parts:

            1) The first part will add/remove/change the D_LFSCK debug log in master, and the users can dump the LFSCK log via "lctl debug_kernel". Such patch should be landed to Lustre-2.6

            2) The second part will implement some mechanism to start/stop/filter/collect the LFSCK logs automatically. Such patch will be landed to Lustre-2.7 or later.

            yong.fan nasf (Inactive) added a comment - Since then, I prefer to split the work into two parts: 1) The first part will add/remove/change the D_LFSCK debug log in master, and the users can dump the LFSCK log via "lctl debug_kernel". Such patch should be landed to Lustre-2.6 2) The second part will implement some mechanism to start/stop/filter/collect the LFSCK logs automatically. Such patch will be landed to Lustre-2.7 or later.

            People

              yong.fan nasf (Inactive)
              adilger Andreas Dilger
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: