Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-4876

LFSCK remove entry from /REMOTE_PARENT_DIR if MDT-object name reside on the same MDT after migration

Details

    • Bug
    • Resolution: Unresolved
    • Major
    • None
    • None
    • 3
    • 13487

    Description

      When migrate directory MDT-object_A's metadata from MDT_x to MDT_y, all its children name entries needs to be moved to MDT_y. If some name entry originally referenced a remote MDT-object_B which was linked under "/REMOTE_PARENT_DIR" with a dummy name entry on MDT_y, but after the migration, the MDT-object_B and its new name entry resides on the same MDT_y, then the dummy entry for the MDT-object_B should be removed from "REMOTE_PARENT_DIR/" iff there are no links remaining on that MDT.

      Attachments

        Issue Links

          Activity

            [LU-4876] LFSCK remove entry from /REMOTE_PARENT_DIR if MDT-object name reside on the same MDT after migration
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-15383 [ LU-15383 ]
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-15388 [ LU-15388 ]
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-14168 [ LU-14168 ]

            I've also seen a few cases where e2fsck complains about hard-linked directories in REMOTE_PARENT_DIR:

            Entry '0x200007168:0x4af:0x0' in /REMOTE_PARENT_DIR (1119354881) is a link to directory /ROOT/.lustre/lost+found/MDT0000/[0x200005233:0x89:0x0]-O-0 (8938935).
            Clear? yes
            

            It looks like e2fsck fixes this by removing the entry from REMOTE_PARENT_DIR, but LFSCK shouldn't get into this situation in the first place.

            adilger Andreas Dilger added a comment - I've also seen a few cases where e2fsck complains about hard-linked directories in REMOTE_PARENT_DIR : Entry '0x200007168:0x4af:0x0' in /REMOTE_PARENT_DIR (1119354881) is a link to directory /ROOT/.lustre/lost+found/MDT0000/[0x200005233:0x89:0x0]-O-0 (8938935). Clear? yes It looks like e2fsck fixes this by removing the entry from REMOTE_PARENT_DIR, but LFSCK shouldn't get into this situation in the first place.
            chunteraa Chris Hunter (Inactive) made changes -
            Link New: This issue is related to DDN-1999 [ DDN-1999 ]

            Artem, it just looks that way because of the e2fsck messages:

            Pass 5: Checking group summary information
            Free blocks count wrong (32862, counted=32853).
            Free inodes count wrong (99721, counted=99718).
            

            which usually means that the filesystem is mounted, or at least was not unmounted cleanly.

            adilger Andreas Dilger added a comment - Artem, it just looks that way because of the e2fsck messages: Pass 5: Checking group summary information Free blocks count wrong (32862, counted=32853). Free inodes count wrong (99721, counted=99718). which usually means that the filesystem is mounted, or at least was not unmounted cleanly.

            adilger Thanks for idea. I believe the target was not mounted during fsck. I used CLEANUP=cleanupall option.
            laisiyao will try with patches from LU-4684. Thanks.

            artem_blagodarenko Artem Blagodarenko (Inactive) added a comment - adilger Thanks for idea. I believe the target was not mounted during fsck. I used CLEANUP=cleanupall option. laisiyao will try with patches from LU-4684 . Thanks.
            adilger Andreas Dilger made changes -
            Description Original: When migrate directory MDT-object_A's metadata from MDT_x to MDT_y, all its children name entries needs to be moved to MDT_y. If some name entry originally referenced a remote MDT-object_B which was linked under "/REMOTE_PARENT_DIR" with a dummy name entry on MDT_y, but after the migration, the MDT-object_B and its new name entry resides on the same MDT_y, then the dummy entry for the MDT-object_B should be removed from "/REMOTE_PARENT_DIR". New: When migrate directory MDT-object_A's metadata from MDT_x to MDT_y, all its children name entries needs to be moved to MDT_y. If some name entry originally referenced a remote MDT-object_B which was linked under "/REMOTE_PARENT_DIR" with a dummy name entry on MDT_y, but after the migration, the MDT-object_B and its new name entry resides on the same MDT_y, then the dummy entry for the MDT-object_B should be removed from "{{REMOTE_PARENT_DIR/}}" *iff* there are no links remaining on that MDT.

            Artem, I also noticed this, but it was fixed (at least for 2.12) by unmounting the MDT and running e2fsck on the unmounted device. While it was mounted, the change was only in the journal and did not show up in the filesystem itself.

            adilger Andreas Dilger added a comment - Artem, I also noticed this, but it was fixed (at least for 2.12) by unmounting the MDT and running e2fsck on the unmounted device. While it was mounted, the change was only in the journal and did not show up in the filesystem itself.
            adilger Andreas Dilger made changes -
            Link New: This issue is related to LU-10329 [ LU-10329 ]

            People

              laisiyao Lai Siyao
              yong.fan nasf (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated: