Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-3006

failover nids added after format don't propogate to clients

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • Lustre 2.4.0
    • Lustre 2.4.0
    • Lustre 2.3.61 server. The lack of failover nids in the osc imports field also happens on 1.8 clients so it is a server side issue.
    • 3
    • 7319

    Description

      While preparing for our test shot after we formatted the new file system we remembered to add the failover nids for IR testing. So we umounted the file system and used tunefs.lustre to add the failover nids. Then proceeded remount the file system in order of MGS, MDT, then serially the OSTs. After mounting a client to validate that the proper failover nids toke hold we discovered that in the osc imports failover nids were never updated. So with a second try we ran tunefs.lustre to remove the failover nids. Then started and stopped the file system. Added the failover nids with tunefs.lustre and then restarted the file system the second time. Again lctl get_param osc-*..../import | grep failover_nids did not show the failover nids getting updated.

      Attachments

        Issue Links

          Activity

            [LU-3006] failover nids added after format don't propogate to clients

            landed

            bzzz Alex Zhuravlev added a comment - landed

            not a final one, but the idea was to search for another place in the superblock to store the flag. working on this...

            bzzz Alex Zhuravlev added a comment - not a final one, but the idea was to search for another place in the superblock to store the flag. working on this...

            Alex, did we come up with a solution to this problem? This is one of the more noticeable regressions in 2.4.0 for end users.

            adilger Andreas Dilger added a comment - Alex, did we come up with a solution to this problem? This is one of the more noticeable regressions in 2.4.0 for end users.

            Alex, I'm not a big fan of abusing the label for an increasing number of different flag meanings. The main problem is that this permanently changes the target name that is visible to applications, and there are times when it is not reset. For example, this would prevent the filesystem from being mounted by label after a reboot.

            Is there no other field we can use to pass information to the MGS?

            adilger Andreas Dilger added a comment - Alex, I'm not a big fan of abusing the label for an increasing number of different flag meanings. The main problem is that this permanently changes the target name that is visible to applications, and there are times when it is not reset. For example, this would prevent the filesystem from being mounted by label after a reboot. Is there no other field we can use to pass information to the MGS?
            bzzz Alex Zhuravlev added a comment - http://review.whamcloud.com/5982

            As long as it preserves previous behavior it sounds reasonable. Looking forward to a patch for this.

            simmonsja James A Simmons added a comment - As long as it preserves previous behavior it sounds reasonable. Looking forward to a patch for this.

            I'm tempted to introduce another delimiter in the label to store UPDATE.. should be enough and trivial to implement. any thoughts?

            bzzz Alex Zhuravlev added a comment - I'm tempted to introduce another delimiter in the label to store UPDATE.. should be enough and trivial to implement. any thoughts?

            People

              bzzz Alex Zhuravlev
              simmonsja James A Simmons
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: