Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10360

use Imperative Recovery logs for client->MDT/OST connections

Details

    • New Feature
    • Resolution: Unresolved
    • Minor
    • None
    • None
    • None
    • 9223372036854775807

    Description

      The Imperative Recovery (IR) feature landed in LU-19 created a dynamic list of active server NIDs on the MGS for purposes of speeding up client recovery when a target failed over to another server node. A server failure triggered a notification from the MGS to the client to update its target NIDs to reconnect to the recovered server more quickly.

      It would be possible to extend this mechanism to also use the MGS IR log to do initial client mount, so that the MGS did not need to store the OST/MDT NIDs statically in the config log, but rather get the current NIDs directly from the dynamic MGS log. This would facilitate Lustre running in configurations where the server NIDs are not static (e.g. cloud, DHCP, etc). The initial connection to the MGS node(s) can already be done using the MGS hostname, since mount.lustre will do DNS name resolution.

      Some care would be needed when OSTs are being registered with the MGS, especially in testing environments where OSTs are reformatted regularly and often use the same fsname, since this may allow OSTs to register with the MGS that do not actually belong to the same filesystem.

      Attachments

        Issue Links

          Activity

            [LU-10360] use Imperative Recovery logs for client->MDT/OST connections

            "Mikhail Pershin <mpershin@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/58595
            Subject: LU-10360 mgc: client to mount without NIDs in config
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 5434b0582c2ffb46de2883e2fd882d89dc9f79bd

            gerrit Gerrit Updater added a comment - "Mikhail Pershin <mpershin@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/58595 Subject: LU-10360 mgc: client to mount without NIDs in config Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 5434b0582c2ffb46de2883e2fd882d89dc9f79bd

            It seems possible to transition systems to using dynamic NIDs by putting "lctl set_param mgc.*.dynamic_nids=1" as one of the first records in the config llog, so that clients will allow IR to determine where the MDTs and OSTs are located. After that is done, it would just be a matter of how long to create config llog records with the NIDs in them for backward compatibility. The original patch landed as v2_13_55-106-g37be05eca3, so it is in all 2.14.0 and later releases.

            adilger Andreas Dilger added a comment - It seems possible to transition systems to using dynamic NIDs by putting " lctl set_param mgc.*.dynamic_nids=1 " as one of the first records in the config llog, so that clients will allow IR to determine where the MDTs and OSTs are located. After that is done, it would just be a matter of how long to create config llog records with the NIDs in them for backward compatibility. The original patch landed as v2_13_55-106-g37be05eca3, so it is in all 2.14.0 and later releases.

            I think in conjunction with LU-10359 it should be possible to test a configuration that doesn't have server NIDs in the configuration at all, or totally incorrect NIDs in the config llog, to confirm that this is working properly.

            A conf-sanity test case should be added to confirm that this is working properly.

            adilger Andreas Dilger added a comment - I think in conjunction with LU-10359 it should be possible to test a configuration that doesn't have server NIDs in the configuration at all, or totally incorrect NIDs in the config llog, to confirm that this is working properly. A conf-sanity test case should be added to confirm that this is working properly.

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/50000/
            Subject: LU-10360 ldlm: remove client_import_find_conn()
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 14544bdca5cc42a3ea80fe665e332fe4c88b081a

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/50000/ Subject: LU-10360 ldlm: remove client_import_find_conn() Project: fs/lustre-release Branch: master Current Patch Set: Commit: 14544bdca5cc42a3ea80fe665e332fe4c88b081a

            "Neil Brown <neilb@suse.de>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/50000
            Subject: LU-10360 ldlm: remove client_import_find_conn()
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 12cbeaf1fb7bc83d7a842b71d6e8a33601e085ce

            gerrit Gerrit Updater added a comment - "Neil Brown <neilb@suse.de>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/50000 Subject: LU-10360 ldlm: remove client_import_find_conn() Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 12cbeaf1fb7bc83d7a842b71d6e8a33601e085ce

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/39911/
            Subject: LU-10360 tests: test dynamic NIDs feature
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 2553f2fc8630061a8b6dbc5504d3f5277cb1cecf

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/39911/ Subject: LU-10360 tests: test dynamic NIDs feature Project: fs/lustre-release Branch: master Current Patch Set: Commit: 2553f2fc8630061a8b6dbc5504d3f5277cb1cecf

            "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/45905
            Subject: LU-10360 mgc: Use IR for client->MDS/OST connections
            Project: fs/lustre-release
            Branch: b2_12
            Current Patch Set: 1
            Commit: b1c09656513f3198adf849182617e6eafef76954

            gerrit Gerrit Updater added a comment - "Etienne AUJAMES <eaujames@ddn.com>" uploaded a new patch: https://review.whamcloud.com/45905 Subject: LU-10360 mgc: Use IR for client->MDS/OST connections Project: fs/lustre-release Branch: b2_12 Current Patch Set: 1 Commit: b1c09656513f3198adf849182617e6eafef76954

            Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/40736
            Subject: LU-10360 mgs: Mount to dynamically added networks
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 4c68340088f2f56d16f6b1392de5ad7f7d139ff4

            gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/40736 Subject: LU-10360 mgs: Mount to dynamically added networks Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 4c68340088f2f56d16f6b1392de5ad7f7d139ff4

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/39613/
            Subject: LU-10360 mgc: Use IR for client->MDS/OST connections
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 37be05eca3f4aee15c946656a77f56967c15253d

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/39613/ Subject: LU-10360 mgc: Use IR for client->MDS/OST connections Project: fs/lustre-release Branch: master Current Patch Set: Commit: 37be05eca3f4aee15c946656a77f56967c15253d

            Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39911
            Subject: LU-10360 tests: test dynamic NIDs feature
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: d0bfbcb3bb643ce6dc33590bd937cb3c935ac88a

            gerrit Gerrit Updater added a comment - Amir Shehata (ashehata@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39911 Subject: LU-10360 tests: test dynamic NIDs feature Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: d0bfbcb3bb643ce6dc33590bd937cb3c935ac88a

            People

              tappro Mikhail Pershin
              adilger Andreas Dilger
              Votes:
              1 Vote for this issue
              Watchers:
              16 Start watching this issue

              Dates

                Created:
                Updated: