Details

    • Bug
    • Resolution: Won't Fix
    • Minor
    • Lustre 2.1.0
    • Lustre 2.0.0
    • None
    • 3
    • 22,423
    • 4933

    Description

      It seems that clients can flood a server with reconnect requests
      when the server is returning EBUSY because it is still processing
      requests from the old connection.

      For example, seen on 1.8.2 on a cluster with ~800 clients:

      Mar 3 20:26:16 md061i kernel: Lustre: 27033:0:(ldlm_lib.c:835:target_handle_connect())
      share3-MDT0000: refuse reconnection from eb9b2b28-5e23-8dc1-7024-7810bf8a74ff@173.25.10.184@o2ib to
      0xffff81039e70f000; still busy with 1 active RPCs
      Mar 3 20:26:16 md061i kernel: Lustre: 27033:0:(ldlm_lib.c:835:target_handle_connect()) Skipped
      4527 previous similar messages
      Mar 3 20:26:18 md061i kernel: Lustre: 27116:0:(ldlm_lib.c:835:target_handle_connect())
      share3-MDT0000: refuse reconnection from eb9b2b28-5e23-8dc1-7024-7810bf8a74ff@173.25.10.184@o2ib to
      0xffff81039e70f000; still busy with 1 active RPCs
      Mar 3 20:26:18 md061i kernel: Lustre: 27116:0:(ldlm_lib.c:835:target_handle_connect()) Skipped
      10580 previous similar messages
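
To put a rough number on the flood in the excerpt above, the "Skipped N previous similar messages" counts can be turned into a rate. This is only a sketch under two assumptions: that the second count (10580) covers the interval since the previously printed message, and that the interval is the 2 seconds between the 20:26:16 and 20:26:18 timestamps, which have one-second resolution. The helper name `refusals_per_second` is hypothetical, and the count may include refusals sent to other clients if suppression is tracked per call site.

```c
#include <stdio.h>

/*
 * Estimate refused reconnects per second from a rate-limited console log.
 *
 * skipped:    N from a "Skipped N previous similar messages" line
 * interval_s: seconds since the previously printed message (must be > 0)
 *
 * Hypothetical helper for illustration; not part of Lustre.
 */
static double refusals_per_second(unsigned long skipped, unsigned interval_s)
{
    /* +1 accounts for the one message that was actually printed. */
    return (double)(skipped + 1) / (double)interval_s;
}
```

With the values from the log, `refusals_per_second(10580, 2)` evaluates to 5290.5, i.e. on the order of five thousand refused reconnects per second on this MDT.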

      From code review, this looks like a side effect of bug 18674.
      Since we now bypass import_select_connection() on EBUSY and EAGAIN,
      ptlrpc_connect_interpret() -> ptlrpc_maybe_ping_import_soon() always
      triggers an immediate ping, causing clients to reconnect in a busy loop.
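
The difference between an immediate retry and a throttled one can be sketched in a few lines of C. This is not Lustre code and not the change that was later integrated for this ticket. Every name here (`connect_result`, `next_reconnect_delay_ms`, `count_attempts`) and both backoff constants are hypothetical, and the unthrottled case assumes a nominal 1 ms per attempt purely for illustration.

```c
#include <stdio.h>

/* Hypothetical result codes for one connect attempt (not Lustre's). */
enum connect_result { CONNECT_OK = 0, CONNECT_BUSY = 1 };

/* Hypothetical policy knobs, chosen only for this illustration. */
#define BACKOFF_MIN_MS 100u
#define BACKOFF_MAX_MS 8000u

/*
 * Return how long to wait before the next reconnect attempt.
 * prev_delay_ms is 0 before the first refusal. The delay doubles on
 * each busy reply and is capped at BACKOFF_MAX_MS.
 */
static unsigned next_reconnect_delay_ms(unsigned prev_delay_ms,
                                        enum connect_result res)
{
    if (res == CONNECT_OK)
        return 0;
    if (prev_delay_ms == 0)
        return BACKOFF_MIN_MS;
    if (prev_delay_ms >= BACKOFF_MAX_MS / 2)
        return BACKOFF_MAX_MS;
    return prev_delay_ms * 2;
}

/*
 * Count the connect attempts one client makes while the server keeps
 * answering "busy" for busy_ms of simulated time.
 * throttled == 0 models the busy loop: retry after a nominal 1 ms.
 * throttled != 0 waits for the backoff delay between attempts.
 */
static unsigned count_attempts(unsigned busy_ms, int throttled)
{
    unsigned now = 0, attempts = 0, delay = 0;

    while (now < busy_ms) {
        attempts++;
        delay = next_reconnect_delay_ms(delay, CONNECT_BUSY);
        now += throttled ? delay : 1;
    }
    return attempts;
}
```

For a server that stays busy for 2 simulated seconds, `count_attempts(2000, 0)` returns 2000 while `count_attempts(2000, 1)` returns 5. Exponential backoff is just one possible throttling policy; a fixed delay or a delay derived from the existing ping interval would bound the retry rate as well.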
      ------- Comment #1 From Johann Lombardi 2010-03-31 16:01:54

        Activity

          [LU-290] Reconnects are not throttled
          simmonsja James A Simmons made changes -
          Resolution New: Won't Fix [ 2 ]
          Status Original: Open [ 1 ] New: Closed [ 6 ]

          simmonsja James A Simmons added a comment:

          Old ticket for unsupported version

          liwei Li Wei (Inactive) added a comment:

          Commit: 526c43ec2e47ead878f0df552b74c78b4fc79d1f (Jan 13, 2012)
          Maloo: https://maloo.whamcloud.com/test_sets/6638310e-3f5f-11e1-990e-5254004bbbd3

          Another flood.
          liwei Li Wei (Inactive) added a comment (edited):

          Several occurrences on Orion with the crazy "recovery is timed out, evict stale exports" flood:

          https://maloo.whamcloud.com/test_sets/8d84d62e-ce24-11e0-8d02-52540025f9af (8c8e6dc)
          https://maloo.whamcloud.com/test_sets/15d34bc8-cf7e-11e0-8d02-52540025f9af (448fc34)

          bzzz Alex Zhuravlev added a comment:

          Please see this:

          https://maloo.whamcloud.com/test_sets/e098dda6-c262-11e0-8bdf-52540025f9af

          ~30K messages "recovery is timed out, evict stale exports" in conf-sanity.test_47.console.client-9-ib.log

          hudson Build Master (Inactive) added a comment:

          Integrated in lustre-master » i686,client,el5,ofa #241
          LU-290 Reconnects are not throttled

          Oleg Drokin : 86b2211e55dcc509da85b21ece8830e2a9b70db1
          Files :

          • lustre/tests/conf-sanity.sh
          • lustre/ptlrpc/import.c

          hudson Build Master (Inactive) added a comment:

          Integrated in lustre-master » i686,server,el5,inkernel #241
          LU-290 Reconnects are not throttled

          Oleg Drokin : 86b2211e55dcc509da85b21ece8830e2a9b70db1
          Files :

          • lustre/tests/conf-sanity.sh
          • lustre/ptlrpc/import.c

          hudson Build Master (Inactive) added a comment:

          Integrated in lustre-master » i686,client,el6,inkernel #241
          LU-290 Reconnects are not throttled

          Oleg Drokin : 86b2211e55dcc509da85b21ece8830e2a9b70db1
          Files :

          • lustre/ptlrpc/import.c
          • lustre/tests/conf-sanity.sh

          hudson Build Master (Inactive) added a comment:

          Integrated in lustre-master » i686,server,el5,ofa #241
          LU-290 Reconnects are not throttled

          Oleg Drokin : 86b2211e55dcc509da85b21ece8830e2a9b70db1
          Files :

          • lustre/tests/conf-sanity.sh
          • lustre/ptlrpc/import.c

          hudson Build Master (Inactive) added a comment:

          Integrated in lustre-master » i686,client,el5,inkernel #241
          LU-290 Reconnects are not throttled

          Oleg Drokin : 86b2211e55dcc509da85b21ece8830e2a9b70db1
          Files :

          • lustre/ptlrpc/import.c
          • lustre/tests/conf-sanity.sh

          People

            laisiyao Lai Siyao
            laisiyao Lai Siyao