  1. Lustre
  2. LU-13049

LNet: Handle shutdown properly

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Lustre 2.14.0
    • Labels:
      None
    • Severity:
      3

      Description

      There are two code paths that can look up a peer while the system is shutting down:

      1. lnet_shutdown_lndnet()->lnet_peer_tables_cleanup()->lnet_peer_table_del_rtrs_locked()->lnet_del_route()
      2. lnet_mt_event_handler()->lnet_handle_recovery_reply()->lnet_find_peer_ni_locked()

      In both cases the_lnet.ln_state may already indicate that LNet is shutting down. Currently lnet_get_peer_ni_locked() asserts that ln_state == LNET_STATE_RUNNING, so a lookup on either path can trip the assertion.

      This should be handled gracefully: if the state is not LNET_STATE_RUNNING, lnet_get_peer_ni_locked() should return NULL instead of asserting. Callers of the function already handle a NULL return value.
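
      The proposed behaviour can be illustrated with a minimal, standalone sketch. It is not the actual LNet code: every name prefixed with sketch_ (the state enum, the peer record, the global state variable, and the functions) is a hypothetical stand-in introduced here only to show the pattern of returning NULL from a lookup when the state is not running, and a caller that tolerates that NULL.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for the LNet state values; not the real definitions. */
enum sketch_state {
	SKETCH_STATE_SHUTDOWN,
	SKETCH_STATE_RUNNING,
	SKETCH_STATE_STOPPING,
};

/* Hypothetical, minimal peer record keyed by a numeric id. */
struct sketch_peer {
	unsigned long id;
};

/* Hypothetical global state, loosely modelled on the_lnet.ln_state. */
static enum sketch_state sketch_ln_state = SKETCH_STATE_RUNNING;

static struct sketch_peer sketch_peers[] = { { 10 }, { 20 }, { 30 } };

/*
 * Look up a peer by id. Instead of asserting that the state is RUNNING,
 * return NULL when it is not, so a caller on a shutdown path simply sees
 * "no peer found".
 */
static struct sketch_peer *sketch_get_peer(unsigned long id)
{
	size_t i;

	if (sketch_ln_state != SKETCH_STATE_RUNNING)
		return NULL;

	for (i = 0; i < sizeof(sketch_peers) / sizeof(sketch_peers[0]); i++) {
		if (sketch_peers[i].id == id)
			return &sketch_peers[i];
	}
	return NULL;
}

/* A caller that tolerates NULL: returns 0 if a peer was found, -1 otherwise. */
static int sketch_handle_reply(unsigned long id)
{
	struct sketch_peer *peer = sketch_get_peer(id);

	if (peer == NULL)
		return -1;
	return 0;
}

/* Small self-check showing the behaviour before and during shutdown. */
static void sketch_self_check(void)
{
	sketch_ln_state = SKETCH_STATE_RUNNING;
	assert(sketch_handle_reply(20) == 0);

	sketch_ln_state = SKETCH_STATE_STOPPING;
	assert(sketch_handle_reply(20) == -1);
}
```

      The design point is that the state check lives inside the lookup itself, so neither shutdown path needs its own guard as long as its callers already treat NULL as "peer not available".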

            People

            • Assignee:
              ashehata Amir Shehata
              Reporter:
              ashehata Amir Shehata
            • Votes:
              0
              Watchers:
              3
