Lustre / LU-8334

OSS lockup


Details

    • Type: Bug
    • Resolution: Duplicate
    • Priority: Critical
    • None
    • Lustre 2.7.0
    • None
    • 2.7.1-fe
    • 3
    • 9223372036854775807

    Description

      The OSS deadlocked and could not be pinged on either its Ethernet or IB interfaces. The console showed no errors.
      Attaching a full trace of all threads. Most notable are the kiblnd threads:

      PID: 8711   TASK: ffff882020b12ab0  CPU: 4   COMMAND: "kiblnd_sd_01_01"
       #0 [ffff880060c86e90] crash_nmi_callback at ffffffff81032256
       #1 [ffff880060c86ea0] notifier_call_chain at ffffffff81568515
       #2 [ffff880060c86ee0] atomic_notifier_call_chain at ffffffff8156857a
       #3 [ffff880060c86ef0] notify_die at ffffffff810a44fe
       #4 [ffff880060c86f20] do_nmi at ffffffff8156618f
       #5 [ffff880060c86f50] nmi at ffffffff815659f0
          [exception RIP: _spin_lock+33]
          RIP: ffffffff81565261  RSP: ffff882021b75b70  RFLAGS: 00000293
          RAX: 0000000000002b8e  RBX: ffff880ffe7dd240  RCX: 0000000000000000
          RDX: 0000000000002b8b  RSI: 0000000000000003  RDI: ffff88201ee3f140
          RBP: ffff882021b75b70   R8: 6950000000000000   R9: 4a80000000000000
          R10: 0000000000000001  R11: 0000000000000001  R12: 0000000000000018
          R13: ffff881013262e40  R14: ffff8820268ecac0  R15: 0000000000000004
          ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
      --- <NMI exception stack> ---
       #6 [ffff882021b75b70] _spin_lock at ffffffff81565261
       #7 [ffff882021b75b78] cfs_percpt_lock at ffffffffa049edab [libcfs]
       #8 [ffff882021b75bb8] lnet_ptl_match_md at ffffffffa0529605 [lnet]
       #9 [ffff882021b75c38] lnet_parse_local at ffffffffa05306e7 [lnet]
      #10 [ffff882021b75cd8] lnet_parse at ffffffffa05316da [lnet]
      #11 [ffff882021b75d68] kiblnd_handle_rx at ffffffffa0a16f3b [ko2iblnd]
      #12 [ffff882021b75db8] kiblnd_scheduler at ffffffffa0a182be [ko2iblnd]
      #13 [ffff882021b75ee8] kthread at ffffffff8109dc8e
      #14 [ffff882021b75f48] kernel_thread at ffffffff8100c28a
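
When going through a full trace of all threads, it can help to pick out every task that is sitting in `_spin_lock` and see which callers lead there. The following is a minimal sketch, assuming the trace is saved as plain text in the `bt` layout shown above; the names `parse_backtraces`, `waiters_on` and `SAMPLE` are hypothetical helpers for illustration and are not part of crash or Lustre.

```python
import re

# Task header line, e.g.: PID: 8711  TASK: ffff...  CPU: 4  COMMAND: "kiblnd_sd_01_01"
# The leading "P" is optional so a clipped "ID:" header is still accepted.
HEADER_RE = re.compile(
    r'P?ID:\s*(?P<pid>\d+)\s+TASK:\s*(?P<task>[0-9a-f]+)\s+'
    r'CPU:\s*(?P<cpu>\d+)\s+COMMAND:\s*"(?P<comm>[^"]+)"'
)
# Stack frame line, e.g.: #7 [ffff882021b75b78] cfs_percpt_lock at ffffffffa049edab [libcfs]
FRAME_RE = re.compile(
    r'^\s*#(?P<idx>\d+)\s+\[[0-9a-f]+\]\s+(?P<func>\S+)\s+at\s+[0-9a-f]+'
    r'(?:\s+\[(?P<mod>[^\]]+)\])?'
)


def parse_backtraces(text):
    """Split backtrace text into tasks, each with a list of (function, module) frames."""
    tasks = []
    current = None
    for line in text.splitlines():
        header = HEADER_RE.search(line)
        if header:
            current = {
                "pid": int(header["pid"]),
                "cpu": int(header["cpu"]),
                "comm": header["comm"],
                "frames": [],
            }
            tasks.append(current)
            continue
        frame = FRAME_RE.match(line)
        if frame and current is not None:
            current["frames"].append((frame["func"], frame["mod"]))
    return tasks


def waiters_on(tasks, symbol="_spin_lock", depth=3):
    """Return (pid, command, callers) for every task with `symbol` on its stack."""
    hits = []
    for task in tasks:
        funcs = [func for func, _ in task["frames"]]
        if symbol in funcs:
            i = funcs.index(symbol)
            hits.append((task["pid"], task["comm"], task["frames"][i + 1:i + 1 + depth]))
    return hits


# Abbreviated copy of the trace above, used only to exercise the parser.
SAMPLE = '''
PID: 8711   TASK: ffff882020b12ab0  CPU: 4   COMMAND: "kiblnd_sd_01_01"
 #5 [ffff880060c86f50] nmi at ffffffff815659f0
    [exception RIP: _spin_lock+33]
--- <NMI exception stack> ---
 #6 [ffff882021b75b70] _spin_lock at ffffffff81565261
 #7 [ffff882021b75b78] cfs_percpt_lock at ffffffffa049edab [libcfs]
 #8 [ffff882021b75bb8] lnet_ptl_match_md at ffffffffa0529605 [lnet]
 #9 [ffff882021b75c38] lnet_parse_local at ffffffffa05306e7 [lnet]
'''

if __name__ == "__main__":
    for pid, comm, callers in waiters_on(parse_backtraces(SAMPLE)):
        chain = " <- ".join(func for func, _ in callers)
        print(f"{pid} {comm}: _spin_lock <- {chain}")
```

Grouping the results by caller chain across the whole attachment would show whether other threads are waiting in `cfs_percpt_lock` as well; the sketch only reports what is on each stack and does not identify which task holds the lock.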
      


            People

              bfaccini Bruno Faccini (Inactive)
              mhanafi Mahmoud Hanafi