Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-12225

jobid_get_from_cache stalled on spin_lock(&pidmap->jp_lock)

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Lustre 2.13.0, Lustre 2.12.3
    • Labels:
      None
    • Severity:
      3
    • Rank (Obsolete):
      9223372036854775807

      Description

      ne process got stalled on spin_lock(&pidmap->jp_lock).

      PID: 15240 TASK: ffff8848c676edd0 CPU: 16 COMMAND: "wrf.exe"
      #0 [ffff882f80005e48] crash_nmi_callback at ffffffff8104d342
      #1 [ffff882f80005e58] nmi_handle at ffffffff816901b7
      #2 [ffff882f80005eb0] do_nmi at ffffffff816903c3
      #3 [ffff882f80005ef0] end_repeat_nmi at ffffffff8168f5d3
      [exception RIP: _raw_spin_lock+50]
      RIP: ffffffff8168ea92 RSP: ffff8848c7e47b48 RFLAGS: 00000206
      RAX: 0000000000006d1c RBX: ffff883138e99240 RCX: 0000000000000004
      RDX: 0000000000005a5a RSI: 0000000000005a5a RDI: ffff883138e99258
      RBP: ffff8848c7e47b48 R8: 0000000000019b80 R9: ffffffffa091e1da
      R10: ffff882f80019b80 R11: ffffea00bd578e00 R12: ffff883138e99258
      R13: 0000000000000000 R14: 0000000000000020 R15: ffff882efccfeb24
      ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
      — <NMI exception stack> —
      #4 [ffff8848c7e47b48] _raw_spin_lock at ffffffff8168ea92
      #5 [ffff8848c7e47b50] jobid_get_from_cache at ffffffffa0a739e8 [obdclass]
      #6 [ffff8848c7e47bb8] lustre_get_jobid at ffffffffa0a7429c [obdclass]
      #7 [ffff8848c7e47c20] vvp_io_init at ffffffffa0e89ff4 [lustre]
      #8 [ffff8848c7e47c70] cl_io_submit_rw at ffffffffa0a6ebb8 [obdclass]
      #9 [ffff8848c7e47ca8] cl_io_init at ffffffffa0a6ed4a [obdclass]
      #10 [ffff8848c7e47cd8] cl_io_rw_init at ffffffffa0a6f743 [obdclass]
      #11 [ffff8848c7e47d28] ll_file_io_generic at ffffffffa0e33232 [lustre]
      #12 [ffff8848c7e47e40] ll_file_aio_write at ffffffffa0e33fad [lustre]
      #13 [ffff8848c7e47ea0] ll_file_write at ffffffffa0e3413e [lustre]
      #14 [ffff8848c7e47ef8] vfs_write at ffffffff811fe9fd
      #15 [ffff8848c7e47f38] sys_write at ffffffff811ff51f
      #16 [ffff8848c7e47f80] system_call_fastpath at ffffffff81697809
      RIP: 00002b798058943d RSP: 00007ffeb53c6998 RFLAGS: 00010206
      RAX: 0000000000000001 RBX: ffffffff81697809 RCX: 0000000000000000
      RDX: 0000000000000033 RSI: 0000000007d4a6c0 RDI: 0000000000000002
      RBP: 00007ffeb53c6180 R8: 0000000000000000 R9: 0000000007b6a7e0
      R10: 0000000000000000 R11: 0000000000000293 R12: 0000000007d4a6c0
      R13: 0000000000020000 R14: 0000000000000033 R1

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                wshilong Wang Shilong
                Reporter:
                wshilong Wang Shilong
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: