Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-12225

jobid_get_from_cache stalled on spin_lock(&pidmap->jp_lock)

Details

    • 3
    • 9223372036854775807

    Description

      ne process got stalled on spin_lock(&pidmap->jp_lock).

      PID: 15240 TASK: ffff8848c676edd0 CPU: 16 COMMAND: "wrf.exe"
      #0 [ffff882f80005e48] crash_nmi_callback at ffffffff8104d342
      #1 [ffff882f80005e58] nmi_handle at ffffffff816901b7
      #2 [ffff882f80005eb0] do_nmi at ffffffff816903c3
      #3 [ffff882f80005ef0] end_repeat_nmi at ffffffff8168f5d3
      [exception RIP: _raw_spin_lock+50]
      RIP: ffffffff8168ea92 RSP: ffff8848c7e47b48 RFLAGS: 00000206
      RAX: 0000000000006d1c RBX: ffff883138e99240 RCX: 0000000000000004
      RDX: 0000000000005a5a RSI: 0000000000005a5a RDI: ffff883138e99258
      RBP: ffff8848c7e47b48 R8: 0000000000019b80 R9: ffffffffa091e1da
      R10: ffff882f80019b80 R11: ffffea00bd578e00 R12: ffff883138e99258
      R13: 0000000000000000 R14: 0000000000000020 R15: ffff882efccfeb24
      ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
      — <NMI exception stack> —
      #4 [ffff8848c7e47b48] _raw_spin_lock at ffffffff8168ea92
      #5 [ffff8848c7e47b50] jobid_get_from_cache at ffffffffa0a739e8 [obdclass]
      #6 [ffff8848c7e47bb8] lustre_get_jobid at ffffffffa0a7429c [obdclass]
      #7 [ffff8848c7e47c20] vvp_io_init at ffffffffa0e89ff4 [lustre]
      #8 [ffff8848c7e47c70] cl_io_submit_rw at ffffffffa0a6ebb8 [obdclass]
      #9 [ffff8848c7e47ca8] cl_io_init at ffffffffa0a6ed4a [obdclass]
      #10 [ffff8848c7e47cd8] cl_io_rw_init at ffffffffa0a6f743 [obdclass]
      #11 [ffff8848c7e47d28] ll_file_io_generic at ffffffffa0e33232 [lustre]
      #12 [ffff8848c7e47e40] ll_file_aio_write at ffffffffa0e33fad [lustre]
      #13 [ffff8848c7e47ea0] ll_file_write at ffffffffa0e3413e [lustre]
      #14 [ffff8848c7e47ef8] vfs_write at ffffffff811fe9fd
      #15 [ffff8848c7e47f38] sys_write at ffffffff811ff51f
      #16 [ffff8848c7e47f80] system_call_fastpath at ffffffff81697809
      RIP: 00002b798058943d RSP: 00007ffeb53c6998 RFLAGS: 00010206
      RAX: 0000000000000001 RBX: ffffffff81697809 RCX: 0000000000000000
      RDX: 0000000000000033 RSI: 0000000007d4a6c0 RDI: 0000000000000002
      RBP: 00007ffeb53c6180 R8: 0000000000000000 R9: 0000000007b6a7e0
      R10: 0000000000000000 R11: 0000000000000293 R12: 0000000007d4a6c0
      R13: 0000000000020000 R14: 0000000000000033 R1

      Attachments

        Issue Links

          Activity

            [LU-12225] jobid_get_from_cache stalled on spin_lock(&pidmap->jp_lock)

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35008/
            Subject: LU-12225 obdclass: improve jobid memory reclaim policy
            Project: fs/lustre-release
            Branch: b2_12
            Current Patch Set:
            Commit: cffbcfd60b4a55387e968c92c8f6cfeb0d17a35f

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35008/ Subject: LU-12225 obdclass: improve jobid memory reclaim policy Project: fs/lustre-release Branch: b2_12 Current Patch Set: Commit: cffbcfd60b4a55387e968c92c8f6cfeb0d17a35f

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35007/
            Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash
            Project: fs/lustre-release
            Branch: b2_12
            Current Patch Set:
            Commit: 1dd46563439cc35b54ca97a6446eeaf0fe1ccd8c

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35007/ Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash Project: fs/lustre-release Branch: b2_12 Current Patch Set: Commit: 1dd46563439cc35b54ca97a6446eeaf0fe1ccd8c

            Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35008
            Subject: LU-12225 obdclass: improve jobid memory reclaim policy
            Project: fs/lustre-release
            Branch: b2_12
            Current Patch Set: 1
            Commit: 0ecbff4eeaf55c76c946bc2f37633f9e64d42d3b

            gerrit Gerrit Updater added a comment - Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35008 Subject: LU-12225 obdclass: improve jobid memory reclaim policy Project: fs/lustre-release Branch: b2_12 Current Patch Set: 1 Commit: 0ecbff4eeaf55c76c946bc2f37633f9e64d42d3b

            Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35007
            Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash
            Project: fs/lustre-release
            Branch: b2_12
            Current Patch Set: 1
            Commit: eec34ebbc5fa041f78ae97e1bbd8517943f29502

            gerrit Gerrit Updater added a comment - Minh Diep (mdiep@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35007 Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash Project: fs/lustre-release Branch: b2_12 Current Patch Set: 1 Commit: eec34ebbc5fa041f78ae97e1bbd8517943f29502
            pjones Peter Jones added a comment -

            Landed for 2.13

            pjones Peter Jones added a comment - Landed for 2.13

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/34775/
            Subject: LU-12225 obdclass: improve jobid memory reclaim policy
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 3e9fedfa7ea52a03a1975572ab37cc1ae9344a8a

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/34775/ Subject: LU-12225 obdclass: improve jobid memory reclaim policy Project: fs/lustre-release Branch: master Current Patch Set: Commit: 3e9fedfa7ea52a03a1975572ab37cc1ae9344a8a

            Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/34763/
            Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: b664182e0361731fa409ac6a0a0f19637a7e5288

            gerrit Gerrit Updater added a comment - Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/34763/ Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash Project: fs/lustre-release Branch: master Current Patch Set: Commit: b664182e0361731fa409ac6a0a0f19637a7e5288

            Wang Shilong (wshilong@ddn.com) uploaded a new patch: https://review.whamcloud.com/34775
            Subject: LU-12225 obdclass: improve jobid memory reclaim policy
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 1865943dec15f409afba14ca80e068c0422f041e

            gerrit Gerrit Updater added a comment - Wang Shilong (wshilong@ddn.com) uploaded a new patch: https://review.whamcloud.com/34775 Subject: LU-12225 obdclass: improve jobid memory reclaim policy Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 1865943dec15f409afba14ca80e068c0422f041e

            Wang Shilong (wshilong@ddn.com) uploaded a new patch: https://review.whamcloud.com/34763
            Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 291c004743fa7f17428227ac41c2d037925d813c

            gerrit Gerrit Updater added a comment - Wang Shilong (wshilong@ddn.com) uploaded a new patch: https://review.whamcloud.com/34763 Subject: LU-12225 obdclass: fix race access vs removal of jobid_hash Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 291c004743fa7f17428227ac41c2d037925d813c

            People

              wshilong Wang Shilong (Inactive)
              wshilong Wang Shilong (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: