Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-17596

ptlrpc_save_lock()) ASSERTION( rs->rs_nlocks < 14 ) failed

    XMLWordPrintable

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • Lustre 2.16.0
    • 3
    • 9223372036854775807

    Description

      I think this was introduced by https://review.whamcloud.com/c/fs/lustre-release/+/35034

      at least it always hits in a test added by that patch

       [114877.640493] Lustre: DEBUG MARKER: == sanity test 300ug: migrate overstriped dirs =========== 21:41:45 (1709260905)
      [114879.322848] LustreError: 8393:0:(service.c:189:ptlrpc_save_lock()) ASSERTION( rs->rs_nlocks < 14 ) failed: 
      [114879.324154] LustreError: 8393:0:(service.c:189:ptlrpc_save_lock()) LBUG
      [114879.324842] CPU: 4 PID: 8393 Comm: mdt_io02_002 Kdump: loaded Tainted: P        W  OE  ------------   3.10.0-7.9-debug #2
      [114879.326174] Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
      [114879.327364] Call Trace:
      [114879.327930]  [<ffffffff817d93f8>] dump_stack+0x19/0x1b
      [114879.328558]  [<ffffffffa0251a9d>] lbug_with_loc+0x4d/0xa0 [libcfs]
      [114879.329246]  [<ffffffffa061ca87>] ptlrpc_save_lock+0x107/0x140 [ptlrpc]
      [114879.329978]  [<ffffffffa1282cd3>] mdt_save_lock+0x233/0x3e0 [mdt]
      [114879.330642]  [<ffffffffa129324f>] mdt_object_unlock+0x5f/0x170 [mdt]
      [114879.331295]  [<ffffffffa12b15ac>] mdt_object_stripes_unlock+0xfc/0x2d0 [mdt]
      [114879.332267]  [<ffffffffa12b3490>] mdt_reint_migrate+0x1d10/0x2420 [mdt]
      [114879.332955]  [<ffffffffa12b9567>] mdt_reint_rec+0x87/0x240 [mdt]
      [114879.333618]  [<ffffffffa128e5ac>] mdt_reint_internal+0x74c/0xbb0 [mdt]
      [114879.334280]  [<ffffffffa1296265>] ? mdt_thread_info_init+0xa5/0xc0 [mdt]
      [114879.334900]  [<ffffffffa1298f77>] mdt_reint+0x67/0x150 [mdt]
      [114879.335481]  [<ffffffffa06e3ade>] tgt_request_handle+0x74e/0x1a10 [ptlrpc]
      [114879.336089]  [<ffffffffa062683c>] ptlrpc_server_handle_request+0x26c/0xcb0 [ptlrpc]
      [114879.337307]  [<ffffffffa06285d6>] ptlrpc_main+0xc66/0x1670 [ptlrpc]
      [114879.337970]  [<ffffffff810dbb51>] ? put_prev_entity+0x31/0x400
      [114879.338623]  [<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
      [114879.339303]  [<ffffffffa0627970>] ? ptlrpc_wait_event+0x610/0x610 [ptlrpc]
      [114879.339958]  [<ffffffff810ba114>] kthread+0xe4/0xf0
      [114879.340580]  [<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
      [114879.341241]  [<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
      [114879.341878]  [<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
      [114879.342522] Kernel panic - not syncing: LBUG

      Example crashes with crashdumps: http://testing.linuxhacker.ru/lustre-reports/external/crashes/boilpot-bigmem-18-2024-02-29-21:41:53/

      https://testing.whamcloud.com/gerrit-janitor/external/crashes/boilpot-bigmem-8-2024-02-22-22:11:41

      All such failures could be seen here for now: https://knox.linuxhacker.ru/crashdb_ui_external.py.cgi?newid=70311

      Attachments

        Activity

          People

            laisiyao Lai Siyao
            green Oleg Drokin
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: