Lustre / LU-7728

soft lockup in osp_precreate_reserve()

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: Lustre 2.9.0
    • Labels:
    • Severity: 3

      Description

      Jan 5 23:14:11 snx11139n003 kernel: BUG: soft lockup - CPU#12 stuck for 67s! [mdt03_093:66043]
      ...
      Jan 5 23:14:11 snx11139n003 kernel:
      Jan 5 23:14:11 snx11139n003 kernel: Pid: 66043, comm: mdt03_093 Not tainted 2.6.32-431.17.1.x2.0.61.x86_64 #1 Intel Corporation S2600JF/S2600JF
      Jan 5 23:14:11 snx11139n003 kernel: RIP: 0010:[<ffffffff81527fe7>] [<ffffffff81527fe7>] _spin_unlock_irqrestore+0x17/0x20
      Jan 5 23:14:11 snx11139n003 kernel: RSP: 0018:ffff8806921d9430 EFLAGS: 00000282
      Jan 5 23:14:11 snx11139n003 kernel: RAX: ffff8806d983e1c0 RBX: ffff8806921d9430 RCX: 0000000000000005
      Jan 5 23:14:11 snx11139n003 kernel: RDX: ffff8806d983e1b8 RSI: 0000000000000282 RDI: 0000000000000282
      Jan 5 23:14:11 snx11139n003 kernel: RBP: ffffffff8100bb8e R08: 0000000000000000 R09: 00000000fffffffb
      Jan 5 23:14:11 snx11139n003 kernel: R10: 0000000000000002 R11: 0000000000000001 R12: 00000000000000cf
      Jan 5 23:14:11 snx11139n003 kernel: R13: 0000000000000082 R14: 00000000000000cf R15: 0000000000000000
      Jan 5 23:14:11 snx11139n003 kernel: FS: 0000000000000000(0000) GS:ffff88085c480000(0000) knlGS:0000000000000000
      Jan 5 23:14:11 snx11139n003 kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      Jan 5 23:14:11 snx11139n003 kernel: CR2: 00007f558d977d50 CR3: 0000000001a85000 CR4: 00000000000407e0
      Jan 5 23:14:11 snx11139n003 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      Jan 5 23:14:11 snx11139n003 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      Jan 5 23:14:11 snx11139n003 kernel: Process mdt03_093 (pid: 66043, threadinfo ffff8806921d8000, task ffff880692122aa0)
      Jan 5 23:14:11 snx11139n003 kernel: Stack:
      Jan 5 23:14:11 snx11139n003 kernel: ffff8806921d9470 ffffffff81058c53 ffff8806921d9470 ffff8806d983e000
      Jan 5 23:14:11 snx11139n003 kernel: <d> 00000000ffffffff ffffffff00000000 ffff8806d983e110 000000012ace16d6
      Jan 5 23:14:11 snx11139n003 kernel: <d> ffff8806921d9550 ffffffffa105e2ce 0000000059447a20 00000000013a8480
      Jan 5 23:14:11 snx11139n003 kernel: Call Trace:
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff81058c53>] ? __wake_up+0x53/0x70
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa105e2ce>] ? osp_precreate_reserve+0x2be/0x840 [osp]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa105a8ac>] ? osp_declare_object_create+0x16c/0x4f0 [osp]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa1018c54>] ? lod_qos_declare_object_on+0x124/0x4e0 [lod]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa101ade9>] ? lod_alloc_rr.clone.2+0x7f9/0xc80 [lod]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa101c370>] ? lod_qos_prep_create+0x1100/0x1b5c [lod]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0b5be4a>] ? fld_cache_lookup+0x3a/0x1e0 [fld]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0b5fc32>] ? fld_server_lookup+0x72/0x440 [fld]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa10149f4>] ? lod_declare_striped_object+0x154/0x940 [lod]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa1016eb8>] ? lod_declare_object_create+0x518/0x7e0 [lod]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0dcaf4d>] ? mdd_declare_object_create_internal+0x11d/0x340 [mdd]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0f77a60>] ? osd_xattr_get+0x230/0x2e0 [osd_ldiskfs]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0dbe51e>] ? mdd_declare_create+0x4e/0xa60 [mdd]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0dbf927>] ? mdd_linkea_prepare+0x387/0x4d0 [mdd]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0dc311b>] ? mdd_create+0x75b/0x1930 [mdd]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0f77927>] ? osd_xattr_get+0xf7/0x2e0 [osd_ldiskfs]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0e3f808>] ? mdo_create+0x18/0x50 [mdt]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0e49a41>] ? mdt_reint_open+0x1401/0x20b0 [mdt]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0528246>] ? upcall_cache_get_entry+0x296/0x870 [libcfs]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0e3365d>] ? mdt_reint_rec+0x5d/0x200 [mdt]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0e19a0b>] ? mdt_reint_internal+0x4cb/0x760 [mdt]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0e19e96>] ? mdt_intent_reint+0x1f6/0x430 [mdt]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0e18594>] ? mdt_intent_policy+0x494/0xce0 [mdt]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa0871739>] ? ldlm_lock_enqueue+0x129/0x9d0 [ptlrpc]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa089dadb>] ? ldlm_handle_enqueue0+0x51b/0x13b0 [ptlrpc]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa050c63e>] ? cfs_timer_arm+0xe/0x10 [libcfs]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa091f231>] ? tgt_enqueue+0x61/0x230 [ptlrpc]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa091fafe>] ? tgt_request_handle+0x6fe/0xaf0 [ptlrpc]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa08cf751>] ? ptlrpc_main+0xe41/0x1930 [ptlrpc]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff810096f0>] ? __switch_to+0xd0/0x320
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff8152557e>] ? thread_return+0x4e/0x760
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa08ce910>] ? ptlrpc_main+0x0/0x1930 [ptlrpc]
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff8109ac66>] ? kthread+0x96/0xa0
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff8100c20a>] ? child_rip+0xa/0x20
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff8109abd0>] ? kthread+0x0/0xa0
      Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff8100c200>] ? child_rip+0x0/0x20
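
      For triage, the call chain in the trace above can be reduced to a list of function and module names. The following is a minimal sketch, assuming the syslog frame format shown in this report; the regular expression and the helper name `parse_frames` are illustrative and not part of any Lustre or kernel tooling. The sample embeds a few lines copied from the log.

```python
import re

# Matches the frame format seen in the trace above, e.g.
# "[<ffffffffa105e2ce>] ? osp_precreate_reserve+0x2be/0x840 [osp]"
# The module suffix is optional: core kernel symbols carry none.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?:\?\s+)?"
    r"(?P<func>[\w.]+)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_frames(log_text):
    """Return (function, module) pairs for each frame after 'Call Trace:'.

    Frames without a module suffix are labelled 'kernel' (an assumption
    made here for readability).
    """
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        if "Call Trace:" in line:
            in_trace = True
            continue
        if not in_trace:
            continue  # skip register dump and RIP lines before the trace
        m = FRAME_RE.search(line)
        if m:
            frames.append((m.group("func"), m.group("module") or "kernel"))
    return frames


# A few lines copied verbatim from the log in this report.
sample = """\
Jan 5 23:14:11 snx11139n003 kernel: RIP: 0010:[<ffffffff81527fe7>] [<ffffffff81527fe7>] _spin_unlock_irqrestore+0x17/0x20
Jan 5 23:14:11 snx11139n003 kernel: Call Trace:
Jan 5 23:14:11 snx11139n003 kernel: [<ffffffff81058c53>] ? __wake_up+0x53/0x70
Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa105e2ce>] ? osp_precreate_reserve+0x2be/0x840 [osp]
Jan 5 23:14:11 snx11139n003 kernel: [<ffffffffa101ade9>] ? lod_alloc_rr.clone.2+0x7f9/0xc80 [lod]
"""

frames = parse_frames(sample)
for func, module in frames:
    print(f"{func} [{module}]")
```

      Note that every frame in the trace is prefixed with `?`, which the kernel uses to mark addresses found on the stack that it could not confirm as part of the active call chain, so the extracted list is a hint rather than an exact backtrace.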

            People

            Assignee:
            wc-triage WC Triage
            Reporter:
            askulysh Andriy Skulysh