Details
-
Bug
-
Resolution: Cannot Reproduce
-
Minor
-
None
-
Lustre 1.8.6
-
None
-
3
-
10155
Description
At our customer site, we saw a thread hang on the MDS during normal operation.
May 21 12:58:00 ALPL506 kernel: Call Trace:
May 21 12:58:00 ALPL506 kernel: [<ffffffff80009852>] __d_lookup+0xb0/0xff
May 21 12:58:00 ALPL506 kernel: [<ffffffff80063c4f>] __mutex_lock_slowpath+0x60/0x9b
May 21 12:58:00 ALPL506 kernel: [<ffffffff80063c99>] .text.lock.mutex+0xf/0x14
May 21 12:58:00 ALPL506 kernel: [<ffffffff88c9ffd7>] :mds:mds_get_md+0x47/0x1c0
May 21 12:58:00 ALPL506 kernel: [<ffffffff88ca03ee>] :mds:mds_pack_md+0x29e/0x370
May 21 12:58:00 ALPL506 kernel: [<ffffffff88ca06df>] :mds:mds_getattr_internal+0x21f/0x840
May 21 12:58:00 ALPL506 kernel: [<ffffffff88ca3ad5>] :mds:mds_getattr_lock+0xab5/0xc90
May 21 12:58:00 ALPL506 kernel: [<ffffffff88c9edda>] :mds:fixup_handle_for_resent_req+0x5a/0x2c0
May 21 12:58:00 ALPL506 kernel: [<ffffffff88ca9d83>] :mds:mds_intent_policy+0x623/0xc20
May 21 12:58:00 ALPL506 kernel: [<ffffffff88944270>] :ptlrpc:ldlm_resource_putref_internal+0x230/0x460
May 21 12:58:00 ALPL506 kernel: [<ffffffff88941eb6>] :ptlrpc:ldlm_lock_enqueue+0x186/0xb20
May 21 12:58:00 ALPL506 kernel: [<ffffffff8893e7fd>] :ptlrpc:ldlm_lock_create+0x9bd/0x9f0
May 21 12:58:00 ALPL506 kernel: [<ffffffff88966870>] :ptlrpc:ldlm_server_blocking_ast+0x0/0x83d
May 21 12:58:00 ALPL506 kernel: [<ffffffff88963b39>] :ptlrpc:ldlm_handle_enqueue+0xc09/0x1210
May 21 12:58:00 ALPL506 kernel: [<ffffffff88ca8b30>] :mds:mds_handle+0x40e0/0x4d10
May 21 12:58:00 ALPL506 kernel: [<ffffffff800774ed>] smp_send_reschedule+0x4e/0x53
May 21 12:58:00 ALPL506 kernel: [<ffffffff8008ddcd>] enqueue_task+0x41/0x56
May 21 12:58:00 ALPL506 kernel: [<ffffffff88987d55>] :ptlrpc:lustre_msg_get_conn_cnt+0x35/0xf0
May 21 12:58:00 ALPL506 kernel: [<ffffffff889916d9>] :ptlrpc:ptlrpc_server_handle_request+0x989/0xe00
May 21 12:58:00 ALPL506 kernel: [<ffffffff88991e35>] :ptlrpc:ptlrpc_wait_event+0x2e5/0x310
May 21 12:58:00 ALPL506 kernel: [<ffffffff8008c85d>] __wake_up_common+0x3e/0x68
May 21 12:58:00 ALPL506 kernel: [<ffffffff88992dc6>] :ptlrpc:ptlrpc_main+0xf66/0x1120
May 21 12:58:00 ALPL506 kernel: [<ffffffff8005dfb1>] child_rip+0xa/0x11
May 21 12:58:00 ALPL506 kernel: [<ffffffff88991e60>] :ptlrpc:ptlrpc_main+0x0/0x1120
May 21 12:58:00 ALPL506 kernel: [<ffffffff8005dfa7>] child_rip+0x0/0x11
Issue Links
- Trackbacks
-
Lustre 1.8.x known issues tracker
While testing against the Lustre b18 branch, we would hit known bugs that were already reported in Lustre Bugzilla https://bugzilla.lustre.org/. In order to move away from relying on Bugzilla, we would create a JIRA