<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:15:10 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1274] Client threads block for sometime before being evicted and can never reconnect afterward</title>
                <link>https://jira.whamcloud.com/browse/LU-1274</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We have a new customer hitting something very similar to &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-816&quot; title=&quot;Possible bug/dead-lock in Lustre-Lock algorithm/protocol may lead to multiple Clients/processes to blocked for ever&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-816&quot;&gt;&lt;del&gt;LU-816&lt;/del&gt;&lt;/a&gt;.&lt;br/&gt;
The difference is they are running Lustre 2.1.&lt;/p&gt;

&lt;p&gt;The sequence of events is as follows (times are taken from the logs):&lt;br/&gt;
1/ The client reports a process blocked for more than 120 seconds with the following trace (there were no kernel or Lustre messages in the preceding 40 minutes):&lt;br/&gt;
(the thread was reported as blocked for 2 minutes at 13:43:27, so it had been blocked since 13:41:27)&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;INFO: task ls:12058 blocked for more than 120 seconds.&lt;br/&gt;
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
 ls            D 0000000000000008     0 12058  11471 0x00000084&lt;br/&gt;
 ffff8805ee2dfa58 0000000000000086 ffff8805ee2dfa68 ffff880bbc28e600&lt;br/&gt;
 ffff8805ee2dfd08 ffff880616207800 ffff8805ee2dfad8 ffff880620230a38&lt;br/&gt;
 ffff880604529038 ffff8805ee2dffd8 000000000000f598 ffff880604529038&lt;br/&gt;
 Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc84e&amp;gt;&amp;#93;&lt;/span&gt; __mutex_lock_slowpath+0x13e/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6eb&amp;gt;&amp;#93;&lt;/span&gt; mutex_lock+0x2b/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa068c7b8&amp;gt;&amp;#93;&lt;/span&gt; cl_lock_mutex_get+0x78/0xd0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa068eb4a&amp;gt;&amp;#93;&lt;/span&gt; cl_lock_hold_mutex+0xca/0x710 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa09dff73&amp;gt;&amp;#93;&lt;/span&gt; ? lov_io_init_raid0+0x4b3/0x920 &lt;span class=&quot;error&quot;&gt;&amp;#91;lov&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa069060e&amp;gt;&amp;#93;&lt;/span&gt; cl_lock_request+0x5e/0x1f0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa09d6bf9&amp;gt;&amp;#93;&lt;/span&gt; ? lov_io_init+0x99/0x120 &lt;span class=&quot;error&quot;&gt;&amp;#91;lov&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a73ab0&amp;gt;&amp;#93;&lt;/span&gt; cl_glimpse_lock+0x180/0x390 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a74274&amp;gt;&amp;#93;&lt;/span&gt; cl_glimpse_size+0x184/0x190 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a2a28f&amp;gt;&amp;#93;&lt;/span&gt; ll_inode_revalidate_it+0x6f/0x1f0 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81181804&amp;gt;&amp;#93;&lt;/span&gt; ? do_path_lookup+0x94/0xa0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117efc5&amp;gt;&amp;#93;&lt;/span&gt; ? putname+0x45/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a2a459&amp;gt;&amp;#93;&lt;/span&gt; ll_getattr_it+0x49/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a2a5b7&amp;gt;&amp;#93;&lt;/span&gt; ll_getattr+0x37/0x40 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81177c11&amp;gt;&amp;#93;&lt;/span&gt; vfs_getattr+0x51/0x80&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0a1572e&amp;gt;&amp;#93;&lt;/span&gt; ? ll_ddelete+0x5e/0x300 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81177ca0&amp;gt;&amp;#93;&lt;/span&gt; vfs_fstatat+0x60/0x80&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81177d2e&amp;gt;&amp;#93;&lt;/span&gt; vfs_lstat+0x1e/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81177d54&amp;gt;&amp;#93;&lt;/span&gt; sys_newlstat+0x24/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810d1b62&amp;gt;&amp;#93;&lt;/span&gt; ? audit_syscall_entry+0x272/0x2a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c635&amp;gt;&amp;#93;&lt;/span&gt; ? math_state_restore+0x45/0x60&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100b172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;2/ A thread on the server side is blocked and is reported 200 seconds later with the following trace:&lt;br/&gt;
(the thread was reported as blocked for 200s at 13:45:42, so it had been blocked since 13:42:22)&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt; Lustre: Service thread pid 24618 was inactive for 200.00s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:&lt;br/&gt;
Pid: 24618, comm: ll_ost_425&lt;/p&gt;

&lt;p&gt;Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810657ec&amp;gt;&amp;#93;&lt;/span&gt; ? lock_timer_base+0x3c/0x70&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8147eab0&amp;gt;&amp;#93;&lt;/span&gt; schedule_timeout+0x190/0x2d0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81065900&amp;gt;&amp;#93;&lt;/span&gt; ? process_timeout+0x0/0x10&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0595731&amp;gt;&amp;#93;&lt;/span&gt; cfs_waitq_timedwait+0x11/0x20 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0747453&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_set_wait+0x323/0x730 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8104c780&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x0/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0750e84&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_set_status+0x94/0x110 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0747916&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_queue_wait+0xb6/0x290 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa02a85ec&amp;gt;&amp;#93;&lt;/span&gt; ? lprocfs_counter_add+0x12c/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;lvfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa072cf3e&amp;gt;&amp;#93;&lt;/span&gt; ldlm_server_glimpse_ast+0x10e/0x3d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa081130d&amp;gt;&amp;#93;&lt;/span&gt; filter_intent_policy+0x35d/0x710 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdfilter&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa05a4805&amp;gt;&amp;#93;&lt;/span&gt; ? cfs_hash_bd_lookup_intent+0xe5/0x130 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa070bb8a&amp;gt;&amp;#93;&lt;/span&gt; ldlm_lock_enqueue+0x2da/0xa50 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa072a485&amp;gt;&amp;#93;&lt;/span&gt; ? ldlm_export_lock_get+0x15/0x20 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa05a3872&amp;gt;&amp;#93;&lt;/span&gt; ? cfs_hash_bd_add_locked+0x62/0x90 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0732407&amp;gt;&amp;#93;&lt;/span&gt; ldlm_handle_enqueue0+0x447/0x1090 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0754240&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_swab_ldlm_request+0x0/0x30 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa07330b6&amp;gt;&amp;#93;&lt;/span&gt; ldlm_handle_enqueue+0x66/0x70 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa07330c0&amp;gt;&amp;#93;&lt;/span&gt; ? ldlm_server_completion_ast+0x0/0x680 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa06dc870&amp;gt;&amp;#93;&lt;/span&gt; ? ost_blocking_ast+0x0/0xe40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ost&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa072ce30&amp;gt;&amp;#93;&lt;/span&gt; ? ldlm_server_glimpse_ast+0x0/0x3d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa06e4c6a&amp;gt;&amp;#93;&lt;/span&gt; ost_handle+0x22ba/0x4b90 &lt;span class=&quot;error&quot;&gt;&amp;#91;ost&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8104c792&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x12/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8103b9b9&amp;gt;&amp;#93;&lt;/span&gt; ? __wake_up_common+0x59/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0750444&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_get_opc+0x94/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0761459&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xc79/0x19d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810017ac&amp;gt;&amp;#93;&lt;/span&gt; ? __switch_to+0x1ac/0x320&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa07607e0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x19d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810041aa&amp;gt;&amp;#93;&lt;/span&gt; child_rip+0xa/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa07607e0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x19d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810041a0&amp;gt;&amp;#93;&lt;/span&gt; ? child_rip+0x0/0x20&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;3/ The client reports another thread, blocked since 13:43:27 (reported at 13:45:27):&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;kernel INFO: task ldlm_cb_05:9780 blocked for more than 120 seconds.&lt;br/&gt;
kernel &quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.&lt;br/&gt;
kernel ldlm_cb_05    D 0000000000000006     0  9780      2 0x00000080&lt;br/&gt;
 ffff880604557b90 0000000000000046 ffff880604557b58 ffff880604557b54&lt;br/&gt;
 ffff880604557b50 ffff88063fc24f00 ffff880655435f80 0000000100610371&lt;br/&gt;
 ffff88060ac5e6b8 ffff880604557fd8 000000000000f598 ffff88060ac5e6b8&lt;br/&gt;
 Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc84e&amp;gt;&amp;#93;&lt;/span&gt; __mutex_lock_slowpath+0x13e/0x180&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff814dc6eb&amp;gt;&amp;#93;&lt;/span&gt; mutex_lock+0x2b/0x50&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa068c7b8&amp;gt;&amp;#93;&lt;/span&gt; cl_lock_mutex_get+0x78/0xd0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa097d91e&amp;gt;&amp;#93;&lt;/span&gt; osc_ldlm_glimpse_ast+0x7e/0x160 &lt;span class=&quot;error&quot;&gt;&amp;#91;osc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa076e304&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_get_opc+0x94/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa074d6bb&amp;gt;&amp;#93;&lt;/span&gt; ldlm_callback_handler+0x142b/0x21c0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa07709ac&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_get_transno+0x7c/0xe0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa076e304&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_get_opc+0x94/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa077ef59&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xbb9/0x1990 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa077e3a0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x1990 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c1ca&amp;gt;&amp;#93;&lt;/span&gt; child_rip+0xa/0x20&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa077e3a0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x1990 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c1c0&amp;gt;&amp;#93;&lt;/span&gt; ? child_rip+0x0/0x20&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;4/ The two threads on the client stay blocked (and are reported every 2 minutes) until the server finally evicts the client for the first time at 13:49:38:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;LustreError: 0:0:(ldlm_lockd.c:356:waiting_locks_callback()) ### lock callback timer expired after 155s: evicting client at 172.17.1.230@o2ib  ns: filter-scratch-OST001c_UUID lock: ffff8800774e0d80/0xaa844a441355c6d7 lrc: 3/0,0 mode: PW/PW res: 3545740/0 rrc: 2 type: EXT &lt;span class=&quot;error&quot;&gt;&amp;#91;0-&amp;gt;18446744073709551615&amp;#93;&lt;/span&gt; (req 0-&amp;gt;18446744073709551615) flags: 0x10020 remote: 0x49baa5ebdb4811c0 expref: 15 pid: 5451 timeout 4476555308&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;5/ The client-side &quot;ls&quot; thread unblocks (the watchdog stops reporting it) with the message:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1333111826 2012 Mar 30 13:50:26 willowb11 kern err kernel LustreError: 11982:0:(cl_io.c:1700:cl_sync_io_wait()) SYNC IO failed with error: -110, try to cancel 14 remaining pages&lt;br/&gt;
1333111826 2012 Mar 30 13:50:26 willowb11 kern err kernel LustreError: 11982:0:(cl_io.c:965:cl_io_cancel()) Canceling ongoing page trasmission&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;But the ldlm_cb_02 thread is still blocked at that point.&lt;/p&gt;

&lt;p&gt;6/ The server evicts the client a second time at 13:53:34:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Lustre: scratch-OST001c: haven&apos;t heard from client e33a9e90-14c3-9556-290a-e2f09abad36a (at 172.17.1.230@o2ib) in 227 seconds. I think it&apos;s dead, and I am evicting it. exp ffff8805ebfa1400, cur 1333112014 expire 1333111864 last 1333111787&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;7/ At that point, no threads remain blocked, but the client cannot reconnect to the OST and logs these messages:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt; LustreError: 167-0: This client was evicted by scratch-OST0023; in progress operations using this service will fail.&lt;br/&gt;
 LustreError: 2829:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805f7714000 x1397861693879877/t0(0) o-1-&amp;gt;scratch-OST0023_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2830:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805e483d400 x1397861693879881/t0(0) o-1-&amp;gt;scratch-OST0023_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2830:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805e4beec00 x1397861693879901/t0(0) o-1-&amp;gt;scratch-OST0023_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2830:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 1 previous similar message&lt;br/&gt;
 LustreError: 11-0: an error occurred while communicating with 172.17.0.7@o2ib. The ldlm_enqueue operation failed with -107&lt;br/&gt;
 Lustre: scratch-OST001c-osc-ffff880620978000: Connection to service scratch-OST001c via nid 172.17.0.7@o2ib was lost; in progress operations using this service will wait for recovery to complete.&lt;br/&gt;
 LustreError: 167-0: This client was evicted by scratch-OST001c; in progress operations using this service will fail.&lt;br/&gt;
 LustreError: 2831:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805e5af9800 x1397861693879966/t0(0) o-1-&amp;gt;scratch-OST0023_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2831:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 2 previous similar messages&lt;br/&gt;
 Lustre: 9014:0:(cl_lock.c:2025:cl_lock_page_out()) Writing 1 pages error: -108&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805da30f400 x1397861693880082/t0(0) o-1-&amp;gt;scratch-OST001c_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 17 previous similar messages&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805d9f09400 x1397861693880324/t0(0) o-1-&amp;gt;scratch-OST0023_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 1 previous similar message&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805ec787c00 x1397861693881645/t0(0) o-1-&amp;gt;scratch-OST0023_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 14 previous similar messages&lt;br/&gt;
 LustreError: 2833:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8806228ff000 x1397861693882624/t0(0) o-1-&amp;gt;scratch-OST001c_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2833:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 13 previous similar messages&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff8805e4a01c00 x1397861693883727/t0(0) o-1-&amp;gt;scratch-OST001c_UUID@172.17.0.7@o2ib:28/4 lens 296/352 e 0 to 0 dl 0 ref 1 fl Rpc:/ffffffff/ffffffff rc 0/-1&lt;br/&gt;
 LustreError: 2825:0:(client.c:1076:ptlrpc_import_delay_req()) Skipped 14 previous similar messages&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I have attached the full logs of the client and of the server hosting the affected OST, plus a stack trace of all client threads in the final state (unable to reconnect).&lt;/p&gt;

&lt;p&gt;The only workaround found is to reboot the client. This is critical because it can happen several times an hour and occurs on the login nodes of the cluster, affecting many users at once.&lt;/p&gt;
</description>
                <environment>RHEL 6.1 clients with Bull&amp;#39;s Advanced Edition suite.&lt;br/&gt;
Client version: lustre-modules-2.1.0-2.6.32_131.0.15.el6.x86_64_Bull.2.204.el6&lt;br/&gt;
Server version: lustre-modules-2.1.0-2.6.32_131.17.1.bl6.Bull.27.0.x86_64_Bull.2.207.bl6</environment>
        <key id="13806">LU-1274</key>
            <summary>Client threads block for sometime before being evicted and can never reconnect afterward</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="jay">Jinshan Xiong</assignee>
                                    <reporter username="spiechurski">Sebastien Piechurski</reporter>
                        <labels>
                    </labels>
                <created>Fri, 30 Mar 2012 13:14:02 +0000</created>
                <updated>Fri, 1 Jun 2012 14:05:04 +0000</updated>
                            <resolved>Fri, 1 Jun 2012 14:05:04 +0000</resolved>
                                    <version>Lustre 2.1.0</version>
                                    <fixVersion>Lustre 2.3.0</fixVersion>
                    <fixVersion>Lustre 2.1.2</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="32957" author="jay" created="Fri, 30 Mar 2012 14:23:42 +0000"  >&lt;p&gt;This issue occurs because the client holds the cl_lock mutex while it is flushing pages. If the OST is really slow to finish that I/O (pretty common in 2.x), the glimpse AST is blocked for a long time, which finally causes this client to be evicted.&lt;/p&gt;

&lt;p&gt;To fix this issue, we shouldn&apos;t hold the cl_lock mutex while writing the pages out. I&apos;ll work out a workaround soon.&lt;/p&gt;</comment>
                            <comment id="32977" author="jay" created="Fri, 30 Mar 2012 16:00:39 +0000"  >&lt;p&gt;I think this patch will help a lot: &lt;a href=&quot;http://review.whamcloud.com/2426&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/2426&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="33028" author="hudson" created="Fri, 30 Mar 2012 23:58:43 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=i686,build_type=client,distro=el6,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; i686,client,el6,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33029" author="hudson" created="Sat, 31 Mar 2012 00:00:40 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=x86_64,build_type=client,distro=el6,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; x86_64,client,el6,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33030" author="hudson" created="Sat, 31 Mar 2012 00:02:26 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=x86_64,build_type=client,distro=el5,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; x86_64,client,el5,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33031" author="hudson" created="Sat, 31 Mar 2012 00:05:16 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=i686,build_type=server,distro=el5,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; i686,server,el5,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33032" author="hudson" created="Sat, 31 Mar 2012 00:06:22 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=i686,build_type=server,distro=el6,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; i686,server,el6,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33033" author="hudson" created="Sat, 31 Mar 2012 00:11:29 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; x86_64,server,el6,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33034" author="hudson" created="Sat, 31 Mar 2012 00:14:59 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=x86_64,build_type=server,distro=el5,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; x86_64,server,el5,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33036" author="hudson" created="Sat, 31 Mar 2012 00:16:20 +0000"  >&lt;p&gt;Integrated in &lt;span class=&quot;image-wrap&quot; style=&quot;&quot;&gt;&lt;img src=&quot;http://iu-build.whamcloud.com/images/16x16/blue.png&quot; style=&quot;border: 0px solid black&quot; /&gt;&lt;/span&gt; &lt;a href=&quot;http://iu-build.whamcloud.com/job/lustre-reviews/./arch=i686,build_type=client,distro=el5,ib_stack=inkernel/4602/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;lustre-reviews &#187; i686,client,el5,inkernel #4602&lt;/a&gt;&lt;br/&gt;
     &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1274&quot; title=&quot;Client threads block for sometime before being evicted and can never reconnect afterward&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1274&quot;&gt;&lt;del&gt;LU-1274&lt;/del&gt;&lt;/a&gt; osc: Do not grab mutex of cl_lock for glimpse (Revision 803b8d90fbe0f86c0b78c6c03401652061501a31)&lt;/p&gt;

&lt;p&gt;     Result = SUCCESS&lt;br/&gt;
Jinshan Xiong : &lt;a href=&quot;http://git.whamcloud.com/gitweb/?p=fs/lustre-release.git&amp;amp;a=commit&amp;amp;h=803b8d90fbe0f86c0b78c6c03401652061501a31&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;803b8d90fbe0f86c0b78c6c03401652061501a31&lt;/a&gt;&lt;br/&gt;
Files : &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lustre/osc/osc_lock.c&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="33075" author="spiechurski" created="Sat, 31 Mar 2012 10:41:24 +0000"  >&lt;p&gt;Thanks for this lightning-fast answer and patch!&lt;/p&gt;</comment>
                            <comment id="38917" author="bogl" created="Wed, 16 May 2012 10:46:06 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/#change,2808&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,2808&lt;/a&gt;&lt;br/&gt;
Backport to b2_1&lt;/p&gt;</comment>
                            <comment id="39499" author="patrick.valentin" created="Tue, 29 May 2012 06:53:40 +0000"  >&lt;p&gt;On-site support reports that the problem has not recurred since the efix containing the patch proposed in this Jira ticket was installed one month ago.&lt;/p&gt;</comment>
                            <comment id="39826" author="pjones" created="Fri, 1 Jun 2012 14:05:04 +0000"  >&lt;p&gt;Landed for 2.1.2 and 2.3&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="11033" name="syslog-CLIENT_willowb11-20120330" size="34539" author="spiechurski" created="Fri, 30 Mar 2012 13:14:02 +0000"/>
                            <attachment id="11034" name="syslog-CLIENT_willowb11-20120330-sysrq-trigger" size="1067701" author="spiechurski" created="Fri, 30 Mar 2012 13:14:02 +0000"/>
                            <attachment id="11035" name="syslog-OSS_willowb6-20120330" size="18805" author="spiechurski" created="Fri, 30 Mar 2012 13:14:02 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzv6lb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>4602</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>