<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:26:29 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-16375] dump more information for threads blocked on local DLM locks</title>
                <link>https://jira.whamcloud.com/browse/LU-16375</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;When a server thread is blocked on a DLM lock, it is often difficult to see what the other threads in the system are doing with that lock, and why it is being held for a long time:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;lfs02-n29 kernel: Pid: 16214, comm: ll_ost11_002 3.10.0-1160.45.1.el7.x86_64
lfs02-n29 kernel: Call Trace:
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ldlm_completion_ast+0x777/0x9d0 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ldlm_cli_enqueue_local+0x25c/0x850 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ofd_destroy_by_fid+0x1d1/0x500 [ofd]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ofd_destroy_hdl+0x267/0xa00 [ofd]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] tgt_request_handle+0x7f3/0x1760 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ptlrpc_server_handle_request+0x253/0xb30 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ptlrpc_main+0xb3c/0x14d0 [ptlrpc]
lfs02-n29 kernel: Pid: 16274, comm: ll_ost14_005 3.10.0-1160.45.1.el7.x86_64
lfs02-n29 kernel: Call Trace:
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ldlm_completion_ast+0x777/0x9d0 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ldlm_cli_enqueue_local+0x25c/0x850 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ofd_destroy_by_fid+0x1d1/0x500 [ofd]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ofd_destroy_hdl+0x267/0xa00 [ofd]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] tgt_request_handle+0x7f3/0x1760 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ptlrpc_server_handle_request+0x253/0xb30 [ptlrpc]
lfs02-n29 kernel: [&amp;lt;0&amp;gt;] ptlrpc_main+0xb3c/0x14d0 [ptlrpc]
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;and then later these threads time out on their DLM blocking AST, which dumps the resource FID for their request, but no information about which thread is holding the lock:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;lfs02-n29 kernel: LustreError: 16214:0:(ldlm_request.c:124:ldlm_expired_completion_wait())
    ### lock timed out (enqueued at 1670282438, 300s ago); not entering recovery in
    server code, just going back to sleep ns: filter-lfs02-OST0038_UUID
    lock: ffff984ae2538480/0x4bc7fa9cb92be450 lrc: 3/0,1 mode: --/PW
    res: [0x800000410:0x813652:0x0].0x0 rrc: 3 type: EXT [0-&amp;gt;18446744073709551615]
    (req 0-&amp;gt;18446744073709551615) gid 0 flags: 0x40010080000000 nid: local
    remote: 0x0 expref: -99 pid: 16214 timeout: 0 lvb_type: 0
lfs02-n29 kernel: LustreError: 16274:0:(ldlm_request.c:124:ldlm_expired_completion_wait())
    ### lock timed out (enqueued at 1670282473, 300s ago); not entering recovery in
    server code, just going back to sleep ns: filter-lfs02-OST0039_UUID
    lock: ffff98394b735680/0x4bc7fa9cb94f8af9 lrc: 3/0,1 mode: --/PW
    res: [0x84000040c:0x8b52d3:0x0].0x0 rrc: 3 type: EXT [0-&amp;gt;18446744073709551615]
    (req 0-&amp;gt;18446744073709551615) gid 0 flags: 0x40010080000000 nid: local
    remote: 0x0 expref: -99 pid: 16274 timeout: 0 lvb_type: 0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Since we &lt;b&gt;know&lt;/b&gt; this is a local lock holder (&quot;&lt;tt&gt;not entering recovery in server code&lt;/tt&gt;&quot;) it should be possible to add &lt;tt&gt;LDLM_ERROR()&lt;/tt&gt; printing of the conflicting locks held on that resource, and &lt;tt&gt;libcfs_debug_dumpstack()&lt;/tt&gt; for the PID(s) that are holding the lock(s).&lt;/p&gt;</description>
                <environment></environment>
        <key id="73538">LU-16375</key>
            <summary>dump more information for threads blocked on local DLM locks</summary>
                <type id="4" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11310&amp;avatarType=issuetype">Improvement</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="arshad512">Arshad Hussain</assignee>
                                    <reporter username="adilger">Andreas Dilger</reporter>
                        <labels>
                    </labels>
                <created>Thu, 8 Dec 2022 14:50:13 +0000</created>
                <updated>Mon, 22 Jan 2024 16:12:24 +0000</updated>
                                                                                <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="74971">LU-16625</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="65257">LU-14858</issuekey>
        </issuelink>
                            </outwardlinks>
                                                                <inwardlinks description="is related to">
                                                        </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i037ev:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>