<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:13:26 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1094] general protection fault in _debug_req()</title>
                <link>https://jira.whamcloud.com/browse/LU-1094</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We had five occurrences of this crash on OSS nodes in our classified Lustre 2.1 cluster.  Timeframe coincided with &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1085&quot; title=&quot;ASSERTION(cfs_atomic_read(&amp;amp;exp-&amp;gt;exp_refcount) == 0) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1085&quot;&gt;&lt;del&gt;LU-1085&lt;/del&gt;&lt;/a&gt;. Like the other bugs in that window, this crash was preceded by hundreds of messages like:&lt;/p&gt;

&lt;p&gt;LustreError: 14210:0:(genops.c:1270:class_disconnect_stale_exports()) ls5-OST0349: disconnect stale client &amp;#91;UUID&amp;#93;@&amp;lt;unknown&amp;gt;&lt;/p&gt;

&lt;p&gt;general protection fault: 0000 &amp;#91;#1&amp;#93; SMP&lt;br/&gt;
Pid: 13890, comm: ll_ost_34&lt;/p&gt;

&lt;p&gt;machine_kexec&lt;br/&gt;
crash_kexec&lt;br/&gt;
oops_end&lt;br/&gt;
die&lt;br/&gt;
do_general_protection&lt;br/&gt;
general_protection&lt;br/&gt;
&amp;#91;exception RIP: strnlen+9&amp;#93;&lt;br/&gt;
string&lt;br/&gt;
vsnprintf&lt;br/&gt;
libcfs_debug_vmsg2&lt;br/&gt;
_debug_req&lt;br/&gt;
target_send_reply_msg&lt;br/&gt;
target_send_reply&lt;br/&gt;
ost_handle&lt;br/&gt;
ptlrpc_main&lt;br/&gt;
kernel_thread&lt;/p&gt;</description>
                <environment>&lt;a href=&quot;https://github.com/chaos/lustre/commits/2.1.0-llnl&quot;&gt;https://github.com/chaos/lustre/commits/2.1.0-llnl&lt;/a&gt;&lt;br/&gt;
RHEL 6.2 </environment>
        <key id="13158">LU-1094</key>
            <summary>general protection fault in _debug_req()</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="green">Oleg Drokin</assignee>
                                    <reporter username="nedbass">Ned Bass</reporter>
                        <labels>
                            <label>paj</label>
                    </labels>
                <created>Fri, 10 Feb 2012 17:58:25 +0000</created>
                <updated>Mon, 30 Apr 2012 11:57:31 +0000</updated>
                            <resolved>Mon, 30 Apr 2012 11:57:31 +0000</resolved>
                                    <version>Lustre 2.1.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="28390" author="nedbass" created="Fri, 10 Feb 2012 18:02:19 +0000"  >&lt;p&gt;Comment copied from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1085&quot; title=&quot;ASSERTION(cfs_atomic_read(&amp;amp;exp-&amp;gt;exp_refcount) == 0) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1085&quot;&gt;&lt;del&gt;LU-1085&lt;/del&gt;&lt;/a&gt;:&lt;/p&gt;

&lt;p&gt;I did some digging in crash to see what state the ptlrpc_request was in.  I dug up the pointer address from the backtrace (let&apos;s call it &amp;lt;addr1&amp;gt; to save typing).  Then, resolving some of the strings that get passed to libcfs_debug_vmsg2() from _debug_req(), I see:&lt;/p&gt;


&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; struct ptlrpc_request.rq_import &amp;lt;addr1&amp;gt;
 rq_import = 0x0
crash&amp;gt; struct ptlrpc_request.rq_export &amp;lt;addr1&amp;gt;
 rq_export = &amp;lt;addr2&amp;gt;
crash&amp;gt; struct obd_export.exp_connection &amp;lt;addr2&amp;gt;
 exp_connection = 0x5a5a5a5a5a5a5a5a
crash&amp;gt; struct obd_export.exp_client_uuid &amp;lt;addr2&amp;gt;
 exp_client_uuid = { 
        uuid = &quot;ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ&quot;
 }
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;So the poison value in exp_connection and the bogus uuid (both are runs of the byte 0x5a, which is ASCII &quot;Z&quot;) suggest this export has already been destroyed.&lt;/p&gt;
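&lt;p&gt;A minimal sketch of that check, in Python purely for illustration: it tests whether the values read from crash consist entirely of one repeated fill byte. The helper names and the choice of 0x5a as the fill byte are assumptions based on the values shown here (0x5a is ASCII &quot;Z&quot;), not anything taken from the Lustre source.&lt;/p&gt;

```python
# Sketch only: decide whether values copied out of a crash session look like
# freed-memory poison. POISON_BYTE is an assumption inferred from the dump
# above (0x5a is ASCII "Z"); both helper names are hypothetical.

POISON_BYTE = 0x5A


def is_poisoned_pointer(value, width=8):
    """Return True if every byte of a pointer-sized value is the poison byte."""
    return value.to_bytes(width, "little") == bytes([POISON_BYTE]) * width


def is_poisoned_string(text):
    """Return True if a non-empty string is made only of the poison character."""
    return len(text) != 0 and set(text) == {chr(POISON_BYTE)}


# Values as reported by crash for the export in question.
exp_connection = 0x5A5A5A5A5A5A5A5A
uuid = "ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ"

print("exp_connection poisoned:", is_poisoned_pointer(exp_connection))
print("exp_client_uuid poisoned:", is_poisoned_string(uuid))
```

&lt;p&gt;If both checks report True, the structure was most likely read after it had been freed and overwritten, which is consistent with the strnlen fault on the uuid string.&lt;/p&gt;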

&lt;p&gt;For reference, here is a snippet from _debug_req() that uses these values:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;void _debug_req(struct ptlrpc_request *req,
                struct libcfs_debug_msg_data *msgdata,
                const char *fmt, ... )
{
        va_list args;
        va_start(args, fmt);
        libcfs_debug_vmsg2(msgdata, fmt, args,
                           &quot; req@%p x&quot;LPU64&quot;/t&quot;LPD64&quot;(&quot;LPD64&quot;) o%d-&amp;gt;%s@%s:%d/%d&quot;
                           &quot; lens %d/%d e %d to %d dl &quot;CFS_TIME_T&quot; ref %d &quot;
                           &quot;fl &quot;REQ_FLAGS_FMT&quot;/%x/%x rc %d/%d\n&quot;,
                           req, req-&amp;gt;rq_xid, req-&amp;gt;rq_transno,
                           req-&amp;gt;rq_reqmsg ? lustre_msg_get_transno(req-&amp;gt;rq_reqmsg) : 0,
                           req-&amp;gt;rq_reqmsg &amp;amp;&amp;amp; req_ptlrpc_body_swabbed(req) ?
                           lustre_msg_get_opc(req-&amp;gt;rq_reqmsg) : -1,
                           req-&amp;gt;rq_import ? obd2cli_tgt(req-&amp;gt;rq_import-&amp;gt;imp_obd) :
                           req-&amp;gt;rq_export ?
                           (char*)req-&amp;gt;rq_export-&amp;gt;exp_client_uuid.uuid : &quot;&amp;lt;?&amp;gt;&quot;,
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="28438" author="green" created="Sat, 11 Feb 2012 00:09:05 +0000"  >&lt;p&gt;I think this one also has a chance of being related to LU-106, so let&apos;s see whether runs with the patch help.&lt;/p&gt;</comment>
                            <comment id="35061" author="nedbass" created="Wed, 18 Apr 2012 19:45:04 +0000"  >&lt;p&gt;FYI, we did in fact hit this again with the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-106&quot; title=&quot;unable to handle kernel paging request in lprocfs_stats_collect()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-106&quot;&gt;&lt;del&gt;LU-106&lt;/del&gt;&lt;/a&gt; patch here:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/326&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/326&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="35062" author="nedbass" created="Wed, 18 Apr 2012 19:48:16 +0000"  >&lt;p&gt;Sorry, disregard previous comment.  We hit a new GPF, not this one.  &lt;/p&gt;</comment>
                            <comment id="35884" author="pjones" created="Mon, 30 Apr 2012 11:57:31 +0000"  >&lt;p&gt;Believed to be a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1092&quot; title=&quot;NULL pointer dereference in filter_export_stats_init()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1092&quot;&gt;&lt;del&gt;LU-1092&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvhcv:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6461</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>