<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:49:07 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-5169] Lustre client panic during MDS failover</title>
                <link>https://jira.whamcloud.com/browse/LU-5169</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;The setup is as follows:&lt;/p&gt;

&lt;p&gt;There are two filesystems:  pfs2dat2 and pfs2wor2&lt;/p&gt;

&lt;p&gt;Clients:&lt;br/&gt;
uc1n996&lt;br/&gt;
uc1n997&lt;/p&gt;

&lt;p&gt;For pfs2dat2:&lt;br/&gt;
MDS: pfs2n12/13&lt;br/&gt;
OSS: pfs2n14/15&lt;/p&gt;

&lt;p&gt;For pfs2wor2:&lt;br/&gt;
MDS: pfs2n16/17&lt;br/&gt;
OSS: pfs2n18/19/20/21&lt;/p&gt;

&lt;p&gt;The two MDSes involved in failover were pfs2n12 and pfs2n13. The client uc1n996 panicked with the following stack trace:&lt;br/&gt;
last sysfs file: /sys/devices/system/cpu/online&lt;br/&gt;
CPU 5&lt;br/&gt;
Modules linked in: iptable_filter ip_tables nfs lockd fscache auth_rpcgss nfs_acl sunrpc lmv(U) fld(U) mgc(U) lustre(U) lov(U) osc(U) mdc(U) fid(U) ko2iblnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) sha512_generic sha256_generic crc32c_intel libcfs(U) ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 dm_multipath vhost_net macvtap macvlan tun kvm_intel kvm uinput microcode iTCO_wdt iTCO_vendor_support acpi_pad power_meter dcdbas sg mlx4_ib ib_sa ib_mad ib_core mlx4_en mlx4_core sb_edac edac_core lpc_ich mfd_core shpchp igb i2c_algo_bit i2c_core ixgbe dca ptp pps_core mdio xfs exportfs sd_mod crc_t10dif wmi ahci megaraid_sas dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]&lt;/p&gt;

&lt;p&gt;Pid: 2895, comm: ptlrpcd_rcv Not tainted 2.6.32-431.11.2.el6.x86_64 #1 Dell Inc. PowerEdge R620/0PXXHP&lt;br/&gt;
RIP: 0010:[&amp;lt;ffffffffa0708bde&amp;gt;]  [&amp;lt;ffffffffa0708bde&amp;gt;] lustre_msg_get_opc+0xe/0x110 [ptlrpc]&lt;br/&gt;
RSP: 0018:ffff88082b5ddc80  EFLAGS: 00010282&lt;br/&gt;
RAX: ffff8800a585e208 RBX: 0000000000000000 RCX: ffff8801a22893a0&lt;br/&gt;
RDX: 0000000000000002 RSI: 0000000000000000 RDI: 3237323033093932&lt;br/&gt;
RBP: ffff88082b5ddc90 R08: 0000000000000000 R09: 00000000fffffffc&lt;br/&gt;
R10: 0000000000000002 R11: 0000000000000004 R12: ffff8809421d7000&lt;br/&gt;
R13: ffff8800a585e208 R14: 00000032a434f11a R15: ffff8801a22890c8&lt;br/&gt;
FS:  0000000000000000(0000) GS:ffff88085c440000(0000) knlGS:0000000000000000&lt;br/&gt;
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b&lt;br/&gt;
CR2: 000000346b2727d0 CR3: 000000102a8e5000 CR4: 00000000000407e0&lt;br/&gt;
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000&lt;br/&gt;
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400&lt;br/&gt;
Process ptlrpcd_rcv (pid: 2895, threadinfo ffff88082b5dc000, task ffff8808314deaa0)&lt;br/&gt;
Stack:&lt;br/&gt;
ffff88082b5ddc90 0000000000000000 ffff88082b5ddcd0 ffffffffa08b6c2d&lt;br/&gt;
&amp;lt;d&amp;gt; ffff880563411000 ffff8801a2289000 ffff8801a2289000 ffff88102d915800&lt;br/&gt;
&amp;lt;d&amp;gt; ffff8801a22892e0 00000032a434f11a ffff88082b5ddd00 ffffffffa06fd312&lt;br/&gt;
Call Trace:&lt;br/&gt;
[&amp;lt;ffffffffa08b6c2d&amp;gt;] mdc_replay_open+0xad/0x420 [mdc]&lt;br/&gt;
[&amp;lt;ffffffffa06fd312&amp;gt;] ptlrpc_replay_interpret+0x142/0x740 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa06fe994&amp;gt;] ptlrpc_check_set+0x2c4/0x1b40 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa0729ebb&amp;gt;] ptlrpcd_check+0x53b/0x560 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa072a3db&amp;gt;] ptlrpcd+0x20b/0x370 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffff81065df0&amp;gt;] ? default_wake_function+0x0/0x20&lt;br/&gt;
[&amp;lt;ffffffffa072a1d0&amp;gt;] ? ptlrpcd+0x0/0x370 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffff8109aee6&amp;gt;] kthread+0x96/0xa0&lt;br/&gt;
[&amp;lt;ffffffff8100c20a&amp;gt;] child_rip+0xa/0x20&lt;br/&gt;
[&amp;lt;ffffffff8109ae50&amp;gt;] ? kthread+0x0/0xa0&lt;br/&gt;
[&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20&lt;br/&gt;
Code: 24 48 48 83 c4 68 4c 89 e0 5b 41 5c 41 5d 41 5e 41 5f c9 c3 45 31 e4 e9 26 ff ff ff 90 55 48 89 e5 53 48 83 ec 08 0f 1f 44 00 00 &amp;lt;81&amp;gt; 7f 08 d3 0b d0 0b 48 89 fb 74 76 c7 05 fc 7e 0a 00 00 01 00&lt;br/&gt;
RIP  [&amp;lt;ffffffffa0708bde&amp;gt;] lustre_msg_get_opc+0xe/0x110 [ptlrpc]&lt;br/&gt;
RSP &amp;lt;ffff88082b5ddc80&amp;gt;&lt;br/&gt;
---[ end trace ee65cdcf6a61aa8a ]---&lt;/p&gt;

</description>
                <environment>Lustre servers: 2.4.3&lt;br/&gt;
Lustre clients: 2.5.1</environment>
        <key id="25083">LU-5169</key>
            <summary>Lustre client panic during MDS failover</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="hongchao.zhang">Hongchao Zhang</assignee>
                                    <reporter username="spimpale">Swapnil Pimpale</reporter>
                        <labels>
                    </labels>
                <created>Tue, 10 Jun 2014 12:11:25 +0000</created>
                <updated>Fri, 29 Jan 2016 00:29:09 +0000</updated>
                            <resolved>Fri, 29 Jan 2016 00:29:09 +0000</resolved>
                                    <version>Lustre 2.5.1</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>10</watches>
                                                                            <comments>
                            <comment id="86203" author="spimpale" created="Tue, 10 Jun 2014 12:16:37 +0000"  >&lt;p&gt;I have uploaded the following logs to ftp.whamcloud.com (/uploads/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5169&quot; title=&quot;Lustre client panic during MDS failover&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5169&quot;&gt;&lt;del&gt;LU-5169&lt;/del&gt;&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;Client logs: 2014-06-05-ddn_lustre_showall_clients_case_kernel_panic_20140605.tar&lt;br/&gt;
Server logs: 2014-06-05-SR31415_es_lustre_showall_2014-06-05_091605.tar.bz2&lt;/p&gt;</comment>
                            <comment id="86312" author="pjones" created="Wed, 11 Jun 2014 13:20:40 +0000"  >&lt;p&gt;Hongchao&lt;/p&gt;

&lt;p&gt;Could you please assist with this issue?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="86334" author="green" created="Wed, 11 Jun 2014 17:08:45 +0000"  >&lt;p&gt;This looks like &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3333&quot; title=&quot;lustre_msg_get_opc()) incorrect message magic: a0b03b5 LBUG&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3333&quot;&gt;&lt;del&gt;LU-3333&lt;/del&gt;&lt;/a&gt; to me&lt;/p&gt;</comment>
                            <comment id="86411" author="hongchao.zhang" created="Thu, 12 Jun 2014 09:26:56 +0000"  >&lt;p&gt;Are the logs from around the time of the panic available? I can&apos;t find them in the uploaded logs. Thanks.&lt;br/&gt;
By the way, could you please print the code lines at &quot;lustre_msg_get_opc + 0xe&quot;, since the ptlrpc module could be different? Thanks.&lt;/p&gt;</comment>
                            <comment id="87348" author="rganesan@ddn.com" created="Tue, 24 Jun 2014 09:14:37 +0000"  >&lt;p&gt;No logs were collected during the panic. Is it possible to get the patch (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3333&quot; title=&quot;lustre_msg_get_opc()) incorrect message magic: a0b03b5 LBUG&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3333&quot;&gt;&lt;del&gt;LU-3333&lt;/del&gt;&lt;/a&gt;) for 2.5.1?&lt;/p&gt;</comment>
                            <comment id="87366" author="pjones" created="Tue, 24 Jun 2014 14:21:11 +0000"  >&lt;p&gt;Perhaps it would make more sense to upgrade to 2.5.2 (due out imminently) in order to get this fix?&lt;/p&gt;</comment>
                            <comment id="87573" author="rganesan@ddn.com" created="Thu, 26 Jun 2014 15:41:49 +0000"  >&lt;p&gt;Hello,&lt;/p&gt;

&lt;p&gt;The customer is fine with upgrading to 2.5.2. Could you please give me a link to download the very latest master build of 2.5.2, so that the following patches will be covered?&lt;/p&gt;


&lt;p&gt;1. the do_statahead_enter() LBUG ( &lt;a href=&quot;http://review.whamcloud.com/10363&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/10363&lt;/a&gt; )&lt;br/&gt;
The patch is reported as included in v2_5_2_RC2.&lt;br/&gt;
2. the lovsub_lock_state() LBUG patch (&lt;a href=&quot;http://review.whamcloud.com/9881&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/9881&lt;/a&gt;)&lt;br/&gt;
The patch is reported as included in v2_5_60_0 and branch master.&lt;/p&gt;




&lt;p&gt;Thanks,&lt;br/&gt;
Rajesh&lt;/p&gt;</comment>
                            <comment id="87591" author="pjones" created="Thu, 26 Jun 2014 17:26:14 +0000"  >&lt;p&gt;2.5.2 also includes the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-4558&quot; title=&quot;Crash in cl_lock_put on racer&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-4558&quot;&gt;&lt;del&gt;LU-4558&lt;/del&gt;&lt;/a&gt; fix - &lt;a href=&quot;http://git.whamcloud.com/fs/lustre-release.git/commit/deb1e8aa6836ad073d53bf3e4dd29a2cb5696f2e&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://git.whamcloud.com/fs/lustre-release.git/commit/deb1e8aa6836ad073d53bf3e4dd29a2cb5696f2e&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The release can be accessed at &lt;a href=&quot;http://downloads.whamcloud.com/public/lustre/latest-maintenance-release/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://downloads.whamcloud.com/public/lustre/latest-maintenance-release/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="113836" author="icostelloddn" created="Thu, 30 Apr 2015 02:21:31 +0000"  >&lt;p&gt;Is this a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5507&quot; title=&quot;sanity-quota test_18: Oops: IP: lustre_msg_get_opc+0xe/0x110 [ptlrpc]&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5507&quot;&gt;&lt;del&gt;LU-5507&lt;/del&gt;&lt;/a&gt;? It looks like the patch &lt;a href=&quot;http://review.whamcloud.com/#/c/12667/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/12667/&lt;/a&gt; will resolve the issue above. I have seen the above problem at the ANU/NCI site.&lt;/p&gt;</comment>
                            <comment id="113845" author="hongchao.zhang" created="Thu, 30 Apr 2015 06:08:19 +0000"  >&lt;p&gt;Yes, it seems to be the same issue. Thanks!&lt;/p&gt;</comment>
                            <comment id="140469" author="jfc" created="Fri, 29 Jan 2016 00:29:09 +0000"  >&lt;p&gt;We are marking this as resolved/duplicate.&lt;/p&gt;

&lt;p&gt;Many thanks,&lt;br/&gt;
~ jfc.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="26074">LU-5507</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzwo1j:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>14248</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10021"><![CDATA[2]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>