<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:35:40 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3643] hsm_restore caused OSS node crash</title>
                <link>https://jira.whamcloud.com/browse/LU-3643</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;How to reproduce:&lt;br/&gt;
1. Copy a file of about 85 MB to a Lustre file system.&lt;br/&gt;
2. Archive and release the file.&lt;br/&gt;
3. Calculate the md5sum of the file, which triggers a restore.&lt;/p&gt;

&lt;p&gt;OSS nodes crash with the following message on the console:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;BUG: unable to handle kernel NULL pointer dereference at 0000000000000040
IP: [&amp;lt;ffffffffa0b44fc5&amp;gt;] osd_xattr_get+0x155/0x2d0 [osd_ldiskfs]
PGD 0 
Oops: 0000 [#1] SMP 
last sysfs file: /sys/devices/system/cpu/possible
CPU 0 
Modules linked in: lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) osd_ldiskfs(U) fsfilt_ldiskfs(U) ldiskfs(U) mdd(U) mgs(U) lquota(U) lfsck(U) jbd2 jbd obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) sha512_generic sha256_generic libcfs(U) mbcache exportfs virtio_balloon i2c_piix4 i2c_core sg virtio_blk virtio_net sr_mod cdrom virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod nfs lockd fscache auth_rpcgss nfs_acl sunrpc be2iscsi bnx2i cnic uio ipv6 cxgb4i cxgb4 cxgb3i libcxgbi cxgb3 mdio libiscsi_tcp qla4xxx iscsi_boot_sysfs libiscsi scsi_transport_iscsi [last unloaded: scsi_wait_scan]

Pid: 2778, comm: ll_ost00_003 Not tainted 2.6.32-358.11.1.el6.x86_64 #1 Bochs Bochs
RIP: 0010:[&amp;lt;ffffffffa0b44fc5&amp;gt;]  [&amp;lt;ffffffffa0b44fc5&amp;gt;] osd_xattr_get+0x155/0x2d0 [osd_ldiskfs]
RSP: 0018:ffff8801032c1ad0  EFLAGS: 00010202
RAX: 0000000000000000 RBX: ffffffffffffff30 RCX: 0000000000000000
RDX: ffff880116f2c200 RSI: ffffffffa0b73540 RDI: ffffffffa0b882e0
RBP: ffff8801032c1b10 R08: fffffffffffffffe R09: 00000000ffffffef
R10: 000000000000000f R11: 000000000000000f R12: ffff8801032c1b38
R13: ffff8801032c1b20 R14: 0000000000000000 R15: ffffffffa0500dbb
FS:  0000000000000000(0000) GS:ffff880028200000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000040 CR3: 0000000117b17000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process ll_ost00_003 (pid: 2778, threadinfo ffff8801032c0000, task ffff880116ea0040)
Stack:
 fffffffffffffffe ffff880101dae000 fffffffffffffffe ffff880119c2bc40
&amp;lt;d&amp;gt; 00000000fffffffe ffff880102c55440 ffff880105bc2bc0 ffff880101fda1d0
&amp;lt;d&amp;gt; ffff8801032c1b40 ffffffffa04c0284 ffff8801032c1b38 0000000000000008
Call Trace:
 [&amp;lt;ffffffffa04c0284&amp;gt;] dt_version_get+0x54/0x170 [obdclass]
 [&amp;lt;ffffffffa0d4248e&amp;gt;] ofd_getattr+0x30e/0x610 [ofd]
 [&amp;lt;ffffffffa0c94efc&amp;gt;] ost_getattr+0x40c/0x950 [ost]
 [&amp;lt;ffffffffa0351fd1&amp;gt;] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [&amp;lt;ffffffffa0c9fa6b&amp;gt;] ost_handle+0x1ebb/0x40e0 [ost]
 [&amp;lt;ffffffffa034dd84&amp;gt;] ? libcfs_id2str+0x74/0xb0 [libcfs]
 [&amp;lt;ffffffffa0659158&amp;gt;] ptlrpc_server_handle_request+0x398/0xc60 [ptlrpc]
 [&amp;lt;ffffffffa034254e&amp;gt;] ? cfs_timer_arm+0xe/0x10 [libcfs]
 [&amp;lt;ffffffffa0353a8f&amp;gt;] ? lc_watchdog_touch+0x6f/0x170 [libcfs]
 [&amp;lt;ffffffffa0650569&amp;gt;] ? ptlrpc_wait_event+0xa9/0x290 [ptlrpc]
 [&amp;lt;ffffffffa0351fd1&amp;gt;] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [&amp;lt;ffffffff81055af3&amp;gt;] ? __wake_up+0x53/0x70
 [&amp;lt;ffffffffa065a4dd&amp;gt;] ptlrpc_main+0xabd/0x1700 [ptlrpc]
 [&amp;lt;ffffffffa0659a20&amp;gt;] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
 [&amp;lt;ffffffff810969e6&amp;gt;] kthread+0x96/0xa0
 [&amp;lt;ffffffff8100c0ca&amp;gt;] child_rip+0xa/0x20
 [&amp;lt;ffffffff81096950&amp;gt;] ? kthread+0x0/0xa0
 [&amp;lt;ffffffff8100c0c0&amp;gt;] ? child_rip+0x0/0x20
Code: 52 33 04 00 6e 0a 00 00 48 c7 c6 40 35 b7 a0 48 c7 05 4c 33 04 00 00 00 00 00 c7 05 3a 33 04 00 02 00 00 00 48 c7 c7 e0 82 b8 a0 &amp;lt;48&amp;gt; 8b 48 40 48 8b 93 10 04 00 00 31 c0 e8 b9 cf 80 ff 48 8b 83 
RIP  [&amp;lt;ffffffffa0b44fc5&amp;gt;] osd_xattr_get+0x155/0x2d0 [osd_ldiskfs]
 RSP &amp;lt;ffff8801032c1ad0&amp;gt;
CR2: 0000000000000040
---[ end trace 124bcb67ec32d88c ]---
Kernel panic - not syncing: Fatal exception
Pid: 2778, comm: ll_ost00_003 Tainted: G      D    ---------------    2.6.32-358.11.1.el6.x86_64 #1
Call Trace:
 [&amp;lt;ffffffff8150e4ac&amp;gt;] ? panic+0xa7/0x16f
 [&amp;lt;ffffffff815126d4&amp;gt;] ? oops_end+0xe4/0x100
 [&amp;lt;ffffffff81046c3b&amp;gt;] ? no_context+0xfb/0x260
 [&amp;lt;ffffffff81046ec5&amp;gt;] ? __bad_area_nosemaphore+0x125/0x1e0
 [&amp;lt;ffffffff81281a16&amp;gt;] ? vsnprintf+0x336/0x5e0
 [&amp;lt;ffffffff81046f93&amp;gt;] ? bad_area_nosemaphore+0x13/0x20
 [&amp;lt;ffffffff810476f1&amp;gt;] ? __do_page_fault+0x321/0x480
 [&amp;lt;ffffffffa034127b&amp;gt;] ? cfs_set_ptldebug_header+0x2b/0xc0 [libcfs]
 [&amp;lt;ffffffffa03518eb&amp;gt;] ? libcfs_debug_vmsg2+0x50b/0xbb0 [libcfs]
 [&amp;lt;ffffffffa03518eb&amp;gt;] ? libcfs_debug_vmsg2+0x50b/0xbb0 [libcfs]
 [&amp;lt;ffffffff815145fe&amp;gt;] ? do_page_fault+0x3e/0xa0
 [&amp;lt;ffffffff815119b5&amp;gt;] ? page_fault+0x25/0x30
 [&amp;lt;ffffffffa0b44fc5&amp;gt;] ? osd_xattr_get+0x155/0x2d0 [osd_ldiskfs]
 [&amp;lt;ffffffffa04c0284&amp;gt;] ? dt_version_get+0x54/0x170 [obdclass]
 [&amp;lt;ffffffffa0d4248e&amp;gt;] ? ofd_getattr+0x30e/0x610 [ofd]
 [&amp;lt;ffffffffa0c94efc&amp;gt;] ? ost_getattr+0x40c/0x950 [ost]
 [&amp;lt;ffffffffa0351fd1&amp;gt;] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [&amp;lt;ffffffffa0c9fa6b&amp;gt;] ? ost_handle+0x1ebb/0x40e0 [ost]
 [&amp;lt;ffffffffa034dd84&amp;gt;] ? libcfs_id2str+0x74/0xb0 [libcfs]
 [&amp;lt;ffffffffa0659158&amp;gt;] ? ptlrpc_server_handle_request+0x398/0xc60 [ptlrpc]
 [&amp;lt;ffffffffa034254e&amp;gt;] ? cfs_timer_arm+0xe/0x10 [libcfs]
 [&amp;lt;ffffffffa0353a8f&amp;gt;] ? lc_watchdog_touch+0x6f/0x170 [libcfs]
 [&amp;lt;ffffffffa0650569&amp;gt;] ? ptlrpc_wait_event+0xa9/0x290 [ptlrpc]
 [&amp;lt;ffffffffa0351fd1&amp;gt;] ? libcfs_debug_msg+0x41/0x50 [libcfs]
 [&amp;lt;ffffffff81055af3&amp;gt;] ? __wake_up+0x53/0x70
 [&amp;lt;ffffffffa065a4dd&amp;gt;] ? ptlrpc_main+0xabd/0x1700 [ptlrpc]
 [&amp;lt;ffffffffa0659a20&amp;gt;] ? ptlrpc_main+0x0/0x1700 [ptlrpc]
 [&amp;lt;ffffffff810969e6&amp;gt;] ? kthread+0x96/0xa0
 [&amp;lt;ffffffff8100c0ca&amp;gt;] ? child_rip+0xa/0x20
 [&amp;lt;ffffffff81096950&amp;gt;] ? kthread+0x0/0xa0
 [&amp;lt;ffffffff8100c0c0&amp;gt;] ? child_rip+0x0/0x20
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It looks like the inode of the dt_object was NULL for some reason; I will take a further look.&lt;/p&gt;

&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;jxiong@titan:osd-ldiskfs$ gdb osd_ldiskfs.ko
GNU gdb (GDB) Fedora (7.3.50.20110722-16.fc16)
Copyright (C) 2011 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later &amp;lt;http://gnu.org/licenses/gpl.html&amp;gt;
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type &quot;show copying&quot;
and &quot;show warranty&quot; for details.
This GDB was configured as &quot;x86_64-redhat-linux-gnu&quot;.
For bug reporting instructions, please see:
&amp;lt;http://www.gnu.org/software/gdb/bugs/&amp;gt;...
Reading symbols from /exports/nfsroot/home/jxiong/srcs/lustre/lustre/osd-ldiskfs/osd_ldiskfs.ko...done.
(gdb) l *(osd_xattr_get+0x155)
0x9ff5 is in osd_xattr_get (/home/jxiong/srcs/lustre/lustre/osd-ldiskfs/osd_handler.c:2669).
2664	&lt;span class=&quot;code-keyword&quot;&gt;static&lt;/span&gt; &lt;span class=&quot;code-object&quot;&gt;int&lt;/span&gt; osd_object_version_get(&lt;span class=&quot;code-keyword&quot;&gt;const&lt;/span&gt; struct lu_env *env,
2665	                                  struct dt_object *dt, dt_obj_version_t *ver)
2666	{
2667	        struct inode *inode = osd_dt_obj(dt)-&amp;gt;oo_inode;
2668	
2669	        CDEBUG(D_INODE, &lt;span class=&quot;code-quote&quot;&gt;&quot;Get version &quot;&lt;/span&gt;LPX64&lt;span class=&quot;code-quote&quot;&gt;&quot; for inode %lu\n&quot;&lt;/span&gt;,
2670	               LDISKFS_I(inode)-&amp;gt;i_fs_version, inode-&amp;gt;i_ino);
2671	        *ver = LDISKFS_I(inode)-&amp;gt;i_fs_version;
2672	        &lt;span class=&quot;code-keyword&quot;&gt;return&lt;/span&gt; 0;
2673	}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment></environment>
        <key id="20013">LU-3643</key>
            <summary>hsm_restore caused OSS node crash</summary>
                <type id="7" iconUrl="https://jira.whamcloud.com/images/icons/issuetypes/task_agile.png">Technical task</type>
                            <parent id="20020">LU-3647</parent>
                                    <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="jay">Jinshan Xiong</assignee>
                                    <reporter username="jay">Jinshan Xiong</reporter>
                        <labels>
                            <label>HSM</label>
                    </labels>
                <created>Fri, 26 Jul 2013 00:06:34 +0000</created>
                <updated>Fri, 13 Sep 2013 04:02:10 +0000</updated>
                            <resolved>Mon, 29 Jul 2013 18:07:33 +0000</resolved>
                                    <version>Lustre 2.5.0</version>
                                    <fixVersion>Lustre 2.4.1</fixVersion>
                    <fixVersion>Lustre 2.5.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="63018" author="jay" created="Fri, 26 Jul 2013 00:54:03 +0000"  >&lt;p&gt;Here is the patch: &lt;a href=&quot;http://review.whamcloud.com/7124&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/7124&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="63020" author="adegremont" created="Fri, 26 Jul 2013 08:32:46 +0000"  >&lt;p&gt;We hit this bug a few times during our tests, but I never found a way to reproduce it.&lt;br/&gt;
I will update your patch to add more improvements.&lt;/p&gt;</comment>
                            <comment id="63021" author="adegremont" created="Fri, 26 Jul 2013 09:15:02 +0000"  >&lt;p&gt;This should also land on b2_4. The patch could be cherry-picked as-is for b2_4. Do we need a separate Gerrit change for it anyway?&lt;/p&gt;</comment>
                            <comment id="63059" author="jay" created="Fri, 26 Jul 2013 17:21:56 +0000"  >&lt;p&gt;If there is no conflict and the change is obvious, we can cherry-pick it for 2.4 directly.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                            <customfield id="customfield_10070" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Project</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10040"><![CDATA[HSM]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvw9j:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9379</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>