<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:33:41 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3411] Encountered at NULL pointer exception for function osd_read_prep</title>
                <link>https://jira.whamcloud.com/browse/LU-3411</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;While running test application against the file system one of OSS crashed and rebooted. We managed to get a crash dump and form the early analysis determined the crash to occur in osd_read_prep. Here is what was dmesg ran in crash showed.&lt;/p&gt;

&lt;p&gt;[ 6304.888367] BUG: unable to handle kernel NULL pointer dereference at 0000000000000036&lt;br/&gt;
[ 6304.889314] IP: &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d31776&amp;gt;&amp;#93;&lt;/span&gt; osd_read_prep+0x326/0x3b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;osd_ldiskfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.889314] PGD 3dd3bb067 PUD 3dd177067 PMD 0 &lt;br/&gt;
[ 6304.935543] Oops: 0000 &lt;a href=&quot;#1&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;1&lt;/a&gt; SMP &lt;br/&gt;
[ 6304.935543] last sysfs file: /sys/devices/pci0000:00/0000:00:04.0/0000:0a:00.0/infiniband_mad/umad0/port&lt;br/&gt;
[ 6304.935543] CPU 5 &lt;br/&gt;
[ 6304.935543] Modules linked in: osp(U) ofd(U) ost(U) mgc(U) fsfilt_ldiskfs(U) osd_ldiskfs(U) ldiskfs(U) mbcache mdd(U) jbd2 lustre(U) lov(U) osc(U) &lt;br/&gt;
mdc(U) lquota(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ko2iblnd(U) lnet(U) sha512_generic sha256_generic libcfs(U) autofs4 scsi_wait_scan dm_rou&lt;br/&gt;
nd_robin sd_mod crc_t10dif ib_srp scsi_transport_srp scsi_tgt nf_conntrack_netbios_ns nf_conntrack_broadcast ipt_REJECT xt_comment nf_conntrack_ipv4 n&lt;br/&gt;
f_defrag_ipv4 xt_state nf_conntrack iptable_filter ip_tables ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ib_sa dm_mirror dm&lt;br/&gt;
_region_hash dm_log dm_multipath dm_mod sg radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core sr_mod cdrom shpchp ib_mthca ib_mad ib_core microcode d&lt;br/&gt;
cdbas serio_raw iTCO_wdt iTCO_vendor_support ata_generic pata_acpi ata_piix i5000_edac edac_core i5k_amb ioatdma dca ipv6 nfs lockd fscache auth_rpcgs&lt;br/&gt;
s nfs_acl sunrpc bnx2 &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: scsi_wait_scan&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543] &lt;br/&gt;
[ 6304.935543] Pid: 2910, comm: ll_ost_io01_003 Not tainted 2.6.32-358.6.1.el6.wc.x86_64 #1 Dell Inc. PowerEdge 1950/0TT740&lt;br/&gt;
[ 6304.935543] RIP: 0010:&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d31776&amp;gt;&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d31776&amp;gt;&amp;#93;&lt;/span&gt; osd_read_prep+0x326/0x3b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;osd_ldiskfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543] RSP: 0018:ffff880385b3b8f0  EFLAGS: 00010246&lt;br/&gt;
[ 6304.935543] RAX: fffffffffffffffe RBX: ffff8803a376a000 RCX: ffff880385ba2000&lt;br/&gt;
[ 6304.935543] RDX: 0000000000000100 RSI: ffff880385a36000 RDI: ffff880378e4f5f0&lt;br/&gt;
[ 6304.935543] RBP: ffff880385b3b960 R08: 0000000000000000 R09: 0000000000000000&lt;br/&gt;
[ 6304.935543] R10: 0000000000000100 R11: 0000000000000000 R12: ffff880385b12fd0&lt;br/&gt;
[ 6304.935543] R13: ffff880378e4f5f0 R14: ffff880385b13000 R15: 0000000000000100&lt;br/&gt;
[ 6304.935543] FS:  00007f7ca8560700(0000) GS:ffff880028340000(0000) knlGS:0000000000000000&lt;br/&gt;
[ 6304.935543] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b&lt;br/&gt;
[ 6304.935543] CR2: 0000000000000036 CR3: 00000003dd429000 CR4: 00000000000007e0&lt;br/&gt;
[ 6304.935543] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000&lt;br/&gt;
[ 6304.935543] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400&lt;br/&gt;
[ 6304.935543] Process ll_ost_io01_003 (pid: 2910, threadinfo ffff880385b3a000, task ffff88038c4f8040)&lt;br/&gt;
[ 6304.935543] Stack:&lt;br/&gt;
[ 6304.935543]  ffff8803a9b4c000 0000000092fb0380 ffff8803a9b4cbd0 0000010000000000&lt;br/&gt;
[ 6304.935543] &amp;lt;d&amp;gt; 0000000051a4df4c 00000000000eef4c 0000000051a4df4c 00000000000eeef3&lt;br/&gt;
[ 6304.935543] &amp;lt;d&amp;gt; ffff880385b3b960 ffffc900322a4d80 0000000000000100 0000000000000000&lt;br/&gt;
[ 6304.935543] Call Trace:&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa08c1a43&amp;gt;&amp;#93;&lt;/span&gt; ofd_preprw_read+0x253/0x7f0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa08c274a&amp;gt;&amp;#93;&lt;/span&gt; ofd_preprw+0x76a/0x13c0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0776fc1&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_pack_reply_v2+0x1e1/0x280 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa06d1e0c&amp;gt;&amp;#93;&lt;/span&gt; obd_preprw+0x12c/0x3d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ost&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa06d9780&amp;gt;&amp;#93;&lt;/span&gt; ost_brw_read+0xd60/0x1340 &lt;span class=&quot;error&quot;&gt;&amp;#91;ost&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81277779&amp;gt;&amp;#93;&lt;/span&gt; ? cpumask_next_and+0x29/0x50&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81059784&amp;gt;&amp;#93;&lt;/span&gt; ? find_busiest_group+0x244/0x9f0&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0777e6c&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_get_version+0x8c/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0777fc8&amp;gt;&amp;#93;&lt;/span&gt; ? lustre_msg_check_version+0xe8/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa06e0f28&amp;gt;&amp;#93;&lt;/span&gt; ost_handle+0x2ac8/0x48e0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ost&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa077ebab&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_update_export_timer+0x4b/0x560 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0787388&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_server_handle_request+0x398/0xc60 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa046b5de&amp;gt;&amp;#93;&lt;/span&gt; ? cfs_timer_arm+0xe/0x10 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa047cd8f&amp;gt;&amp;#93;&lt;/span&gt; ? lc_watchdog_touch+0x6f/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa077e6e9&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_wait_event+0xa9/0x290 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81055ab3&amp;gt;&amp;#93;&lt;/span&gt; ? __wake_up+0x53/0x70&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa078871e&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xace/0x1700 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0787c50&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x1700 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c0ca&amp;gt;&amp;#93;&lt;/span&gt; child_rip+0xa/0x20&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0787c50&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x1700 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0787c50&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_main+0x0/0x1700 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c0c0&amp;gt;&amp;#93;&lt;/span&gt; ? child_rip+0x0/0x20&lt;br/&gt;
[ 6304.935543] Code: 5c 41 5d 41 5e 41 5f c9 c3 0f 1f 40 00 48 8b 83 40 01 00 00 48 8b 8e 28 0c 00 00 45 31 c9 48 8b b6 10 0c 00 00 45 31 c0 4c 89 ef &amp;lt;ff&amp;gt; 50 38 48 8b 55 a0 4c 89 ee 48 89 df e8 68 d6 ff ff 89 45 9c &lt;br/&gt;
[ 6304.935543] RIP  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d31776&amp;gt;&amp;#93;&lt;/span&gt; osd_read_prep+0x326/0x3b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;osd_ldiskfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
[ 6304.935543]  RSP &amp;lt;ffff880385b3b8f0&amp;gt;&lt;br/&gt;
[ 6304.935543] CR2: 0000000000000036&lt;/p&gt;</description>
                <environment>OSS running Lustre 2.4.0-RC2 on a RHEL6.4 system.</environment>
        <key id="19174">LU-3411</key>
            <summary>Encountered at NULL pointer exception for function osd_read_prep</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bzzz">Alex Zhuravlev</assignee>
                                    <reporter username="simmonsja">James A Simmons</reporter>
                        <labels>
                    </labels>
                <created>Tue, 28 May 2013 17:52:03 +0000</created>
                <updated>Tue, 1 Oct 2013 14:40:29 +0000</updated>
                            <resolved>Tue, 6 Aug 2013 13:33:17 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                    <fixVersion>Lustre 2.4.1</fixVersion>
                    <fixVersion>Lustre 2.5.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                            <comments>
                            <comment id="59466" author="pjones" created="Tue, 28 May 2013 17:58:41 +0000"  >&lt;p&gt;Alex&lt;/p&gt;

&lt;p&gt;Could you please comment on this one?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="59469" author="adilger" created="Tue, 28 May 2013 18:35:19 +0000"  >&lt;p&gt;When you get a chance, can you please decode &lt;tt&gt;osd_read_prep+0x326/0x3b0&lt;/tt&gt; to a specific line number, and verify that 0x36 is a valid structure offset for a NULL pointer that is being accessed on that line.&lt;/p&gt;</comment>
                            <comment id="59470" author="simmonsja" created="Tue, 28 May 2013 18:38:21 +0000"  >&lt;p&gt;Uploaded the vmcore and debuginfo rpms to ftp.whamcloud.com/uploads/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3411&quot; title=&quot;Encountered at NULL pointer exception for function osd_read_prep&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3411&quot;&gt;&lt;del&gt;LU-3411&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="59472" author="simmonsja" created="Tue, 28 May 2013 18:43:21 +0000"  >&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;crash&amp;gt; l *(osd_read_prep+0x326)
0xffffffffa0d31776 is in osd_read_prep (/usr/src/debug/lustre-2.4.0/lustre/osd-ldiskfs/osd_io.c:930).
925             cfs_gettimeofday(&amp;amp;end);
926             timediff = cfs_timeval_sub(&amp;amp;end, &amp;amp;start, NULL);
927             lprocfs_counter_add(osd-&amp;gt;od_stats, LPROC_OSD_GET_PAGE, timediff);
928     
929             &lt;span class=&quot;code-keyword&quot;&gt;if&lt;/span&gt; (iobuf-&amp;gt;dr_npages) {
930                 rc = osd-&amp;gt;od_fsops-&amp;gt;fs_map_inode_pages(inode, iobuf-&amp;gt;dr_pages,
931                                                            iobuf-&amp;gt;dr_npages,
932                                                            iobuf-&amp;gt;dr_blocks,
933                                                            0, NULL);
934                     rc = osd_do_bio(osd, inode, iobuf);
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="59561" author="bzzz" created="Wed, 29 May 2013 17:59:15 +0000"  >&lt;p&gt;given iobuf has been used before in few places and proximity of 0x36 to:&lt;br/&gt;
(gdb) p/x &amp;amp;((struct fsfilt_operations *)0)-&amp;gt;fs_map_inode_pages&lt;br/&gt;
$3 = 0x38&lt;/p&gt;

&lt;p&gt;I tend to think it&apos;s osd-&amp;gt;od_fsops being NULL. though I&apos;d expect struct fsfilt_operations to have exact offsets..&lt;/p&gt;

</comment>
                            <comment id="59562" author="bzzz" created="Wed, 29 May 2013 18:03:09 +0000"  >&lt;p&gt;just discussed with Oleg, seem to be a result of incorrect error handling in osd_mount():&lt;/p&gt;

&lt;p&gt;        o-&amp;gt;od_fsops = fsfilt_get_ops(mt_str(LDD_MT_LDISKFS));&lt;br/&gt;
        if (o-&amp;gt;od_fsops == NULL) {&lt;/p&gt;

&lt;p&gt;while fsfilt_get_ops() can actually return -ENOENT, giving us that -2 offset to 0x38.&lt;/p&gt;
</comment>
                            <comment id="59621" author="bzzz" created="Thu, 30 May 2013 11:25:11 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/6496&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/6496&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="59652" author="green" created="Thu, 30 May 2013 16:46:10 +0000"  >&lt;p&gt;Ok, so the patch fixes the crash. The pressing question I have - how come this path triggered at all? It&apos;s not like there was no fsfilt_ldiskfs.ko module present, right? So how come?&lt;/p&gt;</comment>
                            <comment id="59657" author="simmonsja" created="Thu, 30 May 2013 17:15:10 +0000"  >&lt;p&gt;The second attempt we did not have this problem so fsfilt_ldiskfs.ko was there. Yes its very strange.&lt;/p&gt;</comment>
                            <comment id="63709" author="simmonsja" created="Tue, 6 Aug 2013 12:52:49 +0000"  >&lt;p&gt;This patch was merged in the 2.4 branch. We can close this ticket now.&lt;/p&gt;</comment>
                            <comment id="63711" author="pjones" created="Tue, 6 Aug 2013 13:33:17 +0000"  >&lt;p&gt;Landed for 2.5&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="21195">LU-4025</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvrz3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>8429</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>