<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:10:41 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-815] BUG: unable to handle kernel NULL pointer dereference&quot; in lprocfs_rd_import()</title>
                <link>https://jira.whamcloud.com/browse/LU-815</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We&apos;ve been hitting this problem for several months whe we reading in &quot;/proc/fs/lustre/osc/&amp;lt;OST&amp;gt;/import&quot;.&lt;/p&gt;

&lt;p&gt;I saw there&apos;s maybe a related patch (BZ#22032 - WC&apos;s git: 839280926956f16552194fe803ba21096770ebc4) which was integrated for Lustre-2.1. What do you think of this? If 22032&apos;s patch is not related, then does this sound to you as a know problem?&lt;/p&gt;

&lt;p&gt;==============================================================================&lt;br/&gt;
BUG: unable to handle kernel NULL pointer dereference at 0000000000000018&lt;br/&gt;
IP: &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0482d3d&amp;gt;&amp;#93;&lt;/span&gt; lprocfs_rd_import+0x32d/0x6b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
PGD c7cf9f067 PUD ae9bcc067 PMD 0&lt;br/&gt;
Oops: 0000 &lt;a href=&quot;#1&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;1&lt;/a&gt; SMP&lt;br/&gt;
last sysfs file: /sys/devices/pci0000:00/0000:00:05.0/0000:05:00.0/infiniband/mlx4_0/ports/1/rate&lt;br/&gt;
CPU 5&lt;br/&gt;
Modules linked in: sit(U) tunnel4(U) lmv(U) mgc(U) lustre(U) lov(U) osc(U) mdc(U) lquota(U) fid(U) fld(U) ko2iblnd(U)&lt;br/&gt;
ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) rdma_ucm(U) ib_sdp(U) rdma_cm(U) iw_cm(U) ib_addr(U) ib_ipoib(U) ib_cm(U)&lt;br/&gt;
ib_sa(U) ib_uverbs(U) ib_umad(U) mlx4_ib(U) mlx4_core(U) ib_mthca(U) ib_mad(U) ib_core(U) ipmi_devintf(U) ipmi_si(U)&lt;br/&gt;
ipmi_msghandler(U) iptable_filter(U) ip_tables(U) x_tables(U) nfs(U) lockd(U) fscache(U) nfs_acl(U) auth_rpcgss(U) sunrpc(U)&lt;br/&gt;
acpi_cpufreq(U) freq_table(U) vtune_drv(U) autofs4(U) ipv6(U) sg(U) i7core_edac(U) edac_core(U) i2c_i801(U) i2c_core(U)&lt;br/&gt;
igb(U) ioatdma(U) dca(U) iTCO_wdt(U) iTCO_vendor_support(U) ext3(U) jbd(U) mbcache(U) sd_mod(U) crc_t10dif(U) usbhid(U)&lt;br/&gt;
hid(U) ehci_hcd(U) ahci(U) uhci_hcd(U) dm_mod(U) &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: libcfs&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;Pid: 29413, comm: grep Not tainted 2.6.32-30.el6.Bull.14.x86_64 #1 bullx super-node&lt;br/&gt;
RIP: 0010:&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0482d3d&amp;gt;&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0482d3d&amp;gt;&amp;#93;&lt;/span&gt; lprocfs_rd_import+0x32d/0x6b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
RSP: 0018:ffff8806e57ffd78  EFLAGS: 00010206&lt;br/&gt;
RAX: 0000000000000000 RBX: ffff880c7db5a000 RCX: 0000000000000038&lt;br/&gt;
RDX: ffff880c6fd42105 RSI: 00000000fffffffe RDI: 0000000000000013&lt;br/&gt;
RBP: ffff8806e57ffe38 R08: 0000000000000000 R09: 00000000fffffffe&lt;br/&gt;
R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000&lt;br/&gt;
R13: 0000000000000105 R14: 0000000000000000 R15: 0000000000001000&lt;br/&gt;
FS:  00002b8d09d85f60(0000) GS:ffff88088e440000(0000) knlGS:0000000000000000&lt;br/&gt;
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033&lt;br/&gt;
CR2: 0000000000000018 CR3: 00000009da84e000 CR4: 00000000000006e0&lt;br/&gt;
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000&lt;br/&gt;
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400&lt;br/&gt;
Process grep (pid: 29413, threadinfo ffff8806e57fe000, task ffff8807bf266c50)&lt;br/&gt;
Stack:&lt;br/&gt;
 0000000000000000 ffffea0029a6ddf8 00000200011cce48 00000010e68896a0&lt;br/&gt;
&amp;lt;0&amp;gt; ffff880a4323e948 ffff880c7db5a000 ffff880a4323e438 ffff880c6fd42000&lt;br/&gt;
&amp;lt;0&amp;gt; ffff8806e57ffde8 ffff880880001d80 ffff880c7d7c2300 00000000000000d0&lt;br/&gt;
Call Trace:&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8113e377&amp;gt;&amp;#93;&lt;/span&gt; ? alloc_pages_current+0x87/0xd0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0480651&amp;gt;&amp;#93;&lt;/span&gt; lprocfs_fops_read+0xd1/0x1e0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff811b6a36&amp;gt;&amp;#93;&lt;/span&gt; proc_reg_read+0x76/0xb0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81157f55&amp;gt;&amp;#93;&lt;/span&gt; vfs_read+0xb5/0x1a0&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810c5282&amp;gt;&amp;#93;&lt;/span&gt; ? audit_syscall_entry+0x252/0x280&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81158091&amp;gt;&amp;#93;&lt;/span&gt; sys_read+0x51/0x90&lt;br/&gt;
 &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100c172&amp;gt;&amp;#93;&lt;/span&gt; system_call_fastpath+0x16/0x1b&lt;br/&gt;
Code: 18 08 75 a2 48 8b 9d 68 ff ff ff 66 ff 83 78 02 00 00 48 8b 43 60 44 8b 83 28 02 00 00 44 8b b3 14 01 00 00 44&lt;br/&gt;
8b a3 24 02 00 00 &amp;lt;48&amp;gt; 8b 78 18 44 89 85 58 ff ff ff e8 d3 5e dc ff 49 63 fd 48 03&lt;br/&gt;
RIP  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0482d3d&amp;gt;&amp;#93;&lt;/span&gt; lprocfs_rd_import+0x32d/0x6b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
 RSP &amp;lt;ffff8806e57ffd78&amp;gt;&lt;br/&gt;
==============================================================================&lt;/p&gt;

&lt;p&gt;And further+in-deep analysis clearly indicates this problem comes from a race between a process reading&lt;br/&gt;
&quot;/proc/fs/lustre/osc/&amp;lt;OST&amp;gt;/import&quot; special file via lprocfs layer and other Lustre layers dealing with&lt;br/&gt;
imports.&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;</description>
                <environment>Lustre-2.0, RHEL6.0</environment>
        <key id="12308">LU-815</key>
            <summary>BUG: unable to handle kernel NULL pointer dereference&quot; in lprocfs_rd_import()</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="adilger">Andreas Dilger</assignee>
                                    <reporter username="lustre-bull">Lustre Bull</reporter>
                        <labels>
                    </labels>
                <created>Wed, 2 Nov 2011 12:54:47 +0000</created>
                <updated>Thu, 8 Sep 2016 04:16:59 +0000</updated>
                            <resolved>Thu, 15 Dec 2011 20:00:36 +0000</resolved>
                                                    <fixVersion>Lustre 2.9.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="22297" author="adilger" created="Wed, 2 Nov 2011 15:58:27 +0000"  >&lt;p&gt;Looking at git for 839280926956f16552194fe803ba21096770ebc4, it definitely seems related, but &quot;git describe&quot; shows that this should be included into v2_0_0-rc1a, which means it should be in the Lustre 2.0.0 release already.  Are you running the official 2.0.0 release, or some earlier build?&lt;/p&gt;

&lt;p&gt;The other possibility is that this is related to the patch in &lt;a href=&quot;http://review.whamcloud.com/1544&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/1544&lt;/a&gt; (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-615&quot; title=&quot;sanity test 180a: awk: (FILENAME=/proc/fs/lustre/osc/lustre-OST0000-osc-ffff88032273d800/import FNR=32) fatal error: internal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-615&quot;&gt;&lt;del&gt;LU-615&lt;/del&gt;&lt;/a&gt;), which is fixing the reads from .../import to avoid problems overflowing the page buffer.  This patch has not yet been landed to the master (2.2) release branch, so I would recommend testing it first if you plan to apply it before testing and landing has completed.&lt;/p&gt;</comment>
                            <comment id="22386" author="dmoreno" created="Thu, 3 Nov 2011 07:43:46 +0000"  >&lt;p&gt;Actually I don&apos;t think the patch in 839280926956f16552194fe803ba21096770ebc4 is in the official 2.0.0 release. In git we can see this patch was introduced between 2.0.52.0 and 2.0.53.0 tags so the result shown by git describe is very strange.&lt;/p&gt;

&lt;p&gt;We&apos;re going to integrate this patch in our 2.0.0 (which is the official) and if we still have the problem we&apos;ll try with &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-615&quot; title=&quot;sanity test 180a: awk: (FILENAME=/proc/fs/lustre/osc/lustre-OST0000-osc-ffff88032273d800/import FNR=32) fatal error: internal error&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-615&quot;&gt;&lt;del&gt;LU-615&lt;/del&gt;&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;Thanks Andreas&lt;/p&gt;</comment>
                            <comment id="24824" author="pjones" created="Thu, 15 Dec 2011 09:58:38 +0000"  >&lt;p&gt;Any feedback on this ticket? Have you been able to try the suggested fix yet? If not, when do you expect to be able to do so?&lt;/p&gt;</comment>
                            <comment id="24834" author="sebastien.buisson" created="Thu, 15 Dec 2011 11:16:56 +0000"  >&lt;p&gt;Yes, we integrated the proposed patch, and delivered it to the customer. But we do not have any feedback yet.&lt;/p&gt;</comment>
                            <comment id="24853" author="adilger" created="Thu, 15 Dec 2011 20:00:36 +0000"  >&lt;p&gt;I&apos;m going to mark this fixed in 2.1.0.  Please reopen if the customer hits this problem again.&lt;/p&gt;</comment>
                            <comment id="46006" author="bfaccini" created="Thu, 4 Oct 2012 14:07:09 +0000"  >&lt;p&gt;Humm, even if running with lustre 2.1.1 (including fix for BZ#22032) we can still reproduce the same crash/Oops !! So would like to re-open this JIRA ...&lt;/p&gt;

&lt;p&gt;Again the crash is due to imp-&amp;gt;imp_connection beeing NULL and beeing dereferenced in lprocfs_rd_import().&lt;/p&gt;

&lt;p&gt;So I am back with my earlier fix idea, not choosen by Bull R&amp;amp;D in favor of BZ#22032 at that time ..., where imp-&amp;gt;imp_connection access must be done under imp-&amp;gt;imp_lock protection too and NULL value detected.&lt;/p&gt;

&lt;p&gt;Patch against b2_1 is at &lt;a href=&quot;http://review.whamcloud.com/4187&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/4187&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="46039" author="bfaccini" created="Fri, 5 Oct 2012 02:18:12 +0000"  >&lt;p&gt;Oops, thank&apos;s to Andreas asking me to review code+patches from top-down, starting with master branch !! And bingo, a similar patch has been already applied starting with b2_3, it comes from JIRA &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1448&quot; title=&quot;Disabled OSC can cause NULL pointer dereference when reading import&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1448&quot;&gt;&lt;del&gt;LU-1448&lt;/del&gt;&lt;/a&gt; where the same issue was found for disabled OSCs when in our case it also happen during OSC mount !!&lt;/p&gt;

&lt;p&gt;Patch on master is at &lt;a href=&quot;http://review.whamcloud.com/2995&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/2995&lt;/a&gt;, so it needs to be cherry-picked from there to be applied to b2_1/b2_2 branches.&lt;/p&gt;

&lt;p&gt;In the mean time, should I need to &quot;Abandon&quot; my change on Gerrit by pointing to master change ??&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                    <customfield id="customfield_10020" key="com.atlassian.jira.plugin.system.customfieldtypes:float">
                        <customfieldname>Bugzilla ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>24449.0</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvhs7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6530</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>