<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:34:20 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3487] Oops in vvp_io_fault_iter_init()</title>
                <link>https://jira.whamcloud.com/browse/LU-3487</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Running racer.sh 2.4.50-79-gaed8203 with some local patches (&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3072&quot; title=&quot;add more operations to racer&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3072&quot;&gt;&lt;del&gt;LU-3072&lt;/del&gt;&lt;/a&gt;, &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3348&quot; title=&quot;racer&amp;#39;s file_link doesn&amp;#39;t&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3348&quot;&gt;&lt;del&gt;LU-3348&lt;/del&gt;&lt;/a&gt;, &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3233&quot; title=&quot;tgt_cb_last_committed()) ASSERTION( ccb-&amp;gt;llcc_exp-&amp;gt;exp_obd == ccb-&amp;gt;llcc_tgt-&amp;gt;lut_obd ) failed: &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3233&quot;&gt;&lt;del&gt;LU-3233&lt;/del&gt;&lt;/a&gt;, &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3448&quot; title=&quot;osc_page_delete()) ASSERTION(0) failed running racer&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3448&quot;&gt;&lt;del&gt;LU-3448&lt;/del&gt;&lt;/a&gt;) I can reproduce the following oops in vvp_io_fault_iter_init():&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00000100:00100000:0.0:1371755423.271136:0:30780:0:(client.c:1805:ptlrpc_check_set()) Completed RPC pname:cluuid:pid:xid:nid:opc 19:1868864a-63ce-938d-b909-2dada17703ae:30780:1438389041886196:0@lo:49
00000080:00020000:0.0:1371755423.271146:0:30780:0:(vvp_io.c:1241:vvp_io_init()) lustre: refresh file layout [0x2c0000401:0x3467:0x0] error -13.

BUG: unable to handle kernel NULL pointer dereference at 00000000000000b0
IP: [&amp;lt;ffffffffa0ce2bbc&amp;gt;] vvp_io_fault_iter_init+0x4c/0xc0 [lustre]
...
Pid: 30780, comm: 19 Tainted: P           ---------------    2.6.32-279.19.1.el6_lustre_gcov.x86_64 #1 Bochs Bochs
RIP: 0010:[&amp;lt;ffffffffa0ce2bbc&amp;gt;]  [&amp;lt;ffffffffa0ce2bbc&amp;gt;] vvp_io_fault_iter_init+0x4c/0xc0 [lustre]
RSP: 0018:ffff88014196daf8  EFLAGS: 00010292
RAX: 0000000000000000 RBX: ffff88016554b870 RCX: 0000000000000000
RDX: ffff880160625400 RSI: ffffffffa0d0f0a0 RDI: ffff8801647e4ca0
RBP: ffff88014196db18 R08: 0000000000000000 R09: ffff880164fe68c8
R10: 0000000000000003 R11: 0000000000000000 R12: ffff8801647e4c68
R13: ffff880147805738 R14: ffff88016554c610 R15: ffff88014196dbd8
FS:  00007f6d52008700(0000) GS:ffff880028200000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000001eac45c CR3: 0000000141551000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process 19 (pid: 30780, threadinfo ffff88014196c000, task ffff88016547e040)
Stack:
 ffff88016554b870 ffff880164fe68c8 ffff8801647e4c68 ffff88014196dbc8
&amp;lt;d&amp;gt; ffff88014196db48 ffffffffa04a529d ffff88016547e040 ffff880164fe68c8
&amp;lt;d&amp;gt; ffff8801647e4c68 ffff88016356b148 ffff88014196db78 ffffffffa04a9cec
Call Trace:
 [&amp;lt;ffffffffa04a529d&amp;gt;] cl_io_iter_init+0x5d/0x110 [obdclass]
 [&amp;lt;ffffffffa04a9cec&amp;gt;] cl_io_loop+0x4c/0x1b0 [obdclass]
 [&amp;lt;ffffffffa0cc6552&amp;gt;] ll_fault+0x2c2/0x4d0 [lustre]
 [&amp;lt;ffffffff8113bd54&amp;gt;] __do_fault+0x54/0x510
 [&amp;lt;ffffffff81128750&amp;gt;] ? __lru_cache_add+0x40/0x90
 [&amp;lt;ffffffff8113c307&amp;gt;] handle_pte_fault+0xf7/0xb50
 [&amp;lt;ffffffff81278cec&amp;gt;] ? __bitmap_weight+0x8c/0xb0
 [&amp;lt;ffffffff8116ba07&amp;gt;] ? mem_cgroup_update_file_mapped+0x17/0x90
 [&amp;lt;ffffffff8114536a&amp;gt;] ? page_remove_rmap+0x7a/0xa0
 [&amp;lt;ffffffff8113cf9a&amp;gt;] handle_mm_fault+0x23a/0x310
 [&amp;lt;ffffffff810432d9&amp;gt;] __do_page_fault+0x139/0x480
 [&amp;lt;ffffffff81196b40&amp;gt;] ? mntput_no_expire+0x30/0x110
 [&amp;lt;ffffffff811793e1&amp;gt;] ? __fput+0x1a1/0x210
 [&amp;lt;ffffffff8113fcee&amp;gt;] ? remove_vma+0x6e/0x90
 [&amp;lt;ffffffff814f0f5e&amp;gt;] do_page_fault+0x3e/0xa0
 [&amp;lt;ffffffff814ee315&amp;gt;] page_fault+0x25/0x30
Code: 89 f3 49 89 fc e8 65 ff ff ff 48 8b 7b 08 49 89 c6 e8 69 71 ff ff 48 89 de 4c 89 e7 49 89 c5 e8 db 96 ff ff 48 8b 80 a8 00 00 00 &amp;lt;48&amp;gt; 8b 80 b0 00 00 00 48 8b 40 18 4c 3b 68 10 75 22 49 8b 85 80 
RIP  [&amp;lt;ffffffffa0ce2bbc&amp;gt;] vvp_io_fault_iter_init+0x4c/0xc0 [lustre]
 RSP &amp;lt;ffff88014196daf8&amp;gt;
CR2: 00000000000000b0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The oops occurs while evaluating the assertion&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LASSERT(inode == cl2ccc_io(env, ios)-&amp;gt;cui_fd-&amp;gt;fd_file-&amp;gt;f_dentry-&amp;gt;d_inode);
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;because the ccc_io has a NULL cui_fd member:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; p *((struct cl_io *)0xffff880164fe68c8)
$3 = {
  ci_type = CIT_FAULT,
  ci_state = CIS_ZERO,
  ci_obj = 0xffff880142b48148,
  ci_parent = 0x0,
  ci_layers = {
    next = 0xffff88016554b888,
    prev = 0xffff88016554b888
  },
  ...
  },
  ci_lockreq = CILR_MANDATORY,
  u = {
    ...
    ci_fault = {
      ft_index = 3,
      ft_nob = 0,
      ft_writable = 0,
      ft_executable = 4,
      ft_mkwrite = 0,
      ft_page = 0x0
    },
    ...
  },
  ...
  ci_nob = 0,
  ci_result = 0,
  ci_continue = 0,
  ci_no_srvlock = 0,
  ci_need_restart = 0,
  ci_ignore_layout = 0,
  ci_verify_layout = 0,
  ci_owned_nr = 0
}

crash&amp;gt; p *((struct ccc_io *)0xffff88016554b870)
$9 = {
  cui_cl = {
    cis_io = 0xffff880164fe68c8,
    cis_obj = 0xffff880142b48148,
    cis_iop = 0xffffffffa0ce8200,
    cis_linkage = {
      next = 0xffff880164fe68e0,
      prev = 0xffff880164fe68e0
    }
  },
  cui_link = {
    cill_linkage = {
      next = 0x0,
      prev = 0x0
    },
    cill_descr = {
      cld_obj = 0x0,
      cld_start = 0,
      cld_end = 0,
      cld_gid = 0,
      cld_mode = CLM_PHANTOM,
      cld_enq_flags = 0
    },
    cill_lock = 0x0,
    cill_fini = 0
  },
  cui_iov = 0x0,
  cui_nrsegs = 0,
  cui_tot_nrsegs = 0,
  cui_iov_olen = 0,
  cui_tot_count = 0,
  u = {
    setattr = {
      cui_local_lock = SETATTR_NOLOCK
    }
  },
  cui_glimpse = 0,
  cui_layout_gen = 4294967294,
  cui_fd = 0x0,
  cui_iocb = 0x0
}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;I&apos;m not sure why MDS_GETXATTR is returning -EACCES (the error -13 logged by vvp_io_init() above), but from here it&apos;s easy to see that ll_fault_io_init() fails to handle the subsequent error from cl_io_init() and returns an io without cui_fd being set.&lt;/p&gt;</description>
                <environment></environment>
        <key id="19501">LU-3487</key>
            <summary>Oops in vvp_io_fault_iter_init()</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="jhammond">John Hammond</assignee>
                                    <reporter username="jhammond">John Hammond</reporter>
                        <labels>
                            <label>llite</label>
                            <label>sdsc</label>
                    </labels>
                <created>Thu, 20 Jun 2013 19:50:30 +0000</created>
                <updated>Mon, 29 Sep 2014 02:08:52 +0000</updated>
                            <resolved>Fri, 28 Jun 2013 19:25:37 +0000</resolved>
                                    <version>Lustre 2.5.0</version>
                    <version>Lustre 2.4.3</version>
                                    <fixVersion>Lustre 2.5.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                            <comments>
                            <comment id="61005" author="jhammond" created="Fri, 21 Jun 2013 16:16:41 +0000"  >&lt;p&gt;Please see &lt;a href=&quot;http://review.whamcloud.com/6735&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/6735&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="61509" author="jhammond" created="Fri, 28 Jun 2013 19:25:37 +0000"  >&lt;p&gt;Patch landed to master.&lt;/p&gt;</comment>
                            <comment id="70359" author="m.magrys" created="Thu, 31 Oct 2013 13:28:38 +0000"  >&lt;p&gt;I think we hit the same bug with 2.4.X:&lt;/p&gt;

&lt;p&gt;Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa138825c&amp;gt;&amp;#93;&lt;/span&gt;: vvp_io_fault_iter_init+0x4c/0xc0 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus Call[]: Trace:&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0f8186d&amp;gt;&amp;#93;&lt;/span&gt;: cl_io_iter_init+0x5d/0x110 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0f85e0c&amp;gt;&amp;#93;&lt;/span&gt;: cl_io_loop+0x4c/0x1b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;obdclass&amp;#93;&lt;/span&gt;&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa136b6c2&amp;gt;&amp;#93;&lt;/span&gt;: ll_fault+0x2c2/0x4d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81143294&amp;gt;&amp;#93;&lt;/span&gt;: __do_fault+0x54/0x530&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8112fdd0&amp;gt;&amp;#93;&lt;/span&gt;: ? __lru_cache_add+0x40/0x90&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81143867&amp;gt;&amp;#93;&lt;/span&gt;: handle_pte_fault+0xf7/0xb50&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa1381171&amp;gt;&amp;#93;&lt;/span&gt;: ? cl2ccc_io+0x21/0xa0 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa13874a5&amp;gt;&amp;#93;&lt;/span&gt;: ? vvp_io_fini+0x25/0x1b0 &lt;span class=&quot;error&quot;&gt;&amp;#91;lustre&amp;#93;&lt;/span&gt;&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff811444fa&amp;gt;&amp;#93;&lt;/span&gt;: handle_mm_fault+0x23a/0x310&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810474e9&amp;gt;&amp;#93;&lt;/span&gt;: __do_page_fault+0x139/0x480&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8100988e&amp;gt;&amp;#93;&lt;/span&gt;: ? __switch_to+0x26e/0x320&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8150e130&amp;gt;&amp;#93;&lt;/span&gt;: ? thread_return+0x4e/0x76e&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81513b6e&amp;gt;&amp;#93;&lt;/span&gt;: do_page_fault+0x3e/0xa0&lt;br/&gt;
Oct 31 10:26:22 &amp;lt;user.notice&amp;gt; n0466-g6l.zeus &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81510f25&amp;gt;&amp;#93;&lt;/span&gt;: page_fault+0x25/0x30&lt;/p&gt;

&lt;p&gt;So it may be worth pushing this patch to b2_4, unless you don&apos;t plan to release 2.4.2.&lt;/p&gt;</comment>
                            <comment id="95151" author="mjo" created="Mon, 29 Sep 2014 02:08:52 +0000"  >&lt;p&gt;We appear to have hit the same bug with 2.4.2, as well.&lt;/p&gt;

&lt;p&gt;Is there a plan to backport this fix to the 2.4 tree?&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvtpb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>8769</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>