<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:20:16 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1854] system crash when reading the file /proc/fs/lustre/ost/OSS/ost_create/req_history</title>
                <link>https://jira.whamcloud.com/browse/LU-1854</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;The lustre version is 2.2.93.&lt;/p&gt;

&lt;p&gt;When reading the file /proc/fs/lustre/ost/OSS/ost_create/req_history, the system crashed with LBUG ASSERTION( !list_empty(&amp;amp;svcpt-&amp;gt;scp_hist_reqs).&lt;/p&gt;

&lt;p&gt;Here are some information from the core dump&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;      KERNEL: /usr/lib/debug/lib/modules/2.6.32-220.23.1.bl6.Bull.28.8.x86_64/vmlinux
    DUMPFILE: /var/crash/127.0.0.1-2012-09-07-09:51:01/vmcore  [PARTIAL DUMP]
        CPUS: 16
        DATE: Fri Sep  7 09:50:45 2012
      UPTIME: 1 days, 19:58:49
LOAD AVERAGE: 0.05, 0.05, 0.05
       TASKS: 1006
    NODENAME: mo88
     RELEASE: 2.6.32-220.23.1.bl6.Bull.28.8.x86_64
     VERSION: #1 SMP Thu Jul 5 17:34:18 CEST 2012
     MACHINE: x86_64  (2199 Mhz)
      MEMORY: 32 GB
       PANIC: &quot;Kernel panic - not syncing: LBUG&quot;
         PID: 29617
     COMMAND: &quot;cat&quot;
        TASK: ffff8806e65437d0  [THREAD_INFO: ffff8804dbf1c000]
         CPU: 9
       STATE: TASK_RUNNING (PANIC)

crash&amp;gt; bt
PID: 29617  TASK: ffff8806e65437d0  CPU: 9   COMMAND: &quot;cat&quot;
 #0 [ffff8804dbf1fbf0] machine_kexec at ffffffff8102895b
 #1 [ffff8804dbf1fc50] crash_kexec at ffffffff810a4622
 #2 [ffff8804dbf1fd20] panic at ffffffff81484647
 #3 [ffff8804dbf1fda0] lbug_with_loc at ffffffffa0680f6b [libcfs]
 #4 [ffff8804dbf1fdc0] ptlrpc_lprocfs_svc_req_history_seek at ffffffffa0c30104 [ptlrpc]
 #5 [ffff8804dbf1fdd0] ptlrpc_lprocfs_svc_req_history_next at ffffffffa0c301e1 [ptlrpc]
 #6 [ffff8804dbf1fe20] seq_read at ffffffff81185e9a
 #7 [ffff8804dbf1fea0] proc_reg_read at ffffffff811c84ee
 #8 [ffff8804dbf1fef0] vfs_read at ffffffff81163a15
 #9 [ffff8804dbf1ff30] sys_read at ffffffff81163b51
#10 [ffff8804dbf1ff80] system_call_fastpath at ffffffff810030f2
    RIP: 0000003dc64d83f0  RSP: 00007fff6cb0c9e0  RFLAGS: 00010206
    RAX: 0000000000000000  RBX: ffffffff810030f2  RCX: 00000000024a7030
    RDX: 0000000000008000  RSI: 000000000249f000  RDI: 0000000000000003
    RBP: 000000000249f000   R8: 0000000000000003   R9: 0000000001000000
    R10: 0000000000008fff  R11: 0000000000000246  R12: ffffffffffff8000
    R13: 0000000000000003  R14: 0000000000008000  R15: 0000000000000003
    ORIG_RAX: 0000000000000000  CS: 0033  SS: 002b

crash&amp;gt; dmesg | tail -n 50
Lustre: fsperf-OST0005: Now serving fsperf-OST0005 on /dev/dm-11 with recovery enabled
Lustre: 27386:0:(ldlm_lib.c:2110:target_recovery_init()) RECOVERY: service fsperf-OST000a, 1 recoverable clients, last_transno 1340929
Lustre: 27386:0:(ldlm_lib.c:2110:target_recovery_init()) Skipped 3 previous similar messages
Lustre: fsperf-OST000a: Now serving fsperf-OST000a on /dev/dm-26 with recovery enabled
Lustre: Skipped 3 previous similar messages
Lustre: 27419:0:(ldlm_lib.c:2110:target_recovery_init()) RECOVERY: service fsperf-OST0001, 1 recoverable clients, last_transno 1340929
Lustre: 27419:0:(ldlm_lib.c:2110:target_recovery_init()) Skipped 6 previous similar messages
Lustre: fsperf-OST0001: Now serving fsperf-OST0001 on /dev/dm-16 with recovery enabled
Lustre: Skipped 6 previous similar messages
LustreError: 137-5: UUID &apos;fsperf-OST000f_UUID&apos; is not available for connect (no target)
Lustre: fsperf-OST0001: Will be in recovery for at least 5:00, or until 1 client reconnects
Lustre: fsperf-OST000b: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted.
Lustre: fsperf-OST000b: received MDS connection from 32.0.0.39@o2ib1
Lustre: Skipped 14 previous similar messages
Lustre: fsperf-OST0006: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted.
Lustre: fsperf-OST000e: received MDS connection from 32.0.0.39@o2ib1
Lustre: Skipped 13 previous similar messages
Lustre: Echo OBD driver; http://www.lustre.org/
mlx4_core 0000:04:00.0: vpd r/w failed.  This is likely a firmware bug on this device.  Contact the card vendor for a firmware update.
mlx4_core 0000:82:00.0: vpd r/w failed.  This is likely a firmware bug on this device.  Contact the card vendor for a firmware update.
process `cat&apos; is using deprecated sysctl (syscall) net.ipv6.neigh.default.retrans_time; Use net.ipv6.neigh.default.retrans_time_ms instead.
LustreError: 29617:0:(lproc_ptlrpc.c:431:ptlrpc_lprocfs_svc_req_history_seek()) ASSERTION( !list_empty(&amp;amp;svcpt-&amp;gt;scp_hist_reqs) ) failed: 
LustreError: 29617:0:(lproc_ptlrpc.c:431:ptlrpc_lprocfs_svc_req_history_seek()) LBUG
Pid: 29617, comm: cat

Call Trace:
 [&amp;lt;ffffffffa0680905&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
 [&amp;lt;ffffffffa0680f17&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
 [&amp;lt;ffffffffa0c30104&amp;gt;] ptlrpc_lprocfs_svc_req_history_seek+0xf4/0x100 [ptlrpc]
 [&amp;lt;ffffffffa0c301e1&amp;gt;] ptlrpc_lprocfs_svc_req_history_next+0x71/0x1b0 [ptlrpc]
 [&amp;lt;ffffffff81185e9a&amp;gt;] seq_read+0x24a/0x3f0
 [&amp;lt;ffffffff811c84ee&amp;gt;] proc_reg_read+0x7e/0xc0
 [&amp;lt;ffffffff81163a15&amp;gt;] vfs_read+0xb5/0x1a0
 [&amp;lt;ffffffff810c0e1a&amp;gt;] ? audit_syscall_entry+0x26a/0x290
 [&amp;lt;ffffffff81163b51&amp;gt;] sys_read+0x51/0x90
 [&amp;lt;ffffffff810030f2&amp;gt;] system_call_fastpath+0x16/0x1b

Kernel panic - not syncing: LBUG
Pid: 29617, comm: cat Not tainted 2.6.32-220.23.1.bl6.Bull.28.8.x86_64 #1
Call Trace:
 [&amp;lt;ffffffff81484640&amp;gt;] ? panic+0x78/0x143
 [&amp;lt;ffffffffa0680f6b&amp;gt;] ? lbug_with_loc+0x9b/0xb0 [libcfs]
 [&amp;lt;ffffffffa0c30104&amp;gt;] ? ptlrpc_lprocfs_svc_req_history_seek+0xf4/0x100 [ptlrpc]
 [&amp;lt;ffffffffa0c301e1&amp;gt;] ? ptlrpc_lprocfs_svc_req_history_next+0x71/0x1b0 [ptlrpc]
 [&amp;lt;ffffffff81185e9a&amp;gt;] ? seq_read+0x24a/0x3f0
 [&amp;lt;ffffffff811c84ee&amp;gt;] ? proc_reg_read+0x7e/0xc0
 [&amp;lt;ffffffff81163a15&amp;gt;] ? vfs_read+0xb5/0x1a0
 [&amp;lt;ffffffff810c0e1a&amp;gt;] ? audit_syscall_entry+0x26a/0x290
 [&amp;lt;ffffffff81163b51&amp;gt;] ? sys_read+0x51/0x90
 [&amp;lt;ffffffff810030f2&amp;gt;] ? system_call_fastpath+0x16/0x1b

crash&amp;gt; files
PID: 29617  TASK: ffff8806e65437d0  CPU: 9   COMMAND: &quot;cat&quot;
ROOT: /    CWD: /root
 FD       FILE            DENTRY           INODE       TYPE PATH
  0 ffff88045cba6180 ffff880217b5b800 ffff88046a2e89c8 FIFO 
  1 ffff88045cba6600 ffff880217b5bbc0 ffff8802ace6e148 FIFO 
  2 ffff88045cba6cc0 ffff880217b5b380 ffff88023fd6a048 FIFO 
  3 ffff880872bc5240 ffff880519514480 ffff880874370d38 REG  /proc/fs/lustre/ost/OSS/ost_create/req_history


&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;I can provide additional information from the dump if needed.&lt;/p&gt;</description>
                <environment>lustre 2.2.93&lt;br/&gt;
bullxlinux distribution (based on redhat 6.2)&lt;br/&gt;
kernel 2.6.32-220</environment>
        <key id="15732">LU-1854</key>
            <summary>system crash when reading the file /proc/fs/lustre/ost/OSS/ost_create/req_history</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="liang">Liang Zhen</assignee>
                                    <reporter username="pichong">Gregoire Pichon</reporter>
                        <labels>
                            <label>crash</label>
                            <label>procfs</label>
                    </labels>
                <created>Fri, 7 Sep 2012 04:31:45 +0000</created>
                <updated>Thu, 18 Apr 2013 20:18:53 +0000</updated>
                            <resolved>Mon, 17 Sep 2012 22:23:01 +0000</resolved>
                                    <version>Lustre 2.3.0</version>
                                    <fixVersion>Lustre 2.4.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="44366" author="adilger" created="Fri, 7 Sep 2012 12:44:12 +0000"  >&lt;p&gt;Probably related to changes from SMP scaling   &lt;/p&gt;</comment>
                            <comment id="44477" author="liang" created="Mon, 10 Sep 2012 04:03:44 +0000"  >&lt;p&gt;I suspect this bug is there for long time: ptlrpc_server_drop_request() will never track culled request seq if the req::rq_reply_state is preallocated, it will just unlink the request from history list.&lt;/p&gt;

&lt;p&gt;here is the patch: &lt;a href=&quot;http://review.whamcloud.com/#change,3916&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,3916&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="45105" author="liang" created="Mon, 17 Sep 2012 22:23:01 +0000"  >&lt;p&gt;patch landed&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzv5lr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>4442</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>