<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:13:18 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1080] mds-survey crash</title>
                <link>https://jira.whamcloud.com/browse/LU-1080</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Running on a real machine:&lt;/p&gt;

&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;$ mkfs.lustre --fsname=survey --mdt --index=0 /dev/sda3
$ mount -t lustre /dev/sda3 /mnt
$ thrhi=64 file_count=200000 sh mds-survey
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;It then crashes:&lt;/p&gt;

&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;Build Version: jenkins-arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel-4610-g614
Lustre: Added LNI 10.45.1.8@tcp [8/256/0/180]
Lustre: Accept all, port 988
LDISKFS-fs (sda3): recovery complete
LDISKFS-fs (sda3): mounted filesystem with ordered data mode. Opts:
LDISKFS-fs (sda3): mounted filesystem with ordered data mode. Opts:
Lustre: MGC10.45.1.8@tcp: Reactivating import
Lustre: survey-MDT0000: used disk, loading
Lustre: Echo OBD driver; http://www.lustre.org/
LustreError: 1821:0:(echo_client.c:1810:echo_md_destroy_internal())
Can not unlink child tests: rc = -39
LustreError: 1823:0:(echo_client.c:1810:echo_md_destroy_internal())
Can not unlink child tests1: rc = -39
LustreError: 1831:0:(osd_handler.c:2294:osd_object_ref_del())
ASSERTION((oh)-&amp;gt;ot_declare_ref_del &amp;gt; 0) failed
LustreError: 1831:0:(osd_handler.c:2294:osd_object_ref_del()) LBUG
Pid: 1831, comm: lctl

Call Trace:
 [&amp;lt;ffffffffa038e855&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
 [&amp;lt;ffffffffa038ee95&amp;gt;] lbug_with_loc+0x75/0xe0 [libcfs]
 [&amp;lt;ffffffffa0399d96&amp;gt;] libcfs_assertion_failed+0x66/0x70 [libcfs]
 [&amp;lt;ffffffffa0a1781a&amp;gt;] osd_object_ref_del+0x14a/0x180 [osd_ldiskfs]
 [&amp;lt;ffffffffa096ecbb&amp;gt;] __mdd_ref_del+0x5b/0xa0 [mdd]
 [&amp;lt;ffffffffa09777a2&amp;gt;] mdd_create+0x1ae2/0x2470 [mdd]
 [&amp;lt;ffffffffa051190d&amp;gt;] ? htable_lookup+0xed/0x190 [obdclass]
 [&amp;lt;ffffffffa041b5a9&amp;gt;] ? cfs_hash_bd_add_locked+0x29/0x90 [libcfs]
 [&amp;lt;ffffffff81275894&amp;gt;] ? vsnprintf+0x484/0x5f0
 [&amp;lt;ffffffffa0a6822b&amp;gt;] echo_md_create_internal+0xab/0x4b0 [obdecho]
 [&amp;lt;ffffffff81275a40&amp;gt;] ? sprintf+0x40/0x50
 [&amp;lt;ffffffffa0a6ff40&amp;gt;] echo_md_handler+0x1380/0x1dd0 [obdecho]
 [&amp;lt;ffffffffa040d87e&amp;gt;] ? cfs_mem_cache_free+0xe/0x10 [libcfs]
 [&amp;lt;ffffffffa0a75ae6&amp;gt;] echo_client_iocontrol+0x1c86/0x2a30 [obdecho]
 [&amp;lt;ffffffff81127e77&amp;gt;] ? ____pagevec_lru_add+0x167/0x180
 [&amp;lt;ffffffffa040da13&amp;gt;] ? cfs_alloc+0x63/0x90 [libcfs]
 [&amp;lt;ffffffffa04c0f52&amp;gt;] ? obd_ioctl_getdata+0x172/0x1060 [obdclass]
 [&amp;lt;ffffffffa04d6264&amp;gt;] class_handle_ioctl+0x14d4/0x2340 [obdclass]
 [&amp;lt;ffffffff8120d5df&amp;gt;] ? security_inode_permission+0x1f/0x30
 [&amp;lt;ffffffffa04c0313&amp;gt;] obd_class_ioctl+0x53/0x240 [obdclass]
 [&amp;lt;ffffffff8100bc0e&amp;gt;] ? apic_timer_interrupt+0xe/0x20
 [&amp;lt;ffffffff81189342&amp;gt;] vfs_ioctl+0x22/0xa0
 [&amp;lt;ffffffff811894c9&amp;gt;] ? do_vfs_ioctl+0x69/0x580
 [&amp;lt;ffffffff811894e4&amp;gt;] do_vfs_ioctl+0x84/0x580
 [&amp;lt;ffffffff8100bc0e&amp;gt;] ? apic_timer_interrupt+0xe/0x20
 [&amp;lt;ffffffff81189a61&amp;gt;] sys_ioctl+0x81/0xa0
 [&amp;lt;ffffffff8100b0f2&amp;gt;] system_call_fastpath+0x16/0x1b

Kernel panic - not syncing: LBUG
Pid: 7014, comm: lctl Not tainted 2.6.32-220.el6_lustre.x86_64 #1
Call Trace:
 [&amp;lt;ffffffff814ec701&amp;gt;] ? panic+0x78/0x143
 [&amp;lt;ffffffffa040ceeb&amp;gt;] ? lbug_with_loc+0xcb/0xe0 [libcfs]
 [&amp;lt;ffffffffa0417d96&amp;gt;] ? libcfs_assertion_failed+0x66/0x70 [libcfs]
 [&amp;lt;ffffffffa0a1781a&amp;gt;] ? osd_object_ref_del+0x14a/0x180 [osd_ldiskfs]
 [&amp;lt;ffffffffa096ecbb&amp;gt;] ? __mdd_ref_del+0x5b/0xa0 [mdd]
 [&amp;lt;ffffffffa09777a2&amp;gt;] ? mdd_create+0x1ae2/0x2470 [mdd]
 [&amp;lt;ffffffffa051190d&amp;gt;] ? htable_lookup+0xed/0x190 [obdclass]
 [&amp;lt;ffffffffa041b5a9&amp;gt;] ? cfs_hash_bd_add_locked+0x29/0x90 [libcfs]
 [&amp;lt;ffffffff81275894&amp;gt;] ? vsnprintf+0x484/0x5f0
 [&amp;lt;ffffffffa0a6822b&amp;gt;] ? echo_md_create_internal+0xab/0x4b0 [obdecho]
 [&amp;lt;ffffffff81275a40&amp;gt;] ? sprintf+0x40/0x50
 [&amp;lt;ffffffffa0a6ff40&amp;gt;] ? echo_md_handler+0x1380/0x1dd0 [obdecho]
 [&amp;lt;ffffffffa040d87e&amp;gt;] ? cfs_mem_cache_free+0xe/0x10 [libcfs]
 [&amp;lt;ffffffffa0a75ae6&amp;gt;] ? echo_client_iocontrol+0x1c86/0x2a30 [obdecho]
 [&amp;lt;ffffffff81127e77&amp;gt;] ? ____pagevec_lru_add+0x167/0x180
 [&amp;lt;ffffffffa040da13&amp;gt;] ? cfs_alloc+0x63/0x90 [libcfs]
 [&amp;lt;ffffffffa04c0f52&amp;gt;] ? obd_ioctl_getdata+0x172/0x1060 [obdclass]
 [&amp;lt;ffffffffa04d6264&amp;gt;] ? class_handle_ioctl+0x14d4/0x2340 [obdclass]
 [&amp;lt;ffffffff8120d5df&amp;gt;] ? security_inode_permission+0x1f/0x30
 [&amp;lt;ffffffffa04c0313&amp;gt;] ? obd_class_ioctl+0x53/0x240 [obdclass]
 [&amp;lt;ffffffff8100bc0e&amp;gt;] ? apic_timer_interrupt+0xe/0x20
 [&amp;lt;ffffffff81189342&amp;gt;] ? vfs_ioctl+0x22/0xa0
 [&amp;lt;ffffffff811894c9&amp;gt;] ? do_vfs_ioctl+0x69/0x580
 [&amp;lt;ffffffff811894e4&amp;gt;] ? do_vfs_ioctl+0x84/0x580
 [&amp;lt;ffffffff8100bc0e&amp;gt;] ? apic_timer_interrupt+0xe/0x20
 [&amp;lt;ffffffff81189a61&amp;gt;] ? sys_ioctl+0x81/0xa0
 [&amp;lt;ffffffff8100b0f2&amp;gt;] ? system_call_fastpath+0x16/0x1b
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment>Lustre Master</environment>
        <key id="13130">LU-1080</key>
            <summary>mds-survey crash</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="5">Cannot Reproduce</resolution>
                                        <assignee username="bzzz">Alex Zhuravlev</assignee>
                                    <reporter username="rhenwood">Richard Henwood</reporter>
                        <labels>
                    </labels>
                <created>Wed, 8 Feb 2012 12:29:37 +0000</created>
                <updated>Tue, 21 Feb 2012 13:00:54 +0000</updated>
                            <resolved>Tue, 21 Feb 2012 13:00:54 +0000</resolved>
                                    <version>Lustre 2.2.0</version>
                                    <fixVersion>Lustre 2.2.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="28181" author="rhenwood" created="Wed, 8 Feb 2012 12:49:54 +0000"  >&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;# rpm -qa | grep lustre
lustre-ldiskfs-3.3.0-2.6.32_220.el6_lustre.x86_64_g61f62a1.x86_64
kernel-2.6.32-131.17.1.el6_lustre.g60f4e35.x86_64
kernel-2.6.32-220.el6_lustre.x86_64
lustre-modules-2.1.55-2.6.32_220.el6_lustre.x86_64_g61f62a1.x86_64
lustre-tests-2.1.55-2.6.32_220.el6_lustre.x86_64_g61f62a1.x86_64
kernel-firmware-2.6.32-220.el6_lustre.x86_64
lustre-2.1.55-2.6.32_220.el6_lustre.x86_64_g61f62a1.x86_64
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;mds-survey and libecho are from:&lt;br/&gt;
&lt;a href=&quot;http://review.whamcloud.com/#change,1969,patchset=7&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,1969,patchset=7&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="28222" author="di.wang" created="Thu, 9 Feb 2012 01:44:38 +0000"  >&lt;p&gt;This seems to be related to the recent osd API change rather than to an mds-survey bug. Richard, it seems mdd_create failed somewhere. Could you find the debug log for the LBUG?&lt;/p&gt;</comment>
                            <comment id="28223" author="di.wang" created="Thu, 9 Feb 2012 01:44:58 +0000"  >&lt;p&gt;Assign this to Alex.&lt;/p&gt;</comment>
                            <comment id="28500" author="adilger" created="Mon, 13 Feb 2012 12:55:18 +0000"  >&lt;p&gt;Alex, do you have any ideas on how this might be fixed?&lt;/p&gt;</comment>
                            <comment id="28501" author="bzzz" created="Mon, 13 Feb 2012 12:56:52 +0000"  >&lt;p&gt;Sorry, still thinking about how to solve this easily...&lt;/p&gt;</comment>
                            <comment id="29161" author="niu" created="Thu, 16 Feb 2012 22:20:16 +0000"  >&lt;p&gt;Since we can&apos;t declare undo operations, I&apos;ve removed this LASSERT in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-993&quot; title=&quot;1.8&amp;lt;-&amp;gt;2.1.54 Test failure on test suite sanity :osd_attr_set()) ASSERTION((oh)-&amp;gt;ot_declare_attr_set &amp;gt; 0) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-993&quot;&gt;&lt;del&gt;LU-993&lt;/del&gt;&lt;/a&gt; (see ec20be97b9f977d3f4944523baaffb1bf95cf76c &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-993&quot; title=&quot;1.8&amp;lt;-&amp;gt;2.1.54 Test failure on test suite sanity :osd_attr_set()) ASSERTION((oh)-&amp;gt;ot_declare_attr_set &amp;gt; 0) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-993&quot;&gt;&lt;del&gt;LU-993&lt;/del&gt;&lt;/a&gt; osd: code cleanup for directory nlink count), but I&apos;m not sure why the echo create failed. Is it a normal failure during the test?&lt;/p&gt;</comment>
                            <comment id="29179" author="bzzz" created="Fri, 17 Feb 2012 01:01:54 +0000"  >&lt;p&gt;well, we can declare undo ops - that could be the easiest solution, but that results in more credits.&lt;/p&gt;

&lt;p&gt;btw, any idea why &lt;span class=&quot;error&quot;&gt;&amp;#91;DTO_INDEX_DELETE&amp;#93;&lt;/span&gt; = 16? ldiskfs never shrinks a directory, nor does it update neighbor blocks during entry removal,&lt;br/&gt;
nor does it change quota usage. I&apos;d think 1 should be enough?&lt;/p&gt;
</comment>
                            <comment id="29197" author="niu" created="Fri, 17 Feb 2012 05:32:54 +0000"  >&lt;blockquote&gt;
&lt;p&gt;btw, any idea why &lt;span class=&quot;error&quot;&gt;&amp;#91;DTO_INDEX_DELETE&amp;#93;&lt;/span&gt; = 16? ldiskfs never shrinks a directory, nor does it update neighbor blocks during entry removal,&lt;br/&gt;
nor does it change quota usage. I&apos;d think 1 should be enough?&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;I have no idea why it was 16. You are the ext3/4 expert, so I believe you are right &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.whamcloud.com/images/icons/emoticons/smile.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;; 1 block could be enough.&lt;/p&gt;</comment>
                            <comment id="29198" author="bzzz" created="Fri, 17 Feb 2012 05:42:11 +0000"  >&lt;p&gt;I&apos;m beginning to think that if deletion costs 1 credit, then setting nlink takes 1..2 more credits,&lt;br/&gt;
so 2..3 additional credits for the undo path probably won&apos;t hurt us, at least in the mdd_create() case.&lt;br/&gt;
Another (enormous) case is mdd_rename(): it&apos;ll take more credits for undo, but that is probably still&lt;br/&gt;
acceptable. And at some point we&apos;re going to change the approach to be more object-based than&lt;br/&gt;
just summing ops.&lt;/p&gt;</comment>
                            <comment id="29452" author="pjones" created="Sun, 19 Feb 2012 22:23:28 +0000"  >&lt;p&gt;Andreas/Johann&lt;/p&gt;

&lt;p&gt;Could one of you please comment on this 2.2 blocker?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="29482" author="rhenwood" created="Mon, 20 Feb 2012 11:13:28 +0000"  >&lt;p&gt;FYI: I have been running a more recent Lustre and I have not been able to reproduce this issue.&lt;/p&gt;

&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;# rpm -qa | grep lustre
lustre-ldiskfs-3.3.0-2.6.32_220.el6_lustre.gfd1c51d.x86_64_g0204171.x86_64
kernel-2.6.32-220.el6_lustre.gfd1c51d.x86_64
lustre-modules-2.1.55-2.6.32_220.el6_lustre.gfd1c51d.x86_64_g0204171.x86_64
kernel-firmware-2.6.32-220.el6_lustre.gfd1c51d.x86_64
lustre-2.1.55-2.6.32_220.el6_lustre.gfd1c51d.x86_64_g0204171.x86_64
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;These rpms are from build 480:&lt;br/&gt;
&lt;a href=&quot;http://build.whamcloud.com/job/lustre-master/480/arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://build.whamcloud.com/job/lustre-master/480/arch=x86_64,build_type=server,distro=el6,ib_stack=inkernel/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="29512" author="adilger" created="Tue, 21 Feb 2012 13:00:33 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-1082&quot; title=&quot;add mds-survey to lustre-tests&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-1082&quot;&gt;&lt;del&gt;LU-1082&lt;/del&gt;&lt;/a&gt; is tracking the test that will run mds-survey during normal testing to ensure it keeps working.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="13134">LU-1082</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvhef:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6468</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>