<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:09:45 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-7538] file.c:3891:ll_layout_lock_set()) LBUG</title>
                <link>https://jira.whamcloud.com/browse/LU-7538</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;The error occurred during soak testing of master via build &apos;20151209&apos; (see &lt;a href=&quot;https://wiki.hpdd.intel.com/pages/viewpage.action?title=Soak+Testing+on+Lola&amp;amp;spaceKey=Releases#SoakTestingonLola-20151209&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://wiki.hpdd.intel.com/pages/viewpage.action?title=Soak+Testing+on+Lola&amp;amp;spaceKey=Releases#SoakTestingonLola-20151209&lt;/a&gt;). DNE is enabled. MDTs were formatted using &lt;em&gt;ldiskfs&lt;/em&gt;, OSTs using &lt;em&gt;zfs&lt;/em&gt;. MDSes are configured in an active-active HA configuration.&lt;/p&gt;

&lt;p&gt;During normal operations (no fault injected) two Lustre client nodes hit the LBUG listed below:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;&lt;tt&gt;lola-26&lt;/tt&gt; &amp;#8211; &lt;tt&gt;192.168.1.126&lt;/tt&gt;   &amp;#8211; Dec  9 21:41:59&lt;/li&gt;
	&lt;li&gt;&lt;tt&gt;lola-27&lt;/tt&gt; &amp;#8211; &lt;tt&gt;192.168.1.127&lt;/tt&gt;   &amp;#8211; Dec  9 21:41:40
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Dec  9 21:41:40 lola-27 kernel: LustreError: 3786:0:(file.c:3891:ll_layout_lock_set()) ASSERTION( ldlm_has_layout(lock) ) failed: 
Dec  9 21:41:40 lola-27 kernel: LustreError: 3786:0:(file.c:3891:ll_layout_lock_set()) LBUG
Dec  9 21:41:40 lola-27 kernel: Pid: 3786, comm: flush-lustre-1
Dec  9 21:41:40 lola-27 kernel: 
Dec  9 21:41:40 lola-27 kernel: Call Trace:
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa045f875&amp;gt;] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa045fe77&amp;gt;] lbug_with_loc+0x47/0xb0 [libcfs]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a04b89&amp;gt;] ll_layout_lock_set+0xa9/0x1360 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a03b5a&amp;gt;] ? ll_take_md_lock+0xfa/0x4b0 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a08fc1&amp;gt;] ll_layout_refresh_locked+0xe1/0xe00 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa058b7f1&amp;gt;] ? cl_io_slice_add+0xc1/0x190 [obdclass]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a37c20&amp;gt;] ? ll_md_blocking_ast+0x0/0x7d0 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa072f350&amp;gt;] ? ldlm_completion_ast+0x0/0x9b0 [ptlrpc]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0470aa7&amp;gt;] ? cfs_hash_bd_lookup_intent+0x37/0x130 [libcfs]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a09e79&amp;gt;] ll_layout_refresh+0x199/0x300 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa058b7f1&amp;gt;] ? cl_io_slice_add+0xc1/0x190 [obdclass]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a56c8f&amp;gt;] vvp_io_init+0x39f/0x480 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa047377a&amp;gt;] ? cfs_hash_find_or_add+0x9a/0x190 [libcfs]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa058a3a8&amp;gt;] cl_io_init0+0x88/0x150 [obdclass]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa058d4a4&amp;gt;] cl_io_init+0x64/0xe0 [obdclass]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a04022&amp;gt;] cl_sync_file_range+0x112/0x2f0 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffffa0a2cd7c&amp;gt;] ll_writepages+0x9c/0x220 [lustre]
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff81139871&amp;gt;] do_writepages+0x21/0x40
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811bb19d&amp;gt;] writeback_single_inode+0xdd/0x290
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811bb59d&amp;gt;] writeback_sb_inodes+0xbd/0x170
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811bb6fb&amp;gt;] writeback_inodes_wb+0xab/0x1b0
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811bbaf3&amp;gt;] wb_writeback+0x2f3/0x410
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff810880b2&amp;gt;] ? del_timer_sync+0x22/0x30
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811bbdb5&amp;gt;] wb_do_writeback+0x1a5/0x240
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811bbeb3&amp;gt;] bdi_writeback_task+0x63/0x1b0
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff8109eaa7&amp;gt;] ? bit_waitqueue+0x17/0xd0
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff81148620&amp;gt;] ? bdi_start_fn+0x0/0x100
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff811486a6&amp;gt;] bdi_start_fn+0x86/0x100
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff81148620&amp;gt;] ? bdi_start_fn+0x0/0x100
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff8109e78e&amp;gt;] kthread+0x9e/0xc0
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff8100c28a&amp;gt;] child_rip+0xa/0x20
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff8109e6f0&amp;gt;] ? kthread+0x0/0xc0
Dec  9 21:41:40 lola-27 kernel: [&amp;lt;ffffffff8100c280&amp;gt;] ? child_rip+0x0/0x20
Dec  9 21:41:40 lola-27 kernel: 
Dec  9 21:41:40 lola-27 kernel: Kernel panic - not syncing: LBUG
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;The LBUGs correlate in time with errors on the OSS nodes (&lt;tt&gt;lola-2,3&lt;/tt&gt;) of the form:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;lola-2.log:Dec  9 21:41:48 lola-2 kernel: LustreError: 28806:0:(ldlm_lockd.c:689:ldlm_handle_ast_error()) ### client (nid 192.168.1.126@o2ib100) failed to reply to blocking AST (req status 0 rc -11), evict it ns: filter-soaked-OST0004_UUID lock: ffff880377d872c0/0xef6ba6a3129d2917 lrc: 4/0,0 mode: PR/PR res: [0x500000406:0xfa062d:0x0].0x0 rrc: 2 type: EXT [0-&amp;gt;18446744073709551615] (req 0-&amp;gt;18446744073709551615) flags: 0x60000000010020 nid: 192.168.1.126@o2ib100 remote: 0x6044879e61dc398 expref: 33311 pid: 27253 timeout: 4297781214 lvb_type: 1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Several such messages can be found in the attached messages files for both OSS nodes.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;Attached files:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;&lt;tt&gt;lola-26,27&lt;/tt&gt; - messages, console, vmcore-dmesg.txt files&lt;/li&gt;
	&lt;li&gt;&lt;tt&gt;lola-2,3&lt;/tt&gt;     - messages, console files&lt;/li&gt;
&lt;/ul&gt;
</description>
                <environment>lola&lt;br/&gt;
build: &lt;a href=&quot;https://build.hpdd.intel.com/job/lustre-reviews/36149/&quot;&gt;https://build.hpdd.intel.com/job/lustre-reviews/36149/&lt;/a&gt; </environment>
        <key id="33548">LU-7538</key>
            <summary>file.c:3891:ll_layout_lock_set()) LBUG</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="bobijam">Zhenyu Xu</assignee>
                                    <reporter username="heckes">Frank Heckes</reporter>
                        <labels>
                            <label>soak</label>
                    </labels>
                <created>Thu, 10 Dec 2015 08:21:01 +0000</created>
                <updated>Fri, 6 Aug 2021 03:09:36 +0000</updated>
                            <resolved>Fri, 6 Aug 2021 03:09:36 +0000</resolved>
                                    <version>Lustre 2.8.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="135780" author="heckes" created="Thu, 10 Dec 2015 08:32:17 +0000"  >&lt;p&gt;Crash dump files have been saved to &lt;tt&gt;lola-1:/scratch/crashdumps/lu-7538&lt;/tt&gt;: &lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;lola-26-127.0.0.1-2015-12-09-21:42:13&lt;/li&gt;
	&lt;li&gt;lola-27-127.0.0.1-2015-12-09-21:41:54&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="135902" author="pjones" created="Thu, 10 Dec 2015 18:42:19 +0000"  >&lt;p&gt;Bobijam&lt;/p&gt;

&lt;p&gt;Could you please review this failure and assess how serious it is and whether it should be considered a blocker for 2.8?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="135999" author="bobijam" created="Fri, 11 Dec 2015 06:30:57 +0000"  >&lt;p&gt;Can you point out where I can find the corresponding crash dump support files (uncompressed vmlinux file and System.map of the clients lola-26/lola-27, with debug CFLAG turned on)? I want to check the debug log, which has to be extracted from the core dump, and I need these supporting files to do that.&lt;/p&gt;</comment>
                            <comment id="138189" author="heckes" created="Thu, 7 Jan 2016 08:31:36 +0000"  >&lt;p&gt;The crash files have been saved to the scratch file system on cluster &lt;em&gt;lola&lt;/em&gt; as described in the comment above (&lt;a href=&quot;https://jira.hpdd.intel.com/browse/LU-7538?focusedCommentId=135780&amp;amp;page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-135780&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jira.hpdd.intel.com/browse/LU-7538?focusedCommentId=135780&amp;amp;page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-135780&lt;/a&gt;).&lt;br/&gt;
Please let me know if I should upload them to the shadow or onyx cluster, or to any other location, if you don&apos;t have access to lola. I think they&apos;re too big to be attached to this ticket.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                                                <inwardlinks description="is duplicated by">
                                        <issuelink>
            <issuekey id="64816">LU-14780</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="19842" name="console-lola-2.log.bz2" size="41793" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19844" name="console-lola-26.log.bz2" size="34212" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19845" name="console-lola-27.log.bz2" size="34279" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19843" name="console-lola-3.log.bz2" size="40360" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19846" name="lola-26-vmcore-dmesg.txt.bz2" size="18760" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19847" name="lola-27-vmcore-dmesg.txt.bz2" size="18333" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19848" name="messages-lola-2.log.bz2" size="188885" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19850" name="messages-lola-26.log.bz2" size="212158" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19851" name="messages-lola-27.log.bz2" size="211187" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                            <attachment id="19849" name="messages-lola-3.log.bz2" size="200738" author="heckes" created="Thu, 10 Dec 2015 09:00:38 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxvfz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>