<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:48:59 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-5154] Failure on test suite recovery-double-scale test_pairwise_fail</title>
                <link>https://jira.whamcloud.com/browse/LU-5154</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;This issue was created by maloo for sarah &amp;lt;sarah@whamcloud.com&amp;gt;&lt;/p&gt;

&lt;p&gt;This issue relates to the following test suite run: &lt;a href=&quot;http://maloo.whamcloud.com/test_sets/f5b45d64-e1d6-11e3-8cc0-52540035b04c&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://maloo.whamcloud.com/test_sets/f5b45d64-e1d6-11e3-8cc0-52540035b04c&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The sub-test test_pairwise_fail failed with the following error:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;test failed to respond and timed out&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;client 3 dmesg&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Lustre: 1763:0:(llite_lib.c:2738:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.1.4.10@tcp:10.1.4.6@tcp:/lustre/fid: [0x200024621:0x2f14:0x0]/ may get corrupted (rc -108)
Lustre: 1763:0:(llite_lib.c:2738:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.1.4.10@tcp:10.1.4.6@tcp:/lustre/fid: [0x200024621:0x2eec:0x0]/ may get corrupted (rc -108)
Lustre: 1763:0:(llite_lib.c:2738:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.1.4.10@tcp:10.1.4.6@tcp:/lustre/fid: [0x200024621:0x2d28:0x0]/ may get corrupted (rc -108)
Lustre: 1764:0:(llite_lib.c:2738:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.1.4.10@tcp:10.1.4.6@tcp:/lustre/fid: [0x200024621:0x2d86:0x0]/ may get corrupted (rc -108)
Lustre: 1764:0:(llite_lib.c:2738:ll_dirty_page_discard_warn()) lustre: dirty page discard: 10.1.4.10@tcp:10.1.4.6@tcp:/lustre/fid: [0x200024621:0x2d94:0x0]/ may get corrupted (rc -108)
Lustre: lustre-OST0004-osc-ffff88007d400c00: Connection restored to lustre-OST0004 (at 10.1.4.9@tcp)
Lustre: DEBUG MARKER: /usr/sbin/lctl mark                             Failing type2=OST item2=ost5 ... 
Lustre: DEBUG MARKER: Failing type2=OST item2=ost5 ...
INFO: task tar:3715 blocked for more than 120 seconds.
      Not tainted 2.6.32-431.17.1.el6.x86_64 #1
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.
tar           D 0000000000000000     0  3715   3713 0x00000080
 ffff8800704d39e8 0000000000000082 0000000000000000 ffffffff810097cc
 ffff88007b8c4ad8 0000000000000000 00000000004d39a8 ffff8800022143c0
 ffff88007133daf8 ffff8800704d3fd8 000000000000fbc8 ffff88007133daf8
Call Trace:
 [&amp;lt;ffffffff810097cc&amp;gt;] ? __switch_to+0x1ac/0x320
 [&amp;lt;ffffffff81528a95&amp;gt;] schedule_timeout+0x215/0x2e0
 [&amp;lt;ffffffffa0747470&amp;gt;] ? lustre_swab_ost_body+0x0/0x10 [ptlrpc]
 [&amp;lt;ffffffff81528713&amp;gt;] wait_for_common+0x123/0x180
 [&amp;lt;ffffffff81061d00&amp;gt;] ? default_wake_function+0x0/0x20
 [&amp;lt;ffffffff8152882d&amp;gt;] wait_for_completion+0x1d/0x20
 [&amp;lt;ffffffffa0b66c5c&amp;gt;] osc_io_setattr_end+0xbc/0x190 [osc]
 [&amp;lt;ffffffffa0998370&amp;gt;] ? lov_io_end_wrapper+0x0/0x100 [lov]
 [&amp;lt;ffffffffa055e0e0&amp;gt;] cl_io_end+0x60/0x150 [obdclass]
 [&amp;lt;ffffffffa055ec60&amp;gt;] ? cl_io_start+0x0/0x140 [obdclass]
 [&amp;lt;ffffffffa0998461&amp;gt;] lov_io_end_wrapper+0xf1/0x100 [lov]
 [&amp;lt;ffffffffa09981ae&amp;gt;] lov_io_call+0x8e/0x130 [lov]
 [&amp;lt;ffffffffa0999f3c&amp;gt;] lov_io_end+0x4c/0xf0 [lov]
 [&amp;lt;ffffffffa055e0e0&amp;gt;] cl_io_end+0x60/0x150 [obdclass]
 [&amp;lt;ffffffffa0562e52&amp;gt;] cl_io_loop+0xc2/0x1b0 [obdclass]
 [&amp;lt;ffffffffa0a7c4c8&amp;gt;] cl_setattr_ost+0x218/0x2f0 [lustre]
 [&amp;lt;ffffffffa0a4692c&amp;gt;] ll_setattr_raw+0xa2c/0x10d0 [lustre]
 [&amp;lt;ffffffffa0a47035&amp;gt;] ll_setattr+0x65/0xd0 [lustre]
 [&amp;lt;ffffffff811a6fc8&amp;gt;] notify_change+0x168/0x340
 [&amp;lt;ffffffff811bb4ac&amp;gt;] utimes_common+0xdc/0x1b0
 [&amp;lt;ffffffff810ec3fe&amp;gt;] ? call_rcu+0xe/0x10
 [&amp;lt;ffffffff811aa6b0&amp;gt;] ? mntput_no_expire+0x30/0x110
 [&amp;lt;ffffffff811bb650&amp;gt;] do_utimes+0xd0/0x170
 [&amp;lt;ffffffff811bb7f2&amp;gt;] sys_utimensat+0x32/0x90
 [&amp;lt;ffffffff8100b072&amp;gt;] system_call_fastpath+0x16/0x1b
Lustre: 1762:0:(client.c:1914:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1400706578/real 1400706578]  req@ffff8800721e6800 x1468746594693168/t0(0) o8-&amp;gt;lustre-OST0006-osc-ffff88007d400c00@10.1.4.5@tcp:28/4 lens 400/544 e 0 to 1 dl 1400706606 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
Lustre: 1762:0:(client.c:1914:ptlrpc_expire_one_request()) Skipped 29 previous similar messages
Lustre: lustre-OST0000-osc-ffff88007d400c00: Connection to lustre-OST0000 (at 10.1.4.9@tcp) was lost; in progress operations using this service will wait for recovery to complete
Lustre: Skipped 6 previous similar messages
Lustre: lustre-OST0001-osc-ffff88007d400c00: Connection to lustre-OST0001 (at 10.1.4.9@tcp) was lost; in progress operations using this service will wait for recovery to complete
Lustre: lustre-OST0003-osc-ffff88007d400c00: Connection to lustre-OST0003 (at 10.1.4.9@tcp) was lost; in progress operations using this service will wait for recovery to complete
Lustre: lustre-OST0000-osc-ffff88007d400c00: Connection restored to lustre-OST0000 (at 10.1.4.5@tcp)
Lustre: lustre-OST0001-osc-ffff88007d400c00: Connection restored to lustre-OST0001 (at 10.1.4.5@tcp)
INFO: task tar:3715 blocked for more than 120 seconds.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment>client and server: lustre-master build # 2052</environment>
        <key id="25054">LU-5154</key>
            <summary>Failure on test suite recovery-double-scale test_pairwise_fail</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="1" iconUrl="https://jira.whamcloud.com/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="default"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="maloo">Maloo</reporter>
                        <labels>
                    </labels>
                <created>Fri, 6 Jun 2014 17:26:56 +0000</created>
                <updated>Mon, 8 Aug 2016 20:29:02 +0000</updated>
                                            <version>Lustre 2.6.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="86043" author="adilger" created="Fri, 6 Jun 2014 19:22:02 +0000"  >&lt;p&gt;Jinshan or Bobijam, could you please take a look at this bug to see whether it needs to be a blocker for 2.6, and/or what the priority should be?&lt;/p&gt;</comment>
                            <comment id="86142" author="jay" created="Mon, 9 Jun 2014 19:34:06 +0000"  >&lt;p&gt;from the log, it looks like that the client was struggling to reconnect to OSTs but kept failing. The client thread was waiting for the SETATTR REQ to finish so this is where the stack trace came from. At last, the client was evicted and then the REQ could wind up with error.&lt;/p&gt;

&lt;p&gt;This is certainly not a problem on client side.&lt;/p&gt;

&lt;p&gt;The recovery is really complex from what I have seen on the OSS. For some reason, some of OSTs can&apos;t be recovered and caused lots of eviction.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzwnw7:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>14222</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>