<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:41:40 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-11183] sanity test 244 hangs with no information in the logs</title>
                <link>https://jira.whamcloud.com/browse/LU-11183</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;sanity test 244 hangs in recent testing. The last output seen in the test_log is from test 10 or 11, where the test hangs.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;== sanity test 244: sendfile with group lock tests =================================================== 18:46:27 (1531334787)
35+0 records in
35+0 records out
36700160 bytes (37 MB) copied, 0.482129 s, 76.1 MB/s
Starting test test10 at 1531334788

&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;In all cases, the stack_dump is empty, and the only things seen in the console logs and dmesg are the test starting, the node rebooting, and sanity-sec testing starting. There&#8217;s about a one-hour gap between the test&#8217;s last report and the node reboot.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 5769.663115] Lustre: DEBUG MARKER: /usr/sbin/lctl mark == sanity test 244: sendfile with group lock tests =================================================== 18:46:27 \(1531334787\)
[ 5769.858388] Lustre: DEBUG MARKER: == sanity test 244: sendfile with group lock tests =================================================== 18:46:27 (1531334787)

&amp;lt;ConMan&amp;gt; Console [trevis-12vm4] disconnected from &amp;lt;trevis-12:6003&amp;gt; at 07-11 19:49.

&amp;lt;ConMan&amp;gt; Console [trevis-12vm4] connected to &amp;lt;trevis-12:6003&amp;gt; at 07-11 19:49.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;It looks like there is no information on why this test hung. &lt;/p&gt;

&lt;p&gt;We have several instances of this with logs at&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/fca99f92-6fcd-11e8-aa24-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/fca99f92-6fcd-11e8-aa24-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/553d0058-80cd-11e8-b441-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/553d0058-80cd-11e8-b441-52540065bddc&lt;/a&gt;&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/f52eaf70-8d67-11e8-87f3-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/f52eaf70-8d67-11e8-87f3-52540065bddc&lt;/a&gt;&lt;/p&gt;
</description>
                <environment></environment>
        <key id="52839">LU-11183</key>
            <summary>sanity test 244 hangs with no information in the logs</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                    </labels>
                <created>Thu, 26 Jul 2018 22:54:15 +0000</created>
                <updated>Thu, 17 Mar 2022 16:10:54 +0000</updated>
                            <resolved>Thu, 17 Mar 2022 16:10:54 +0000</resolved>
                                    <version>Lustre 2.12.0</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="231267" author="colmstea" created="Wed, 1 Aug 2018 18:14:55 +0000"  >&lt;p&gt;Stack dumps were empty due to a bug in AT (ATM-1046). Are there new instances of this issue after 7/23?&lt;/p&gt;</comment>
                            <comment id="231470" author="adilger" created="Sun, 5 Aug 2018 21:29:07 +0000"  >&lt;p&gt;+1 on master&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sets/3ef30c02-97ed-11e8-b0aa-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/3ef30c02-97ed-11e8-b0aa-52540065bddc&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="232694" author="tappro" created="Tue, 28 Aug 2018 19:40:46 +0000"  >&lt;p&gt;+1 on master&lt;br/&gt;
&lt;a href=&quot;https://testing.whamcloud.com/test_sets/b74c176a-aacc-11e8-80f7-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/b74c176a-aacc-11e8-80f7-52540065bddc&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="233004" author="adilger" created="Tue, 4 Sep 2018 21:06:46 +0000"  >&lt;p&gt;16 failures in 1160 test runs over the past 4 weeks, a failure rate of about 1.4%.  Definitely not one of the top 10 at this point.&lt;/p&gt;</comment>
                            <comment id="233558" author="utopiabound" created="Fri, 14 Sep 2018 21:24:29 +0000"  >&lt;p&gt;master review-ldiskfs&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://testing.whamcloud.com/test_sets/8eb7ed38-b7c4-11e8-a7de-52540065bddc&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://testing.whamcloud.com/test_sets/8eb7ed38-b7c4-11e8-a7de-52540065bddc&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The console log for client 1 (onyx-30vm1) contains the stack traces of all running processes, including sendfile_grouplock:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 9676.429140] sendfile_groupl S ffff974e39416eb0     0 22342  22160 0x00000080
[ 9676.429921] Call Trace:
[ 9676.430178]  [&amp;lt;ffffffffb8714029&amp;gt;] schedule+0x29/0x70
[ 9676.430681]  [&amp;lt;ffffffffb87118d4&amp;gt;] schedule_timeout+0x174/0x2c0
[ 9676.431389]  [&amp;lt;ffffffffb80a3750&amp;gt;] ? internal_add_timer+0x70/0x70
[ 9676.431985]  [&amp;lt;ffffffffc09c2650&amp;gt;] ? ptlrpc_init_rq_pool+0x110/0x110 [ptlrpc]
[ 9676.432705]  [&amp;lt;ffffffffc09cc3b0&amp;gt;] ptlrpc_set_wait+0x480/0x790 [ptlrpc]
[ 9676.433440]  [&amp;lt;ffffffffb80cf670&amp;gt;] ? wake_up_state+0x20/0x20
[ 9676.433990]  [&amp;lt;ffffffffc09cc73d&amp;gt;] ptlrpc_queue_wait+0x7d/0x220 [ptlrpc]
[ 9676.434666]  [&amp;lt;ffffffffc09b16a2&amp;gt;] ldlm_cli_enqueue+0x3d2/0x920 [ptlrpc]
[ 9676.435414]  [&amp;lt;ffffffffc09ac7c0&amp;gt;] ? ldlm_expired_completion_wait+0x220/0x220 [ptlrpc]
[ 9676.436210]  [&amp;lt;ffffffffc0ba2d50&amp;gt;] ? osc_lock_lockless_cancel+0xe0/0xe0 [osc]
[ 9676.436929]  [&amp;lt;ffffffffc0ba1ad0&amp;gt;] ? osc_lock_upcall+0x580/0x580 [osc]
[ 9676.437653]  [&amp;lt;ffffffffc0b98965&amp;gt;] osc_enqueue_base+0x2b5/0x6a0 [osc]
[ 9676.438288]  [&amp;lt;ffffffffc0ba1550&amp;gt;] ? osc_lock_lvb_update+0x330/0x330 [osc]
[ 9676.438979]  [&amp;lt;ffffffffc0ba37bb&amp;gt;] osc_lock_enqueue+0x38b/0x840 [osc]
[ 9676.439738]  [&amp;lt;ffffffffc0ba1550&amp;gt;] ? osc_lock_lvb_update+0x330/0x330 [osc]
[ 9676.440443]  [&amp;lt;ffffffffc0802d95&amp;gt;] cl_lock_enqueue+0x65/0x120 [obdclass]
[ 9676.441176]  [&amp;lt;ffffffffc0bfc285&amp;gt;] lov_lock_enqueue+0x95/0x150 [lov]
[ 9676.441817]  [&amp;lt;ffffffffc0802d95&amp;gt;] cl_lock_enqueue+0x65/0x120 [obdclass]
[ 9676.442567]  [&amp;lt;ffffffffc0803327&amp;gt;] cl_lock_request+0x67/0x1f0 [obdclass]
[ 9676.443227]  [&amp;lt;ffffffffc080721b&amp;gt;] cl_io_lock+0x2bb/0x3d0 [obdclass]
[ 9676.443866]  [&amp;lt;ffffffffc08075ab&amp;gt;] cl_io_loop+0x11b/0xc70 [obdclass]
[ 9676.444634]  [&amp;lt;ffffffffc0c54db2&amp;gt;] ll_file_io_generic+0x4e2/0xd10 [lustre]
[ 9676.445301]  [&amp;lt;ffffffffc0695395&amp;gt;] ? cfs_trace_unlock_tcd+0x35/0x90 [libcfs]
[ 9676.445990]  [&amp;lt;ffffffffb8357e01&amp;gt;] ? vsnprintf+0x1c1/0x6a0
[ 9676.446644]  [&amp;lt;ffffffffb824cca0&amp;gt;] ? splice_write_to_file+0x120/0x120
[ 9676.447271]  [&amp;lt;ffffffffc0c55b32&amp;gt;] ll_file_aio_write+0x372/0x540 [lustre]
[ 9676.447945]  [&amp;lt;ffffffffc0c55da4&amp;gt;] ll_file_write+0xa4/0x170 [lustre]
[ 9676.448648]  [&amp;lt;ffffffffb821c082&amp;gt;] __kernel_write+0x72/0x140
[ 9676.449194]  [&amp;lt;ffffffffb824ccfe&amp;gt;] write_pipe_buf+0x5e/0xa0
[ 9676.449748]  [&amp;lt;ffffffffb824c386&amp;gt;] splice_from_pipe_feed+0x86/0x130
[ 9676.450446]  [&amp;lt;ffffffffb824cca0&amp;gt;] ? splice_write_to_file+0x120/0x120
[ 9676.451054]  [&amp;lt;ffffffffb824c96e&amp;gt;] __splice_from_pipe+0x6e/0x90
[ 9676.451639]  [&amp;lt;ffffffffb824cca0&amp;gt;] ? splice_write_to_file+0x120/0x120
[ 9676.452363]  [&amp;lt;ffffffffb824e34e&amp;gt;] splice_from_pipe+0x5e/0x90
[ 9676.452960]  [&amp;lt;ffffffffb824e399&amp;gt;] default_file_splice_write+0x19/0x30
[ 9676.453610]  [&amp;lt;ffffffffb824d190&amp;gt;] do_splice_from+0xb0/0xf0
[ 9676.454208]  [&amp;lt;ffffffffb824d1f0&amp;gt;] direct_splice_actor+0x20/0x30
[ 9676.454793]  [&amp;lt;ffffffffb824cf27&amp;gt;] splice_direct_to_actor+0xd7/0x200
[ 9676.455486]  [&amp;lt;ffffffffb824d1d0&amp;gt;] ? do_splice_from+0xf0/0xf0
[ 9676.456039]  [&amp;lt;ffffffffb824d0b2&amp;gt;] do_splice_direct+0x62/0x90
[ 9676.456611]  [&amp;lt;ffffffffb821bb38&amp;gt;] do_sendfile+0x1d8/0x3c0
[ 9676.457212]  [&amp;lt;ffffffffb821d1ea&amp;gt;] SyS_sendfile64+0x9a/0xb0
[ 9676.457764]  [&amp;lt;ffffffffb87206d5&amp;gt;] ? system_call_after_swapgs+0xa2/0x146
[ 9676.458508]  [&amp;lt;ffffffffb8720795&amp;gt;] system_call_fastpath+0x1c/0x21
[ 9676.459099]  [&amp;lt;ffffffffb87206e1&amp;gt;] ? system_call_after_swapgs+0xae/0x146&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="233564" author="adilger" created="Sat, 15 Sep 2018 00:47:45 +0000"  >&lt;p&gt;This looks like a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11128&quot; title=&quot;replay-single test timeout&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11128&quot;&gt;&lt;del&gt;LU-11128&lt;/del&gt;&lt;/a&gt;, which has a fix about to land.  I&apos;ve also got an additional debugging patch that should land, in case Alex&apos;s patch doesn&apos;t fix this, and to improve debuggability beyond &quot;deafening silence in the logs&quot;.&lt;/p&gt;</comment>
                            <comment id="329513" author="paf0186" created="Thu, 17 Mar 2022 16:10:54 +0000"  >&lt;p&gt;Probably a dupe of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-11128&quot; title=&quot;replay-single test timeout&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-11128&quot;&gt;&lt;del&gt;LU-11128&lt;/del&gt;&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="52659">LU-11128</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzzx3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>