<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:15:39 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1324] expected application behavior for timed out read operations</title>
                <link>https://jira.whamcloud.com/browse/LU-1324</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;A user application on our classified BGP system running a Lustre 1.8 client is having problems reading from 2.1 servers.  We are still light on details about what exact errors the application is getting back from reads, if any.  But on the client side we see reads timing out, lost connections, and EBUSY errors while reconnecting:&lt;/p&gt;

&lt;p&gt;Request ost_read sent 675s ago to 172.18.102.48@tcp1 has timed out (limit 675s)&lt;br/&gt;
Connection to ls2-OST029f (at 172.18.102.48@tcp1) was lost; in progress operations using the service will wait for recovery to complete&lt;br/&gt;
An error occurred while communicating with 172.18.102.48@tcp1; the ost_connect operation failed with -16&lt;br/&gt;
(repeats several times)&lt;br/&gt;
Connection restored to ls2-OST029f (at 172.18.102.48@tcp1)&lt;/p&gt;

&lt;p&gt;While on the server we get many of these corresponding events:&lt;/p&gt;

&lt;p&gt;Lustre: ls2-OST029f: Client &amp;lt;uuid&amp;gt; reconnecting&lt;br/&gt;
Lustre: ls2-OST029f: Client &amp;lt;uuid&amp;gt; refused reconnection, still busy with 2 active RPCs&lt;br/&gt;
LustreError: ldlm_lib.c:2614:target_bulk_io()) @@@ build PUT failed: rc -107 ... rc 0/-1&lt;br/&gt;
Lustre: ls2-OST029f: Build IO read error with &amp;lt;uuid&amp;gt; ... client will retry: -107&lt;br/&gt;
Lustre: ldlm_lib.c:913:target_handle_connect()) ls2-OST-29f: connection from &amp;lt;uuid&amp;gt; ...&lt;/p&gt;

&lt;p&gt;My understanding is that all of this should be transparent to the application and no error should propagate to user space unless the client is evicted.   Is this correct?&lt;/p&gt;</description>
                <environment>&lt;a href=&quot;https://github.com/chaos/lustre&quot;&gt;https://github.com/chaos/lustre&lt;/a&gt;&lt;br/&gt;
Client: Lustre 1.8 BGP&lt;br/&gt;
Server: 2.1.1-3chaos</environment>
        <key id="14014">LU-1324</key>
            <summary>expected application behavior for timed out read operations</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="6">Not a Bug</resolution>
                                        <assignee username="bobijam">Zhenyu Xu</assignee>
                                    <reporter username="nedbass">Ned Bass</reporter>
                        <labels>
                    </labels>
                <created>Fri, 13 Apr 2012 21:13:51 +0000</created>
                <updated>Mon, 4 Jun 2012 14:58:31 +0000</updated>
                            <resolved>Mon, 4 Jun 2012 14:58:31 +0000</resolved>
                                    <version>Lustre 2.1.1</version>
                    <version>Lustre 1.8.x (1.8.0 - 1.8.5)</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="34761" author="pjones" created="Sun, 15 Apr 2012 17:41:31 +0000"  >&lt;p&gt;Bobi&lt;/p&gt;

&lt;p&gt;Could you please comment on this one?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="34771" author="bobijam" created="Mon, 16 Apr 2012 03:40:05 +0000"  >&lt;p&gt;yes, and the client application I/O will wait untill being evicted.&lt;/p&gt;</comment>
                            <comment id="34843" author="nedbass" created="Mon, 16 Apr 2012 19:04:50 +0000"  >&lt;p&gt;Thanks.  Also, could these errors result in fewer than the requested number of bytes being read (i.e. short reads)?&lt;/p&gt;</comment>
                            <comment id="34860" author="bobijam" created="Mon, 16 Apr 2012 22:01:16 +0000"  >&lt;p&gt;The IO rpc won&apos;t be returned to client if network issue happens, and client app will get -EIO if it fails to reconnecting to the OST from which it tries to get data.&lt;/p&gt;</comment>
                            <comment id="39885" author="bobijam" created="Sun, 3 Jun 2012 23:52:55 +0000"  >&lt;p&gt;Ned,&lt;/p&gt;

&lt;p&gt;Any further question about this ticket? &lt;/p&gt;</comment>
                            <comment id="39933" author="nedbass" created="Mon, 4 Jun 2012 12:30:42 +0000"  >&lt;p&gt;Hi,&lt;/p&gt;

&lt;p&gt;No further questions, feel free to close the ticket.&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;</comment>
                            <comment id="39944" author="pjones" created="Mon, 4 Jun 2012 14:58:31 +0000"  >&lt;p&gt;Thanks Prakash.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvh2n:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6415</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>