<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:31:38 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3177] mpi process crash on accessing file/directory</title>
                <link>https://jira.whamcloud.com/browse/LU-3177</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We&apos;re running WRF (3.4.1) using 300 to 500 processes.  Some runs get stuck.  We found that in those cases, one of the processes crashes at the very beginning.  WRF&apos;s rsl.error.* file contains the following error message:&lt;/p&gt;

&lt;p&gt;forrtl: severe (121): Cannot access current working directory for unit 27, file &quot;Unknown&quot;&lt;/p&gt;

&lt;p&gt;About 5% of the runs encounter this problem.  After searching the Lustre bug reports, we suspect it is related to this one:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://projectlava.xyratex.com/show_bug.cgi?id=23978&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://projectlava.xyratex.com/show_bug.cgi?id=23978&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We then followed the suggested workaround of using a getcwd wrapper (without changing the WRF source code), and the problem did not recur in the subsequent 300 runs.  In the log messages generated by the wrapper, we found the following:&lt;/p&gt;

&lt;p&gt;rsl.out.0095: getcwdfixwrap: host/pid: node23/87536 time: 1366042583 problem null, retryctr: 0 errno: 2 errstr: &quot;No such file or directory&quot;&lt;br/&gt;
rsl.out.0095: getcwdfixwrap: host/pid: node23/87536 problem buf non null, value:&lt;/p&gt;

&lt;p&gt;It seems that the workaround recovers from the getcwd failure.&lt;/p&gt;

&lt;p&gt;There seems to be a fix in the 1.8.x versions.  We wonder whether that fix will also be applied to version 2.1.4.&lt;/p&gt;</description>
                <environment>CentOS 6.3, Intel fortran compiler 13.0.1, intel MPI, WRF</environment>
        <key id="18425">LU-3177</key>
            <summary>mpi process crash on accessing file/directory</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bobijam">Zhenyu Xu</assignee>
                                    <reporter username="kwchong">Joe K.W. Chong</reporter>
                        <labels>
                    </labels>
                <created>Tue, 16 Apr 2013 06:10:58 +0000</created>
                <updated>Tue, 11 Mar 2014 01:26:45 +0000</updated>
                            <resolved>Tue, 11 Mar 2014 01:26:45 +0000</resolved>
                                    <version>Lustre 2.1.4</version>
                                    <fixVersion>Lustre 2.1.5</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>5</watches>
                                                                            <comments>
                            <comment id="56528" author="pjones" created="Thu, 18 Apr 2013 01:11:35 +0000"  >&lt;p&gt;Bobijam&lt;/p&gt;

&lt;p&gt;Could you please confirm whether this issue will be resolved by &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-645&quot; title=&quot;getcwd fails&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-645&quot;&gt;&lt;del&gt;LU-645&lt;/del&gt;&lt;/a&gt; and thus covered by this landing - &lt;a href=&quot;http://git.whamcloud.com/?p=fs/lustre-release.git;a=commit;h=94509cda52b49a0153fae4b7a1f0772077aa9809?&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://git.whamcloud.com/?p=fs/lustre-release.git;a=commit;h=94509cda52b49a0153fae4b7a1f0772077aa9809?&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="56529" author="bobijam" created="Thu, 18 Apr 2013 02:55:34 +0000"  >&lt;p&gt;The patch &lt;a href=&quot;http://review.whamcloud.com/3206&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/3206&lt;/a&gt; can be applied on top of 2.1.4 and should fix the issue.&lt;br/&gt;
It would be even better to apply &lt;a href=&quot;http://review.whamcloud.com/2400&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/2400&lt;/a&gt;, which also handles the issue while staying aligned with the 2.1.5 code base.&lt;/p&gt;</comment>
                            <comment id="56580" author="pjones" created="Thu, 18 Apr 2013 21:23:52 +0000"  >&lt;p&gt;Bobijam&lt;/p&gt;

&lt;p&gt;Am I correct in understanding that simply upgrading to 2.1.5 itself would also address this issue?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="56604" author="bobijam" created="Fri, 19 Apr 2013 01:29:58 +0000"  >&lt;p&gt;Peter,&lt;/p&gt;

&lt;p&gt;Yes, that is correct. 2.1.5 contains &lt;a href=&quot;http://review.whamcloud.com/2400&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/2400&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="78803" author="jfc" created="Sat, 8 Mar 2014 01:46:24 +0000"  >&lt;p&gt;Joe,&lt;br/&gt;
Is there any further action required on this ticket?&lt;br/&gt;
If not, can I go ahead and mark it as resolved?&lt;br/&gt;
Thanks,&lt;br/&gt;
~ jfc.&lt;/p&gt;</comment>
                            <comment id="78836" author="kwchong" created="Mon, 10 Mar 2014 01:04:24 +0000"  >&lt;p&gt;Dear John,&lt;/p&gt;

&lt;p&gt;Yes, please mark it as resolved.&lt;/p&gt;

&lt;p&gt;regards,&lt;/p&gt;

&lt;p&gt;Joe&lt;/p&gt;</comment>
                            <comment id="78961" author="jfc" created="Tue, 11 Mar 2014 01:26:45 +0000"  >&lt;p&gt;The customer says it is OK to resolve, and the patch has landed in 2.1.5.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvo8f:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>7744</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>