<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:32:44 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-3302] ll_fill_super() Unable to process log: -2</title>
                <link>https://jira.whamcloud.com/browse/LU-3302</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;We updated a client to 2.3.64-4chaos and tried to mount a 2.3.63-6chaos server.  The mount fails with&lt;/p&gt;




&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LustreError: 15c-8: MGC172.20.20.201@o2ib500: The configuration from log &apos;fsv-client&apos; failed (-2). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.
LustreError: 14351:0:(llite_lib.c:1043:ll_fill_super()) Unable to process log: -2
Lustre: Unmounted fsv-client
LustreError: 14351:0:(obd_mount.c:1265:lustre_fill_super()) Unable to mount  (-2)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Using git bisect I found the mount failure was introduced with this patch:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/#change,5820&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,5820&lt;/a&gt;&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;LU-2684 fid: unify ostid and FID
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The critical questions at this point are:&lt;/p&gt;

&lt;ul class=&quot;alternate&quot; type=&quot;square&quot;&gt;
	&lt;li&gt;Can we solve this problem by updating both server and client to 2.3.64-4chaos?&lt;/li&gt;
&lt;/ul&gt;


&lt;ul class=&quot;alternate&quot; type=&quot;square&quot;&gt;
	&lt;li&gt;Can we safely upgrade the server, or does the above patch introduce on-disk format incompatibilities?&lt;/li&gt;
&lt;/ul&gt;


&lt;ul class=&quot;alternate&quot; type=&quot;square&quot;&gt;
	&lt;li&gt;Will we be able to safely revert the server to 2.3.63 in case we find problems, or will it write new objects in an incompatible format?&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;LLNL-bug-id: TOSS-2060&lt;/p&gt;</description>
                <environment>PPC client</environment>
        <key id="18764">LU-3302</key>
            <summary>ll_fill_super() Unable to process log: -2</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="di.wang">Di Wang</assignee>
                                    <reporter username="nedbass">Ned Bass</reporter>
                        <labels>
                            <label>LB</label>
                    </labels>
                <created>Thu, 9 May 2013 00:29:59 +0000</created>
                <updated>Fri, 17 May 2013 16:58:04 +0000</updated>
                            <resolved>Fri, 10 May 2013 18:17:23 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                    <fixVersion>Lustre 2.4.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="57968" author="nedbass" created="Thu, 9 May 2013 00:37:41 +0000"  >&lt;p&gt;As an editorial comment, while we understand that interoperability issues are inevitable in a pre-release branch, we wish such changes would be advertised more prominently.  Clear statements about compatibility between tags would really help us plan our update process.  At a minimum, patches that introduce incompatibilities should say so clearly in the commit message.&lt;/p&gt;</comment>
                            <comment id="57983" author="nedbass" created="Thu, 9 May 2013 05:43:48 +0000"  >&lt;p&gt;Di, can you advise us on this? Thanks&lt;/p&gt;</comment>
                            <comment id="57984" author="adilger" created="Thu, 9 May 2013 05:57:53 +0000"  >&lt;p&gt;Ned, can you please attach a -1 debug log from the 2.3.64 client, and ideally also from the MGS.&lt;/p&gt;

&lt;p&gt;I agree that the &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2684&quot; title=&quot;convert ost_id to lu_fid for FID_SEQ_NORMAL objects&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2684&quot;&gt;&lt;del&gt;LU-2684&lt;/del&gt;&lt;/a&gt; change was problematic, and it was intended to only change the network protocol between clients and OSTs when running DNE.  The &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2888&quot; title=&quot;After downgrade from 2.4 to 2.1.4, hit (osd_handler.c:2343:osd_index_try()) ASSERTION( dt_object_exists(dt) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2888&quot;&gt;&lt;del&gt;LU-2888&lt;/del&gt;&lt;/a&gt; patch &lt;a href=&quot;http://review.whamcloud.com/6044&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/6044&lt;/a&gt; (which was already included in 2.3.64) should have fixed the LLOG handling, so I&apos;m not sure what the exact cause of your problem is.  AFAIK, the current master code interoperates with 2.1.5 and 2.3.0 properly, but there might be something specific with your setup that is causing grief.&lt;/p&gt;</comment>
                            <comment id="57985" author="nedbass" created="Thu, 9 May 2013 06:09:49 +0000"  >&lt;p&gt;Andreas, yes I&apos;ll grab the logs.&lt;/p&gt;

&lt;p&gt;Note the above error was a 2.3.64 client talking to a 2.3.63 server.  Do you mean that patch 6044 fixed LLOG handling on the client, or is it needed on the server as well?&lt;/p&gt;</comment>
                            <comment id="57986" author="nedbass" created="Thu, 9 May 2013 06:18:28 +0000"  >&lt;p&gt;Attaching -1 debug logs for client and MDS.  Note these were not captured from the same mount attempt.&lt;/p&gt;

&lt;p&gt;The NID of the client is 172.20.16.10@o2ib500.&lt;/p&gt;</comment>
                            <comment id="57987" author="nedbass" created="Thu, 9 May 2013 06:22:05 +0000"  >&lt;p&gt;I did notice the mgs got ENOENT handling opcodes LLOG_ORIGIN_HANDLE_CREATE and LLOG_ORIGIN_HANDLE_READ_HEADER:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;20000000:01000000:6.0:1368040430.786604:0:18265:0:(mgs_handler.c:757:mgs_handle()) @@@ MGS fail to handle opc = 501: rc = -2
  req@ffff881019ecb050 x1434494006460492/t0(0) o501-&amp;gt;2e89e428-68d9-71a1-75f0-147bc1963566@172.20.16.10@o2ib500:0/0 lens 296/0 e 0 to 0 dl 1368040491 ref 1 fl Interpret:/0/ffffffff rc 0/-1
...
20000000:01000000:6.0:1368040430.788063:0:18265:0:(mgs_handler.c:757:mgs_handle()) @@@ MGS fail to handle opc = 503: rc = -2
  req@ffff881019f14850 x1434494006460504/t0(0) o503-&amp;gt;2e89e428-68d9-71a1-75f0-147bc1963566@172.20.16.10@o2ib500:0/0 lens 272/0 e 0 to 0 dl 1368040491 ref 1 fl Interpret:/0/ffffffff rc 0/-1
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="57988" author="adilger" created="Thu, 9 May 2013 06:36:11 +0000"  >&lt;p&gt;The 6044/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2888&quot; title=&quot;After downgrade from 2.4 to 2.1.4, hit (osd_handler.c:2343:osd_index_try()) ASSERTION( dt_object_exists(dt) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2888&quot;&gt;&lt;del&gt;LU-2888&lt;/del&gt;&lt;/a&gt; patch fixed the handling on the server, but the original problem patch from &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-2684&quot; title=&quot;convert ost_id to lu_fid for FID_SEQ_NORMAL objects&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-2684&quot;&gt;&lt;del&gt;LU-2684&lt;/del&gt;&lt;/a&gt; wasn&apos;t in 2.3.63, so it shouldn&apos;t be relevant.&lt;/p&gt;</comment>
                            <comment id="58031" author="pjones" created="Thu, 9 May 2013 17:36:27 +0000"  >&lt;p&gt;Di&lt;/p&gt;

&lt;p&gt;Could you please comment on this?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="58032" author="adilger" created="Thu, 9 May 2013 17:37:33 +0000"  >&lt;p&gt;Ned, is this a PPC client?  It would be useful to include this information in the &quot;Environment&quot; section when filing a bug.&lt;/p&gt;</comment>
                            <comment id="58036" author="nedbass" created="Thu, 9 May 2013 17:46:55 +0000"  >&lt;p&gt;Yes it is.  Sorry for the omission.&lt;/p&gt;</comment>
                            <comment id="58037" author="adilger" created="Thu, 9 May 2013 17:53:07 +0000"  >&lt;p&gt;John, it was mentioned to me that you have already found some endian issues with the FID-on-OST code?  Could you please point out where they are, it might be that this is the source of the problem being seen here, since we didn&apos;t see any problems with our x86_64 clients for interoperability.&lt;/p&gt;</comment>
                            <comment id="58040" author="jhammond" created="Thu, 9 May 2013 17:58:57 +0000"  >&lt;p&gt;Possibly. Please see &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3294&quot; title=&quot;osp_sync_llog_init(): ASSERTION( lgh != ((void *)0) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3294&quot;&gt;&lt;del&gt;LU-3294&lt;/del&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Ned, it would be interesting to know what happens when you create a new 2.3.65 FS on ppc, unmount, and then remount it.&lt;/p&gt;</comment>
                            <comment id="58050" author="nedbass" created="Thu, 9 May 2013 19:13:23 +0000"  >&lt;p&gt;John, okay, we&apos;re getting a test environment set up where I should be able to do that test.&lt;/p&gt;</comment>
                            <comment id="58079" author="nedbass" created="Thu, 9 May 2013 21:20:26 +0000"  >&lt;p&gt;Haven&apos;t tried 2.3.65 yet, but initial testing suggests updating the server to 2.3.64 lets the mount succeed.  Here&apos;s what I did:&lt;/p&gt;

&lt;p&gt;1. Tried to mount a 2.3.62 server from a 2.3.64 PPC client.  Fails with &quot;ll_fill_super() Unable to process log: -2&quot;&lt;br/&gt;
2. Updated the server to 2.3.64.  Mount from 2.3.64 PPC client succeeds.&lt;/p&gt;</comment>
                            <comment id="58081" author="di.wang" created="Thu, 9 May 2013 21:25:09 +0000"  >&lt;p&gt;Ned, I just checked the debug log, it seems client get correct log ID after swab, &lt;/p&gt;

&lt;p&gt;Here are the client log&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;00000040:00000001:5.0:1368040600.989913:5152:8187:0:(llog_swab.c:86:lustre_swab_llogd_body()) Process entered
00000040:00001000:5.0:1368040600.989914:5328:8187:0:(llog_swab.c:53:print_llogd_body()) llogd body: c000000f50e9a100
00000040:00001000:5.0:1368040600.989915:5328:8187:0:(llog_swab.c:55:print_llogd_body())         lgd_logid.lgl_oi: 0x6400000000000000:16777216
00000040:00001000:5.0:1368040600.989915:5328:8187:0:(llog_swab.c:56:print_llogd_body())         lgd_logid.lgl_ogen: 0x0
00000040:00001000:5.0:1368040600.989916:5328:8187:0:(llog_swab.c:57:print_llogd_body())         lgd_ctxt_idx: 0x0
00000040:00001000:5.0:1368040600.989917:5328:8187:0:(llog_swab.c:58:print_llogd_body())         lgd_llh_flags: 0x0
00000040:00001000:5.0:1368040600.989917:5328:8187:0:(llog_swab.c:59:print_llogd_body())         lgd_index: 0x0
00000040:00001000:5.0:1368040600.989918:5328:8187:0:(llog_swab.c:60:print_llogd_body())         lgd_saved_index: 0x0
00000040:00001000:5.0:1368040600.989918:5328:8187:0:(llog_swab.c:61:print_llogd_body())         lgd_len: 0x0
00000040:00001000:5.0:1368040600.989919:5328:8187:0:(llog_swab.c:62:print_llogd_body())         lgd_cur_offset: 0x0
00000040:00001000:5.0:1368040600.989920:5328:8187:0:(llog_swab.c:53:print_llogd_body()) llogd body: c000000f50e9a100
00000040:00001000:5.0:1368040600.989920:5328:8187:0:(llog_swab.c:55:print_llogd_body())         lgd_logid.lgl_oi: 0x64:1
00000040:00001000:5.0:1368040600.989921:5328:8187:0:(llog_swab.c:56:print_llogd_body())         lgd_logid.lgl_ogen: 0x0
00000040:00001000:5.0:1368040600.989921:5328:8187:0:(llog_swab.c:57:print_llogd_body())         lgd_ctxt_idx: 0x0
00000040:00001000:5.0:1368040600.989922:5328:8187:0:(llog_swab.c:58:print_llogd_body())         lgd_llh_flags: 0x0
00000040:00001000:5.0:1368040600.989923:5328:8187:0:(llog_swab.c:59:print_llogd_body())         lgd_index: 0x0
00000040:00001000:5.0:1368040600.989923:5328:8187:0:(llog_swab.c:60:print_llogd_body())         lgd_saved_index: 0x0
00000040:00001000:5.0:1368040600.989924:5328:8187:0:(llog_swab.c:61:print_llogd_body())         lgd_len: 0x0
00000040:00001000:5.0:1368040600.989924:5328:8187:0:(llog_swab.c:62:print_llogd_body())         lgd_cur_offset: 0x0
00000040:00000001:5.0:1368040600.989925:5152:8187:0:(llog_swab.c:97:lustre_swab_llogd_body()) Process leaving
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;But somehow server can not find the log object by this ID.  Unfortunately, I can not find correspondent mgs handling information in the MDS debug log. Could you please redo the test update the debug log. &lt;/p&gt;

&lt;p&gt;In the mean time, I do see there are some problem during the logid swab(John also point out one in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3294&quot; title=&quot;osp_sync_llog_init(): ASSERTION( lgh != ((void *)0) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3294&quot;&gt;&lt;del&gt;LU-3294&lt;/del&gt;&lt;/a&gt;) I will cook the patch now. &lt;/p&gt;</comment>
                            <comment id="58082" author="di.wang" created="Thu, 9 May 2013 21:48:21 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/#change,6305&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,6305&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="58088" author="jhammond" created="Thu, 9 May 2013 22:52:23 +0000"  >&lt;p&gt;Ned would you confirm that these are x86_64 servers and ppc/ppc64 clients? In that case it&apos;s unlikely that you&apos;re affected by &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-3294&quot; title=&quot;osp_sync_llog_init(): ASSERTION( lgh != ((void *)0) ) failed&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-3294&quot;&gt;&lt;del&gt;LU-3294&lt;/del&gt;&lt;/a&gt; since that issue is probably limited to BE servers.&lt;/p&gt;</comment>
                            <comment id="58099" author="nedbass" created="Thu, 9 May 2013 23:30:06 +0000"  >&lt;p&gt;Yes these are x86_64 servers and ppc64 clients.&lt;/p&gt;

&lt;p&gt;Also, if it is an unfixed swabbing bug, I would expect the mount to also fail with 2.3.64 servers.&lt;/p&gt;</comment>
                            <comment id="58168" author="jlevi" created="Fri, 10 May 2013 17:50:33 +0000"  >&lt;p&gt;Now that this patch has landed, can we get confirmation that this is fixed?&lt;br/&gt;
Thank you!&lt;/p&gt;</comment>
                            <comment id="58170" author="nedbass" created="Fri, 10 May 2013 18:15:59 +0000"  >&lt;p&gt;With the patch, a 2.3.64 PPC client can mount from a 2.3.63 server.  So this appears to be fixed.   Thanks&lt;/p&gt;</comment>
                            <comment id="58171" author="jlevi" created="Fri, 10 May 2013 18:17:23 +0000"  >&lt;p&gt;Based on latest comments, this patch landed and has fixed the issue. Closing ticket.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="18748">LU-3294</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="12659" name="lustre.log.vesta-mds1.1368040434.gz" size="3370183" author="nedbass" created="Thu, 9 May 2013 06:18:28 +0000"/>
                            <attachment id="12660" name="lustre.log.vulcanlac1.1368040616.gz" size="53771" author="nedbass" created="Thu, 9 May 2013 06:18:28 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzvqhr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>8173</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>