<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:57:15 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-12971] changelog_deregister: fail to cancel 0 of 1 llog-records</title>
                <link>https://jira.whamcloud.com/browse/LU-12971</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Hello,&lt;/p&gt;

&lt;p&gt;We have four MDTs on Fir, and Robinhood kept being blocked on one of them, so we decided to clear/unregister all four changelog readers and start with a fresh filesystem scan + new readers. This worked on three MDTs, but on the last one, &lt;tt&gt;fir-MDT0003&lt;/tt&gt;, I am not able to successfully run&#160;&lt;tt&gt;changelog_deregister&lt;/tt&gt; due to the following errors:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Nov 14 11:39:23 fir-md1-s4 kernel: LustreError: 13794:0:(llog_cat.c:762:llog_cat_cancel_records()) fir-MDD0003: fail to cancel 0 of 1 llog-records: rc = -2
Nov 14 11:39:23 fir-md1-s4 kernel: LustreError: 13794:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11438 of catalog [0x5:0xa:0x0]: rc = -2
Nov 14 11:39:24 fir-md1-s4 kernel: Lustre: 14109:0:(llog_cat.c:894:llog_cat_process_or_fork()) fir-MDD0003: catlog [0x5:0xa:0x0] crosses index zero &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;A single &lt;tt&gt;changelog_deregister&lt;/tt&gt; will just hang. If I launch a second one, the first will eventually return with the following error:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@fir-md1-s4 ~]#  lctl --device fir-MDT0003 changelog_deregister cl1
error: changelog_deregister: No such file or directory
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;If I launch two &lt;tt&gt;changelog_deregister&lt;/tt&gt; commands at the same time, each in an endless loop, I&apos;m able to advance the cancel idx, but only slowly (from 11428 to 11438 in about two hours in the log below):&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Nov 14 09:34:18 fir-md1-s4 kernel: LustreError: 11745:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11428 of catalog [0x5:0xa:0x0]: rc = -2
Nov 14 09:34:19 fir-md1-s4 kernel: Lustre: 12314:0:(llog_cat.c:894:llog_cat_process_or_fork()) fir-MDD0003: catlog [0x5:0xa:0x0] crosses index zero
Nov 14 10:02:41 fir-md1-s4 kernel: LustreError: 12314:0:(llog_cat.c:762:llog_cat_cancel_records()) fir-MDD0003: fail to cancel 0 of 1 llog-records: rc = -2
Nov 14 10:02:41 fir-md1-s4 kernel: LustreError: 12314:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11431 of catalog [0x5:0xa:0x0]: rc = -2
Nov 14 10:02:42 fir-md1-s4 kernel: Lustre: 12741:0:(llog_cat.c:894:llog_cat_process_or_fork()) fir-MDD0003: catlog [0x5:0xa:0x0] crosses index zero
Nov 14 10:36:51 fir-md1-s4 kernel: LustreError: 11916:0:(llog_cat.c:762:llog_cat_cancel_records()) fir-MDD0003: fail to cancel 0 of 1 llog-records: rc = -2
Nov 14 10:36:51 fir-md1-s4 kernel: LustreError: 11916:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11433 of catalog [0x5:0xa:0x0]: rc = -2
Nov 14 10:36:52 fir-md1-s4 kernel: Lustre: 13213:0:(llog_cat.c:894:llog_cat_process_or_fork()) fir-MDD0003: catlog [0x5:0xa:0x0] crosses index zero
Nov 14 10:53:58 fir-md1-s4 kernel: LustreError: 12741:0:(llog_cat.c:762:llog_cat_cancel_records()) fir-MDD0003: fail to cancel 0 of 1 llog-records: rc = -2
Nov 14 10:53:58 fir-md1-s4 kernel: LustreError: 12741:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11435 of catalog [0x5:0xa:0x0]: rc = -2
Nov 14 10:53:59 fir-md1-s4 kernel: Lustre: 13452:0:(llog_cat.c:894:llog_cat_process_or_fork()) fir-MDD0003: catlog [0x5:0xa:0x0] crosses index zero
Nov 14 11:16:38 fir-md1-s4 kernel: LustreError: 13452:0:(llog_cat.c:762:llog_cat_cancel_records()) fir-MDD0003: fail to cancel 0 of 1 llog-records: rc = -2
Nov 14 11:16:38 fir-md1-s4 kernel: LustreError: 13452:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11436 of catalog [0x5:0xa:0x0]: rc = -2
Nov 14 11:16:39 fir-md1-s4 kernel: Lustre: 13794:0:(llog_cat.c:894:llog_cat_process_or_fork()) fir-MDD0003: catlog [0x5:0xa:0x0] crosses index zero
Nov 14 11:39:23 fir-md1-s4 kernel: LustreError: 13794:0:(llog_cat.c:762:llog_cat_cancel_records()) fir-MDD0003: fail to cancel 0 of 1 llog-records: rc = -2
Nov 14 11:39:23 fir-md1-s4 kernel: LustreError: 13794:0:(mdd_device.c:371:llog_changelog_cancel()) fir-MDD0003: cancel idx 11438 of catalog [0x5:0xa:0x0]: rc = -2
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;We would like to clear all changelogs and start fresh on &lt;tt&gt;fir-MDT0003&lt;/tt&gt;, because we have been using 2.12 since the beginning and might have corrupt changelogs there. What is the proper way to do that? Thanks!&lt;/p&gt;</description>
                <environment>CentOS 7.6 - Lustre 2.12.3 clients and servers</environment>
        <key id="57390">LU-12971</key>
            <summary>changelog_deregister: fail to cancel 0 of 1 llog-records</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="3" iconUrl="https://jira.whamcloud.com/images/icons/priorities/major.svg">Major</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="2">Won&apos;t Fix</resolution>
                                        <assignee username="tappro">Mikhail Pershin</assignee>
                                    <reporter username="sthiell">Stephane Thiell</reporter>
                        <labels>
                    </labels>
                <created>Thu, 14 Nov 2019 19:54:18 +0000</created>
                <updated>Sat, 23 Nov 2019 14:55:48 +0000</updated>
                            <resolved>Sat, 23 Nov 2019 14:55:48 +0000</resolved>
                                    <version>Lustre 2.12.3</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="258324" author="sthiell" created="Thu, 14 Nov 2019 20:01:41 +0000"  >&lt;p&gt;TL;DR: Can I just quickly remove &lt;tt&gt;changelog_catalog&lt;/tt&gt; and &lt;tt&gt;changelog_users&lt;/tt&gt; at the root level using ldiskfs on the MDT? Or do these files have special inodes? Alternatively, can I empty them?&lt;/p&gt;</comment>
                            <comment id="258332" author="sthiell" created="Thu, 14 Nov 2019 23:03:23 +0000"  >&lt;p&gt;I found some instructions on how to manually regenerate the changelog files in &lt;a href=&quot;https://jira.whamcloud.com/browse/EDU-122?focusedCommentId=255248&amp;amp;page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-255248&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;https://jira.whamcloud.com/browse/EDU-122?focusedCommentId=255248&amp;amp;page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-255248&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;I will wait until tomorrow in case you want to investigate the issue with me before I do that. We would like to resume Robinhood operation while at SC19 as it is used to automatically purge the filesystem.&lt;/p&gt;</comment>
                            <comment id="258408" author="pjones" created="Fri, 15 Nov 2019 18:26:19 +0000"  >&lt;p&gt;Mike&lt;/p&gt;

&lt;p&gt;Anything further to add here?&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="258718" author="sthiell" created="Sat, 23 Nov 2019 01:39:55 +0000"  >&lt;p&gt;Issue resolved! Something was wrong, likely due to this missing file:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[root@fir-md1-s4 ldiskfs]# llog_reader O/1/d4/4
llog_reader: Could not open the file O/1/d4/4.: No such file or directory (2)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;&#160;&lt;br/&gt;
Notes available as  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/33925/33925_fir-md1-s4_chglog_clear_notes_20191122.txt&quot; title=&quot;fir-md1-s4_chglog_clear_notes_20191122.txt attached to LU-12971&quot;&gt;fir-md1-s4_chglog_clear_notes_20191122.txt&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; &lt;br/&gt;
Some files put in  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/33926/33926_fir-md1-s4_chglog.tar.gz&quot; title=&quot;fir-md1-s4_chglog.tar.gz attached to LU-12971&quot;&gt;fir-md1-s4_chglog.tar.gz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;  for reference.&lt;/p&gt;

&lt;p&gt;&#160;&lt;br/&gt;
Robinhood has been happily reading changelogs from fir-MDT0003 all day:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;2019/11/22 14:44:01 [38287/1] STATS | ChangeLog reader #3:
2019/11/22 14:44:01 [38287/1] STATS |    fs_name    =   fir
2019/11/22 14:44:01 [38287/1] STATS |    mdt_name   =   MDT0003
2019/11/22 14:44:01 [38287/1] STATS |    reader_id  =   cl1
2019/11/22 14:44:01 [38287/1] STATS |    records read        = 217385998
2019/11/22 14:44:01 [38287/1] STATS |    interesting records = 185702
2019/11/22 14:44:01 [38287/1] STATS |    suppressed records  = 217195055
2019/11/22 14:44:01 [38287/1] STATS |    records pending     = 5560
2019/11/22 14:44:01 [38287/1] STATS |    last received: rec_id=217385998, rec_time=2019/11/22 14:43:55.908001, received at 2019/11/22 14:43:57.236896
2019/11/22 14:44:01 [38287/1] STATS |        receive speed: 8554.39 rec/sec, log/real time ratio: 0.98
2019/11/22 14:44:01 [38287/1] STATS |    last pushed: rec_id=214812359, rec_time=2019/11/22 14:38:58.510981, pushed at 2019/11/22 14:43:58.237611
2019/11/22 14:44:01 [38287/1] STATS |        push speed: 3430.45 rec/sec, log/real time ratio: 0.99
2019/11/22 14:44:01 [38287/1] STATS |    last committed: rec_id=214812359, rec_time=2019/11/22 14:38:58.510981, committed at 2019/11/22 14:43:58.271717
2019/11/22 14:44:01 [38287/1] STATS |        commit speed: 3430.45 rec/sec, log/real time ratio: 0.99
2019/11/22 14:44:01 [38287/1] STATS |    last cleared: rec_id=214812359, rec_time=2019/11/22 14:38:58.510981, cleared at 2019/11/22 14:43:58.272185
2019/11/22 14:44:01 [38287/1] STATS |        clear speed: 3430.45 rec/sec, log/real time ratio: 0.99
2019/11/22 14:44:01 [38287/1] STATS |    ChangeLog stats:
2019/11/22 14:44:01 [38287/1] STATS |    MARK: 0, CREAT: 75559, MKDIR: 10260, HLINK: 0, SLINK: 19095, MKNOD: 0, UNLNK: 46381
2019/11/22 14:44:01 [38287/1] STATS |    RMDIR: 2439, RENME: 8947, RNMTO: 0, OPEN: 0, CLOSE: 211845911, LYOUT: 10786, TRUNC: 5339182
2019/11/22 14:44:01 [38287/1] STATS |    SATTR: 22598, XATTR: 0, HSM: 0, MTIME: 4817, CTIME: 23, ATIME: 0, MIGRT: 0, FLRW: 0
2019/11/22 14:44:01 [38287/1] STATS |    RESYNC: 0, GXATR: 0, NOPEN: 0
 &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="258725" author="pjones" created="Sat, 23 Nov 2019 14:55:48 +0000"  >&lt;p&gt;ok - thanks&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="33926" name="fir-md1-s4_chglog.tar.gz" size="2207208" author="sthiell" created="Sat, 23 Nov 2019 01:37:10 +0000"/>
                            <attachment id="33925" name="fir-md1-s4_chglog_clear_notes_20191122.txt" size="8881" author="sthiell" created="Sat, 23 Nov 2019 01:37:06 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i00phr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>