<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:20:50 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-1919] Soft lockup on MGS stop</title>
                <link>https://jira.whamcloud.com/browse/LU-1919</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;After a bunch of recent MGS-related landings, I am unable to go into sanity much anymore, first or second mgs unmount attempt hangs on current master.&lt;br/&gt;
100% incident ratio with REFORMAT=yes SLOW=yes sh sanity.sh&lt;br/&gt;
I have 10G ram and 8 CPUs in this vm:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;[  742.243128] Lustre: DEBUG MARKER: == sanity test 17k: symlinks: rsync with xattrs enabled =========================== 18:09:54 (1347487794)
[  742.751459] Lustre: DEBUG MARKER: == sanity test 17m: run e2fsck against MDT which contains &lt;span class=&quot;code-object&quot;&gt;short&lt;/span&gt;/&lt;span class=&quot;code-object&quot;&gt;long&lt;/span&gt; symlink == 18:09:55 (1347487795)
[  772.153542] Lustre: 4505:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487817/real 1347487817]  req@ffff88025a670bf0 x1412943327533287/t0(0) o400-&amp;gt;lustre-MDT0000-mdc-ffff880239395bf0@0@lo:12/10 lens 224/224 e 0 to 1 dl 1347487824 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  772.158030] Lustre: lustre-MDT0000-mdc-ffff880239395bf0: Connection to lustre-MDT0000 (at 0@lo) was lost; in progress operations using &lt;span class=&quot;code-keyword&quot;&gt;this&lt;/span&gt; service will wait &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; recovery to complete
[  777.152548] Lustre: 4511:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487822/real 1347487822]  req@ffff88025e138bf0 x1412943327533293/t0(0) o400-&amp;gt;lustre-MDT0000-mdc-ffff880239395bf0@0@lo:12/10 lens 224/224 e 0 to 1 dl 1347487829 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  777.153801] LustreError: 166-1: MGC192.168.1.205@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using &lt;span class=&quot;code-keyword&quot;&gt;this&lt;/span&gt; service will fail
[  777.158799] Lustre: 4511:0:(client.c:1905:ptlrpc_expire_one_request()) Skipped 1 previous similar message
[  778.153531] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487824/real 1347487824]  req@ffff880261c0ebf0 x1412943327533296/t0(0) o38-&amp;gt;lustre-MDT0000-mdc-ffff880239395bf0@0@lo:12/10 lens 400/544 e 0 to 1 dl 1347487830 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  783.153544] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487829/real 1347487829]  req@ffff88028f55bbf0 x1412943327533297/t0(0) o250-&amp;gt;MGC192.168.1.205@tcp@0@lo:26/25 lens 400/544 e 0 to 1 dl 1347487835 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  793.153542] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487834/real 1347487834]  req@ffff8802389adbf0 x1412943327533300/t0(0) o38-&amp;gt;lustre-MDT0000-mdc-ffff880239395bf0@0@lo:12/10 lens 400/544 e 0 to 1 dl 1347487845 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  798.153534] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487839/real 1347487839]  req@ffff880231971bf0 x1412943327533303/t0(0) o250-&amp;gt;MGC192.168.1.205@tcp@0@lo:26/25 lens 400/544 e 0 to 1 dl 1347487850 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  813.153531] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487849/real 1347487849]  req@ffff880247083bf0 x1412943327533308/t0(0) o38-&amp;gt;lustre-MDT0000-mdc-ffff880239395bf0@0@lo:12/10 lens 400/544 e 0 to 1 dl 1347487865 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  817.156401] Lustre: lustre-OST0000: haven&lt;span class=&quot;code-quote&quot;&gt;&apos;t heard from client lustre-MDT0000-mdtlov_UUID (at 0@lo) in 52 seconds. I think it&apos;&lt;/span&gt;s dead, and I am evicting it. exp ffff8802390bdbf0, cur 1347487869 expire 1347487839 last 1347487817
[  838.152794] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) @@@ Request  sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1347487869/real 1347487869]  req@ffff880262d9bbf0 x1412943327533318/t0(0) o38-&amp;gt;lustre-MDT0000-mdc-ffff880239395bf0@0@lo:12/10 lens 400/544 e 0 to 1 dl 1347487890 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[  838.157141] Lustre: 4504:0:(client.c:1905:ptlrpc_expire_one_request()) Skipped 1 previous similar message
[  840.056007] BUG: soft lockup - CPU#1 stuck &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; 67s! [obd_zombid:4474]
[  840.056255] Modules linked in: lustre obdfilter ost cmm mdt osd_ldiskfs fsfilt_ldiskfs ldiskfs exportfs mdd mds mgs lquota jbd obdecho mgc lov osc mdc lmv fid fld ptlrpc obdclass lvfs ksocklnd lnet sha512_generic sha256_generic libcfs sunrpc ipv6 microcode virtio_balloon virtio_net i2c_piix4 i2c_core ext4 mbcache jbd2 virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
[  840.056255] CPU 1 
[  840.056255] Modules linked in: lustre obdfilter ost cmm mdt osd_ldiskfs fsfilt_ldiskfs ldiskfs exportfs mdd mds mgs lquota jbd obdecho mgc lov osc mdc lmv fid fld ptlrpc obdclass lvfs ksocklnd lnet sha512_generic sha256_generic libcfs sunrpc ipv6 microcode virtio_balloon virtio_net i2c_piix4 i2c_core ext4 mbcache jbd2 virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
[  840.056255] 
[  840.056255] Pid: 4474, comm: obd_zombid Not tainted 2.6.32-debug #3 Bochs Bochs
[  840.056255] RIP: 0010:[&amp;lt;ffffffffa0388808&amp;gt;]  [&amp;lt;ffffffffa0388808&amp;gt;] server_deregister_mount+0xa8/0x390 [obdclass]
[  840.056255] RSP: 0018:ffff880292c3ddd0  EFLAGS: 00010282
[  840.056255] RAX: ffff880265478e50 RBX: ffff880292c3dde0 RCX: 0000000000000000
[  840.056255] RDX: ffff880292e45f08 RSI: 0000000000000000 RDI: ffff88028fa5fc70
[  840.056255] RBP: ffffffff8100bc0e R08: 00000000ffffffff R09: 000000000000009e
[  840.056255] R10: 000000000000000f R11: 000000000000000f R12: ffff8802654b008c
[  840.056255] R13: 0000000000000000 R14: 0000000000000000 R15: ffffffffffffffff
[  840.056255] FS:  00007f3b197eb700(0000) GS:ffff880028240000(0000) knlGS:0000000000000000
[  840.056255] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[  840.056255] CR2: ffff880292e45fd8 CR3: 0000000001a25000 CR4: 00000000000006e0
[  840.056255] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  840.056255] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  840.056255] &lt;span class=&quot;code-object&quot;&gt;Process&lt;/span&gt; obd_zombid (pid: 4474, threadinfo ffff880292c3c000, task ffff88028d4a23c0)
[  840.056255] Stack:
[  840.056255]  ffff880265478e50 ffff8802654b008c ffff880292c3de00 ffffffffa0388c50
[  840.056255] &amp;lt;d&amp;gt; ffff8802654b0080 ffff8802654b008c ffff880292c3de20 ffffffffa098eb3e
[  840.056255] &amp;lt;d&amp;gt; ffff8802654b0080 0000000000000000 ffff880292c3de90 ffffffffa037b7f2
[  840.056255] Call Trace:
[  840.056255]  [&amp;lt;ffffffffa0388c50&amp;gt;] ? server_put_mount+0x160/0x290 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa098eb3e&amp;gt;] ? mgs_cleanup+0x4e/0x1c0 [mgs]
[  840.056255]  [&amp;lt;ffffffffa037b7f2&amp;gt;] ? class_decref+0x212/0x590 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa0364c74&amp;gt;] ? obd_zombie_impexp_cull+0x314/0x620 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa0365045&amp;gt;] ? obd_zombie_impexp_thread+0xc5/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffff81057d60&amp;gt;] ? default_wake_function+0x0/0x20
[  840.056255]  [&amp;lt;ffffffffa0364f80&amp;gt;] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffff8100c14a&amp;gt;] ? child_rip+0xa/0x20
[  840.056255]  [&amp;lt;ffffffffa0364f80&amp;gt;] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa0364f80&amp;gt;] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffff8100c140&amp;gt;] ? child_rip+0x0/0x20
[  840.056255] Code: a6 3b a0 c7 05 6a d4 08 00 93 00 00 00 48 c7 05 6b d4 08 00 00 00 00 00 c7 05 59 d4 08 00 04 00 00 01 48 8b 50 10 48 85 d2 74 07 &amp;lt;44&amp;gt; 8b 82 d0 00 00 00 4c 89 e1 48 c7 c6 a8 25 3c a0 48 c7 c7 40 
[  840.056255] Call Trace:
[  840.056255]  [&amp;lt;ffffffffa0388c50&amp;gt;] ? server_put_mount+0x160/0x290 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa098eb3e&amp;gt;] ? mgs_cleanup+0x4e/0x1c0 [mgs]
[  840.056255]  [&amp;lt;ffffffffa037b7f2&amp;gt;] ? class_decref+0x212/0x590 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa0364c74&amp;gt;] ? obd_zombie_impexp_cull+0x314/0x620 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa0365045&amp;gt;] ? obd_zombie_impexp_thread+0xc5/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffff81057d60&amp;gt;] ? default_wake_function+0x0/0x20
[  840.056255]  [&amp;lt;ffffffffa0364f80&amp;gt;] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffff8100c14a&amp;gt;] ? child_rip+0xa/0x20
[  840.056255]  [&amp;lt;ffffffffa0364f80&amp;gt;] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffffa0364f80&amp;gt;] ? obd_zombie_impexp_thread+0x0/0x1c0 [obdclass]
[  840.056255]  [&amp;lt;ffffffff8100c140&amp;gt;] ? child_rip+0x0/0x20
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment></environment>
        <key id="15933">LU-1919</key>
            <summary>Soft lockup on MGS stop</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="wc-triage">WC Triage</assignee>
                                    <reporter username="green">Oleg Drokin</reporter>
                        <labels>
                    </labels>
                <created>Wed, 12 Sep 2012 18:16:59 +0000</created>
                <updated>Wed, 2 Jan 2013 15:59:45 +0000</updated>
                            <resolved>Fri, 21 Sep 2012 13:27:51 +0000</resolved>
                                    <version>Lustre 2.4.0</version>
                                    <fixVersion>Lustre 2.4.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="44798" author="bzzz" created="Thu, 13 Sep 2012 12:24:19 +0000"  >&lt;p&gt;&lt;a href=&quot;http://review.whamcloud.com/#change,3982&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#change,3982&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="45357" author="bzzz" created="Fri, 21 Sep 2012 13:03:38 +0000"  >&lt;p&gt;can be closed now?&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzv4in:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>4265</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>