<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:37:57 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-10760] MGS startup is happening asynchronously and taking a long time</title>
                <link>https://jira.whamcloud.com/browse/LU-10760</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;I have noticed that from 2.10.3 to &lt;a href=&quot;https://build.hpdd.intel.com/job/lustre-reviews/54732/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;b2_10&lt;/a&gt; MGS startup is happening asynchronously and taking much longer.&lt;/p&gt;

&lt;p&gt;In 2.10.3, immediately after (or within a very small number of seconds at most) the &lt;tt&gt;mount&lt;/tt&gt; for the MGS returns it reports as being up:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/MGS /mnt/MGS
# lctl dl:
  0 UP osd-zfs MGS-osd MGS-osd_UUID 4
  1 UP mgs MGS MGS 4
  2 UP mgc MGC10.14.83.84@tcp f2845b74-6eb5-07a5-1aaa-d7c54f76df5f 4



&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;However the same operation on &lt;a href=&quot;https://build.hpdd.intel.com/job/lustre-reviews/54732/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;b2_10&lt;/a&gt; is taking on the order of 10-11 seconds after the &lt;tt&gt;mount&lt;/tt&gt; returns before &lt;tt&gt;lctl dl&lt;/tt&gt; shows the MGS as being up.&lt;/p&gt;

&lt;p&gt;I have attached &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29666/29666_debug-lctl-1519949571.log.xz&quot; title=&quot;debug-lctl-1519949571.log.xz attached to LU-10760&quot;&gt;debug-lctl-1519949571.log.xz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; which is a lustre debug log of the mount an &lt;tt&gt;lctl conf_param&lt;/tt&gt; where it takes 10-11 seconds before the &lt;tt&gt;lctl conf_param&lt;/tt&gt; returns with success instead of a -19.&lt;/p&gt;</description>
                <environment></environment>
        <key id="51090">LU-10760</key>
            <summary>MGS startup is happening asynchronously and taking a long time</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="6">Not a Bug</resolution>
                                        <assignee username="utopiabound">Nathaniel Clark</assignee>
                                    <reporter username="brian">Brian Murrell</reporter>
                        <labels>
                    </labels>
                <created>Fri, 2 Mar 2018 19:59:30 +0000</created>
                <updated>Sat, 17 Mar 2018 10:59:14 +0000</updated>
                            <resolved>Sat, 17 Mar 2018 10:59:14 +0000</resolved>
                                    <version>Lustre 2.10.4</version>
                                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                            <comments>
                            <comment id="222586" author="pjones" created="Tue, 6 Mar 2018 18:18:52 +0000"  >&lt;p&gt;Nathaniel&lt;/p&gt;

&lt;p&gt;Can you please investigate?&lt;/p&gt;

&lt;p&gt;Thanks&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="222613" author="utopiabound" created="Tue, 6 Mar 2018 19:47:45 +0000"  >&lt;p&gt;Looking at the  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29666/29666_debug-lctl-1519949571.log.xz&quot; title=&quot;debug-lctl-1519949571.log.xz attached to LU-10760&quot;&gt;debug-lctl-1519949571.log.xz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Thu Mar  1 19:07:59 2018 [17187] (mgs_handler.c:1010:mgs_iocontrol()) Process entered
Thu Mar  1 19:07:59 2018 [17187] (mgs_handler.c:1011:mgs_iocontrol()) handling ioctl cmd 0x400866bb
...
Thu Mar  1 19:07:59 2018 [17187] (mgs_llog.c:4832:mgs_set_conf_param()) Process entered
...
Thu Mar  1 19:07:59 2018 [17187] (mgs_llog.c:3609:mgs_write_log_param()) next param &apos;llite.max_cached_mb=16&apos;
...
Thu Mar  1 19:07:59 2018 [17187] (mgs_llog.c:4941:mgs_set_conf_param()) Process leaving (rc=0 : 0 : 0)
Thu Mar  1 19:07:59 2018 [17187] (mgs_handler.c:1132:mgs_iocontrol()) Process leaving (rc=0 : 0 : 0)
Thu Mar  1 19:07:59 2018 [17187] (obd_class.h:1168:obd_iocontrol()) Process leaving (rc=0 : 0 : 0)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;There are a couple other call chains like that, none take longer than a second to complete.  Likewise MGS startup seems to start and end within a second&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1514:lustre_fill_super()) Process entered
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1516:lustre_fill_super()) VFS Op: sb ffff880078e06000
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:574:lustre_init_lsi()) Process entered
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:576:lustre_init_lsi()) kmalloced &apos;lsi&apos;: 984 at ffff880079cd7800.
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:579:lustre_init_lsi()) kmalloced &apos;lsi-&amp;gt;lsi_lmd&apos;: 104 at ffff880077d2b700.
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:597:lustre_init_lsi()) Process leaving (rc=18446612134357727232 : -131939351824384 : ffff880079cd7800)
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1247:lmd_parse()) Process entered
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1265:lmd_parse()) kmalloced &apos;lmd-&amp;gt;lmd_params&apos;: 4096 at ffff880049e40000.
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1107:lmd_parse_string()) kmalloced &apos;*handle&apos;: 8 at ffff880078338d08.
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1107:lmd_parse_string()) kmalloced &apos;*handle&apos;: 4 at ffff880078338cc8.
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1471:lmd_parse()) kmalloced &apos;lmd-&amp;gt;lmd_dev&apos;: 43 at ffff880078dd4f40.
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:931:lmd_print())   mount data:
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:934:lmd_print()) device:  zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1/MGS
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:935:lmd_print()) flags:   2200
...
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1578:lustre_fill_super()) Process leaving via out (rc=0 : 0 : 0x0)
Thu Mar  1 19:06:14 2018 [12364] (obd_mount.c:1585:lustre_fill_super()) Mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1/MGS complete
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;What conf_param are you setting that should return error?&lt;/p&gt;</comment>
                            <comment id="223008" author="brian" created="Fri, 9 Mar 2018 21:25:10 +0000"  >&lt;p&gt;Please find attached a &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29741/29741_debug-lctl-0-1520607668.log.xz&quot; title=&quot;debug-lctl-0-1520607668.log.xz attached to LU-10760&quot;&gt;lustre debug log&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;&#160;where I got:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lctl conf_param testfs.llite.max_cached_mb=16
No device found for name MGS: Invalid argument
This command must be run on the MGS.
error: conf_param: No such device
# echo $?
19
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Right after starting the MGS and where only a moment later the same command succeeded in this &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29742/29742_debug-lctl-1-1520607677.log&quot; title=&quot;debug-lctl-1-1520607677.log attached to LU-10760&quot;&gt;lustre debug log&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;.&lt;/p&gt;</comment>
                            <comment id="223346" author="utopiabound" created="Mon, 12 Mar 2018 17:29:31 +0000"  >&lt;p&gt;Looking at  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29741/29741_debug-lctl-0-1520607668.log.xz&quot; title=&quot;debug-lctl-0-1520607668.log.xz attached to LU-10760&quot;&gt;debug-lctl-0-1520607668.log.xz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;:&lt;br/&gt;
Modules loaded:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 09:44:49 2018 [ 1153] (api-ni.c:1462:lnet_startup_lndni()) Added LNI 10.14.82.96@tcp [8/256/0/180]
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;MGS Mounts:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 09:54:25 2018 [12151] (obd_mount.c:1514:lustre_fill_super()) Process entered
Fri Mar  9 09:54:25 2018 [12151] (obd_mount.c:1516:lustre_fill_super()) VFS Op: sb ffff88007a2ec000
...
Fri Mar  9 09:54:25 2018 [12151] (obd_mount.c:1578:lustre_fill_super()) Process leaving via out (rc=0 : 0 : 0x0)
Fri Mar  9 09:54:25 2018 [12151] (obd_mount.c:1585:lustre_fill_super()) Mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5/MGS complete
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;MDS Mounts:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 09:54:29 2018 [12419] (obd_mount.c:1514:lustre_fill_super()) Process entered
Fri Mar  9 09:54:29 2018 [12419] (obd_mount.c:1516:lustre_fill_super()) VFS Op: sb ffff88007acf7800
...
Fri Mar  9 09:54:31 2018 [12419] (obd_mount.c:1578:lustre_fill_super()) Process leaving via out (rc=0 : 0 : 0x0)
Fri Mar  9 09:54:31 2018 [12419] (obd_mount.c:1585:lustre_fill_super()) Mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3/testfs-MDT0000 complete
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;OST Mounts:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 09:55:15 2018 [14657] (obd_mount.c:1514:lustre_fill_super()) Process entered
Fri Mar  9 09:55:15 2018 [14657] (obd_mount.c:1516:lustre_fill_super()) VFS Op: sb ffff88007b29e000
...
Fri Mar  9 09:55:15 2018 [14657] (obd_mount_server.c:1757:osd_start()) Attempting to start testfs-OST0000, type=osd-ldiskfs, lsifl=200062, mountfl=0
...
Fri Mar  9 09:55:15 2018 [14657] (obd_mount.c:1578:lustre_fill_super()) Process leaving via out (rc=0 : 0 : 0x0)
Fri Mar  9 09:55:15 2018 [14657] (obd_mount.c:1585:lustre_fill_super()) Mount /dev/sde complete
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;&lt;tt&gt;# lctl conf_param testfs.llite.max_cached_mb=16&lt;/tt&gt; runs successfully&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 09:55:58 2018 [16654] (mgs_handler.c:1010:mgs_iocontrol()) Process entered
Fri Mar  9 09:55:58 2018 [16654] (mgs_handler.c:1011:mgs_iocontrol()) handling ioctl cmd 0x400866bb
...
Fri Mar  9 09:55:58 2018 [16654] (mgs_llog.c:4911:mgs_set_conf_param()) set_conf_param fs=&apos;testfs&apos; device=&apos;testfs&apos; param=&apos;llite.max_cached_mb=16&apos;
...
Fri Mar  9 09:55:58 2018 [16654] (linux-module.c:235:obd_class_release()) Process entered
Fri Mar  9 09:55:58 2018 [16654] (linux-module.c:238:obd_class_release()) Process leaving (rc=0 : 0 : 0)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;


&lt;p&gt;The failing lctl doesn&apos;t seem to show up in the debug log.  Can you run the initial (failing) lctl with &lt;tt&gt;strace -fytt&lt;/tt&gt;?&lt;/p&gt;</comment>
                            <comment id="223603" author="brian" created="Wed, 14 Mar 2018 14:15:19 +0000"  >&lt;blockquote&gt;
&lt;p&gt;Can you run the initial (failing) lctl with strace -fytt?&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;Please find attached the &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29819/29819_lctl-0-1521033811.41.strace&quot; title=&quot;lctl-0-1521033811.41.strace attached to LU-10760&quot;&gt;strace output&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; with the corresponding  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29818/29818_debug-lctl-0-1521033813.44.log.xz&quot; title=&quot;debug-lctl-0-1521033813.44.log.xz attached to LU-10760&quot;&gt;debug log&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;FWIW, the output of &lt;tt&gt;lctl dl&lt;/tt&gt; immediately after the failed:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lctl conf_param testfs.llite.max_cached_mb=16

No device found for name MGS: Invalid argument
This command must be run on the MGS.
error: conf_param: No such device
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;looked like this:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lctl dl
# echo $?
# 0
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="223648" author="utopiabound" created="Wed, 14 Mar 2018 21:05:13 +0000"  >&lt;p&gt;Here is the ioctl call via strace:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;24654 06:23:31.438314 ioctl(3&amp;lt;/dev/obd&amp;gt;, _IOC(_IOC_READ|_IOC_WRITE, 0x66, 0x7f, 0x08), 0x7fffef9492a0) = -1 EINVAL (Invalid argument)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Here is the corresponding debug log:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:226:obd_class_open()) Process entered
Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:229:obd_class_open()) Process leaving (rc=0 : 0 : 0)
Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:246:obd_class_ioctl()) Process entered
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:222:class_handle_ioctl()) Process entered
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:232:class_handle_ioctl()) cmd = c008667f
Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:155:obd_ioctl_getdata()) Process entered
Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:181:obd_ioctl_getdata()) kmalloced &apos;*buf&apos;: 584 at ffff880043251400.
Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:219:obd_ioctl_getdata()) Process leaving (rc=0 : 0 : 0)
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:191:class_resolve_dev_name()) Process entered
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:201:class_resolve_dev_name()) device name MGS
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:204:class_resolve_dev_name()) No device for name MGS!
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:205:class_resolve_dev_name()) Process leaving via out (rc=18446744073709551594 : -22 : 0xffffffffffffffea)
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:212:class_resolve_dev_name()) Process leaving (rc=18446744073709551594 : -22 : ffffffffffffffea)
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:299:class_handle_ioctl()) Process leaving via out (rc=18446744073709551594 : -22 : 0xffffffffffffffea)
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:426:class_handle_ioctl()) kfreed &apos;buf&apos;: 584 at ffff880043251400.
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:427:class_handle_ioctl()) Process leaving (rc=18446744073709551594 : -22 : ffffffffffffffea)
Wed Mar 14 09:23:31 2018 [24654] (linux-module.c:256:obd_class_ioctl()) Process leaving (rc=18446744073709551594 : -22 : ffffffffffffffea)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="223698" author="utopiabound" created="Thu, 15 Mar 2018 12:25:17 +0000"  >&lt;p&gt;Now that I know what to look for I&apos;ve found it in previous  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29741/29741_debug-lctl-0-1520607668.log.xz&quot; title=&quot;debug-lctl-0-1520607668.log.xz attached to LU-10760&quot;&gt;debug-lctl-0-1520607668.log.xz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; :&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 09:54:25 2018 [12151] (obd_config.c:372:class_attach()) Process entered
Fri Mar  9 09:54:25 2018 [12151] (genops.c:309:class_newdev()) Process entered
...
Fri Mar  9 09:54:25 2018 [12151] (genops.c:140:class_get_type()) Loaded module &apos;mgs&apos;
Fri Mar  9 09:54:25 2018 [12151] (genops.c:373:class_newdev()) Allocate new device MGS (ffff880049e88f78)
...
Fri Mar  9 09:54:25 2018 [12151] (genops.c:505:class_register_device()) Process leaving (rc=0 : 0 : 0)
Fri Mar  9 09:54:25 2018 [12151] (obd_config.c:430:class_attach()) OBD: dev 1 attached type mgs with refcount 1
Fri Mar  9 09:54:25 2018 [12151] (obd_config.c:432:class_attach()) Process leaving (rc=0 : 0 : 0)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;But then ioctl can&apos;t find it:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 10:01:08 2018 [25094] (class_obd.c:191:class_resolve_dev_name()) Process entered
Fri Mar  9 10:01:08 2018 [25094] (class_obd.c:201:class_resolve_dev_name()) device name MGS
Fri Mar  9 10:01:08 2018 [25094] (class_obd.c:204:class_resolve_dev_name()) No device for name MGS!
Fri Mar  9 10:01:08 2018 [25094] (class_obd.c:205:class_resolve_dev_name()) Process leaving via out (rc=18446744073709551594 : -22 : 0xffffffffffffffea)
Fri Mar  9 10:01:08 2018 [25094] (class_obd.c:212:class_resolve_dev_name()) Process leaving (rc=18446744073709551594 : -22 : ffffffffffffffea)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="223699" author="utopiabound" created="Thu, 15 Mar 2018 12:46:59 +0000"  >&lt;p&gt;Re-looking at  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29741/29741_debug-lctl-0-1520607668.log.xz&quot; title=&quot;debug-lctl-0-1520607668.log.xz attached to LU-10760&quot;&gt;debug-lctl-0-1520607668.log.xz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; :&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Fri Mar  9 10:00:35 2018 [24022] (obd_mount_server.c:1618:server_put_super()) server umount MGS complete
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The MGS is unmounting prior to you running conf param.&lt;/p&gt;

&lt;p&gt;Same with  &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/attachment/29818/29818_debug-lctl-0-1521033813.44.log.xz&quot; title=&quot;debug-lctl-0-1521033813.44.log.xz attached to LU-10760&quot;&gt;debug-lctl-0-1521033813.44.log.xz&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.whamcloud.com/images/icons/link_attachment_7.gif&quot; height=&quot;7&quot; width=&quot;7&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt;:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;Wed Mar 14 09:16:44 2018 [12129] (obd_mount_server.c:236:server_start_mgs()) Start MGS service MGS
...
Wed Mar 14 09:23:02 2018 [23609] (obd_mount_server.c:1618:server_put_super()) server umount MGS complete
...
Wed Mar 14 09:23:31 2018 [24654] (class_obd.c:204:class_resolve_dev_name()) No device for name MGS!
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="223700" author="brian" created="Thu, 15 Mar 2018 12:57:47 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.whamcloud.com/secure/ViewProfile.jspa?name=utopiabound&quot; class=&quot;user-hover&quot; rel=&quot;utopiabound&quot;&gt;utopiabound&lt;/a&gt;: Just want to say be careful.&#160; You might have realised this already, but just in case you haven&apos;t, there could be multiple MGS start/stop operations before the one where the subsequent &lt;tt&gt;lctl conf_parm&lt;/tt&gt; &lt;tt&gt;ioctl&lt;/tt&gt; can&apos;t find it.&#160; As in the case above, the almost 7 minutes between the MGS start and the &lt;b&gt;No device for name MGS&lt;/b&gt; seems much too long.&#160; The delay I am seeing seems to be on the order of about 10 seconds, not many minutes.&lt;/p&gt;</comment>
                            <comment id="223906" author="brian" created="Sat, 17 Mar 2018 10:59:14 +0000"  >&lt;p&gt;This turned out to be a subtle difference in the way Pacemaker reports resource status in RHEL 7.5 which was causing IML to report a device as started before it actually was.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="29741" name="debug-lctl-0-1520607668.log.xz" size="1722580" author="brian" created="Fri, 9 Mar 2018 21:16:22 +0000"/>
                            <attachment id="29818" name="debug-lctl-0-1521033813.44.log.xz" size="1785944" author="brian" created="Wed, 14 Mar 2018 13:55:15 +0000"/>
                            <attachment id="29742" name="debug-lctl-1-1520607677.log" size="496223" author="brian" created="Fri, 9 Mar 2018 21:23:45 +0000"/>
                            <attachment id="29666" name="debug-lctl-1519949571.log.xz" size="1690492" author="brian" created="Fri, 2 Mar 2018 19:53:49 +0000"/>
                            <attachment id="29819" name="lctl-0-1521033811.41.strace" size="8860" author="brian" created="Wed, 14 Mar 2018 13:55:11 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzztpb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>