<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:25:49 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9394] lu_object_find_try - kernel NULL pointer dereference</title>
                <link>https://jira.whamcloud.com/browse/LU-9394</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Soak cluster had completed MDS failover of MDT0002, lfsck was aborted on the MDS after timeout.&lt;br/&gt;
The soak-2 OSS node crashed while under load.&lt;br/&gt;
A crash dump is available on soak-2; dmesg is attached.&lt;br/&gt;
Unfortunately, console logging had failed, so some data is not available.&lt;br/&gt;
Syslog:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;Apr 22 15:02:44 soak-2 kernel: LustreError: Skipped 3 previous similar messages
Apr 22 15:02:44 soak-2 kernel: Lustre: soaked-MDT0002-lwp-OST0006: Connection restored to 192.168.1.110@o2ib10 (at 192.168.1.110@o2ib10)
Apr 22 15:02:44 soak-2 kernel: Lustre: Skipped 4 previous similar messages
Apr 22 15:03:09 soak-2 kernel: LustreError: 167-0: soaked-MDT0002-lwp-OST0012: This client was evicted by soaked-MDT0002; in progress operations using &lt;span class=&quot;code-keyword&quot;&gt;this&lt;/span&gt; service will fail.
Apr 22 15:03:09 soak-2 kernel: LustreError: Skipped 1 previous similar message
Apr 22 15:03:09 soak-2 kernel: Lustre: soaked-MDT0002-lwp-OST0012: Connection restored to 192.168.1.110@o2ib10 (at 192.168.1.110@o2ib10)
Apr 22 15:03:09 soak-2 kernel: Lustre: Skipped 1 previous similar message
Apr 22 15:07:11 soak-2 kernel: Lustre: soaked-OST0006: deleting orphan objects from 0x480000401:15286395 to 0x480000401:15292401
Apr 22 15:07:11 soak-2 kernel: Lustre: soaked-OST0000: deleting orphan objects from 0x300000400:15323501 to 0x300000400:15333313
Apr 22 15:07:11 soak-2 kernel: Lustre: soaked-OST000c: deleting orphan objects from 0x600000400:15327168 to 0x600000400:15338321
Apr 22 15:07:11 soak-2 kernel: Lustre: soaked-OST0012: deleting orphan objects from 0x740000402:15313619 to 0x740000402:15327665
Apr 22 15:21:56 soak-2 rsyslogd: [origin software=&lt;span class=&quot;code-quote&quot;&gt;&quot;rsyslogd&quot;&lt;/span&gt; swVersion=&lt;span class=&quot;code-quote&quot;&gt;&quot;7.4.7&quot;&lt;/span&gt; x-pid=&lt;span class=&quot;code-quote&quot;&gt;&quot;3806&quot;&lt;/span&gt; x-info=&lt;span class=&quot;code-quote&quot;&gt;&quot;http:&lt;span class=&quot;code-comment&quot;&gt;//www.rsyslog.com&quot;&lt;/span&gt;] start&lt;/span&gt;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Dmesg from vmcore:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;[32845.239323] Lustre: soaked-OST000c: deleting orphan objects from 0x600000400:15327168 to 0x600000400:15338321
[32845.239325] Lustre: soaked-OST0012: deleting orphan objects from 0x740000402:15313619 to 0x740000402:15327665
[33465.017357] LustreError: 15131:0:(osd_object.c:427:osd_object_init()) soaked-OST000c: lookup [0x1000c0000:0x21c57a8:0x0]/0x387ef2 failed: rc = 17
[33465.035118] BUG: unable to handle kernel NULL pointer dereference at 0000000000000011
[33465.045485] IP: [&amp;lt;ffffffffa0a590d8&amp;gt;] lu_object_find_try+0x178/0x2b0 [obdclass]
[33465.055153] PGD 0
[33465.058735] Oops: 0000 [#1] SMP
[33465.063750] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) 8021q garp mrp stp llc rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm mlx4_ib ib_core intel_powerclamp coretemp intel_rapl iosf_mbi kvm irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd dm_round_robin iTCO_wdt iTCO_vendor_support mei_me pcspkr ses ipmi_devintf enclosure mei ntb sg ipmi_si i2c_i801 lpc_ich sb_edac ipmi_msghandler ioatdma edac_core shpchp wmi nfsd dm_multipath nfs_acl dm_mod lockd grace auth_rpcgss sunrpc ip_tables ext4 mbcache
[33465.151994]  jbd2 zfs(POE) zunicode(POE) zavl(POE) zcommon(POE) znvpair(POE) spl(OE) zlib_deflate sd_mod crc_t10dif crct10dif_generic mlx4_en mgag200 drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops igb isci ttm ahci crct10dif_pclmul crct10dif_common libsas ptp crc32c_intel libahci pps_core mlx4_core drm mpt2sas dca libata raid_class i2c_algo_bit scsi_transport_sas devlink i2c_core fjes
[33465.195938] CPU: 15 PID: 15131 Comm: ldlm_cn01_018 Tainted: P           OE  ------------   3.10.0-514.10.2.el7_lustre.x86_64 #1
[33465.211995] Hardware name: Intel Corporation S2600GZ ........../S2600GZ, BIOS SE5C600.86B.01.08.0003.022620131521 02/26/2013
[33465.225976] task: ffff88042a2dbec0 ti: ffff88073b4f8000 task.ti: ffff88073b4f8000
[33465.236319] RIP: 0010:[&amp;lt;ffffffffa0a590d8&amp;gt;]  [&amp;lt;ffffffffa0a590d8&amp;gt;] lu_object_find_try+0x178/0x2b0 [obdclass]
[33465.249161] RSP: 0018:ffff88073b4fbaa8  EFLAGS: 00010207
[33465.256998] RAX: 0000000000000011 RBX: ffff880825cd6cc0 RCX: ffff8800ad22a918
[33465.266794] RDX: 00000001002a0021 RSI: ffffea0009502a80 RDI: ffff8800ad22a930
[33465.276797] RBP: ffff88073b4fbb08 R08: ffff8802540aaa80 R09: 00000001002a0020
[33465.286649] R10: 00000000540aad01 R11: ffffea0009502a80 R12: ffff8803fd4d5020
[33465.296218] R13: ffff8804b928e1f0 R14: 0000000000000000 R15: ffff88080b3d9000
[33465.305840] FS:  0000000000000000(0000) GS:ffff88082d9c0000(0000) knlGS:0000000000000000
[33465.316722] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[33465.324590] CR2: 0000000000000011 CR3: 00000000019ba000 CR4: 00000000000407e0
[33465.333919] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[33465.343445] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[33465.353428] Stack:
[33465.357274]  ffff88073b4fbc00 fffffffffffffffe ffff88073b4fbb30 000000000002246f
[33465.367105]  ffff8800ad22a800 ffff88000000000d 00000000a848b037 ffff88073b4fbc00
[33465.376714]  ffff88080b3d9000 ffff8803fd4d5020 0000000000000000 00000001000c0000
[33465.386284] Call Trace:
[33465.390755]  [&amp;lt;ffffffffa0a592bc&amp;gt;] lu_object_find_at+0xac/0xe0 [obdclass]
[33465.400101]  [&amp;lt;ffffffffa0fb27db&amp;gt;] ? ofd_key_init+0x3b/0xd0 [ofd]
[33465.408740]  [&amp;lt;ffffffffa0a59306&amp;gt;] lu_object_find+0x16/0x20 [obdclass]
[33465.417192]  [&amp;lt;ffffffffa0fca4e5&amp;gt;] ofd_object_find+0x35/0x100 [ofd]
[33465.425447]  [&amp;lt;ffffffffa0fd8d12&amp;gt;] ofd_lvbo_update+0x4b2/0xfc8 [ofd]
[33465.434020]  [&amp;lt;ffffffffa0fd7ca9&amp;gt;] ? ofd_lvbo_free+0x59/0xe0 [ofd]
[33465.442680]  [&amp;lt;ffffffffa0c566f2&amp;gt;] ldlm_request_cancel+0x132/0x720 [ptlrpc]
[33465.451967]  [&amp;lt;ffffffffa0c5dc6a&amp;gt;] ldlm_handle_cancel+0xba/0x250 [ptlrpc]
[33465.461254]  [&amp;lt;ffffffffa0c5df41&amp;gt;] ldlm_cancel_handler+0x141/0x490 [ptlrpc]
[33465.470567]  [&amp;lt;ffffffffa0c8df1b&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[33465.480874]  [&amp;lt;ffffffffa09359a8&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[33465.489808]  [&amp;lt;ffffffffa0c8bac8&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[33465.498533]  [&amp;lt;ffffffff810c5092&amp;gt;] ? default_wake_function+0x12/0x20
[33465.507323]  [&amp;lt;ffffffff810ba2e8&amp;gt;] ? __wake_up_common+0x58/0x90
[33465.515728]  [&amp;lt;ffffffffa0c91f40&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[33465.524220]  [&amp;lt;ffffffffa0c914a0&amp;gt;] ? ptlrpc_register_service+0xe60/0xe60 [ptlrpc]
[33465.534131]  [&amp;lt;ffffffff810b06ff&amp;gt;] kthread+0xcf/0xe0
[33465.540714]  [&amp;lt;ffffffff810b0630&amp;gt;] ? kthread_create_on_node+0x140/0x140
[33465.549134]  [&amp;lt;ffffffff81696c98&amp;gt;] ret_from_fork+0x58/0x90
[33465.556096]  [&amp;lt;ffffffff810b0630&amp;gt;] ? kthread_create_on_node+0x140/0x140
[33465.564386] Code: c6 62 e0 48 83 f8 fe 0f 85 54 ff ff ff 48 8b 7d a0 4c 89 f1 4c 89 e2 4c 89 fe e8 24 fb ff ff 48 3d 00 f0 ff ff 0f 87 36 ff ff ff &amp;lt;48&amp;gt; 8b 38 ba 10 00 00 00 4c 89 e6 48 89 45 a8 e8 04 80 8c e0 85
[33465.588258] RIP  [&amp;lt;ffffffffa0a590d8&amp;gt;] lu_object_find_try+0x178/0x2b0 [obdclass]
[33465.597871]  RSP &amp;lt;ffff88073b4fbaa8&amp;gt;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</description>
                <environment>Soak performance cluster. </environment>
        <key id="45698">LU-9394</key>
            <summary>lu_object_find_try - kernel NULL pointer dereference</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bzzz">Alex Zhuravlev</assignee>
                                    <reporter username="cliffw">Cliff White</reporter>
                        <labels>
                            <label>soak</label>
                    </labels>
                <created>Mon, 24 Apr 2017 21:22:42 +0000</created>
                <updated>Wed, 10 May 2017 17:24:37 +0000</updated>
                            <resolved>Tue, 9 May 2017 04:17:10 +0000</resolved>
                                    <version>Lustre 2.10.0</version>
                                    <fixVersion>Lustre 2.10.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                            <comments>
                            <comment id="193380" author="cliffw" created="Tue, 25 Apr 2017 15:13:44 +0000"  >&lt;p&gt;System crashes consistently with this error, stopping testing of this build on soak.&lt;/p&gt;</comment>
                            <comment id="193409" author="pjones" created="Tue, 25 Apr 2017 17:33:46 +0000"  >&lt;p&gt;Fan Yong&lt;/p&gt;

&lt;p&gt;Could you please advise on this one?&lt;/p&gt;

&lt;p&gt;Peter&lt;/p&gt;</comment>
                            <comment id="193656" author="jgmitter" created="Wed, 26 Apr 2017 18:22:51 +0000"  >&lt;p&gt;Hi Alex,&lt;/p&gt;

&lt;p&gt;Can you please take this issue?&lt;/p&gt;

&lt;p&gt;Thanks.&lt;br/&gt;
Joe&lt;/p&gt;</comment>
                            <comment id="193683" author="cliffw" created="Wed, 26 Apr 2017 21:55:37 +0000"  >&lt;p&gt;Attempting to test  &lt;a href=&quot;https://review.whamcloud.com/26751&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/26751&lt;/a&gt; - hit this bug again, appears to be the same. Hard crash on soak-2. The crash dump is available on the node.&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;[93807.458922] Lustre: 15422:0:(client.c:2115:ptlrpc_expire_one_request()) @@@ Request sent has timed out &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; slow reply: [sent 1493238567/real 1493238567]  req@ffff8800498d1800 x1565678189641584/t0(0) o105-&amp;gt;soaked-OST000c@192.168.1.138@o2ib100:15/16 lens 360/224 e 0 to 1 dl 1493238574 ref 1 fl Rpc:X/0/ffffffff rc 0/-1
[93807.496438] Lustre: 15422:0:(client.c:2115:ptlrpc_expire_one_request()) Skipped 5 previous similar messages
[95810.236195] LustreError: 16392:0:(osd_object.c:427:osd_object_init()) soaked-OST000c: lookup [0x1000c0000:0x2a8aea9:0x0]/0x505415 failed: rc = 17
[95810.253694] BUG: unable to handle kernel NULL pointer dereference at 0000000000000011
[95810.264019] IP: [&amp;lt;ffffffffa0a82038&amp;gt;] lu_object_find_try+0x178/0x2b0 [obdclass]
[95810.273549] PGD 0
[95810.277182] Oops: 0000 [#1] SMP
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="193899" author="cliffw" created="Fri, 28 Apr 2017 15:01:13 +0000"  >&lt;p&gt;Hit this again on three nodes, soak-&lt;span class=&quot;error&quot;&gt;&amp;#91;2-4&amp;#93;&lt;/span&gt;. Dropping this build on soak until I see a fix.&lt;/p&gt;</comment>
                            <comment id="193965" author="bzzz" created="Fri, 28 Apr 2017 22:16:11 +0000"  >&lt;p&gt;Cliff, was it ZFS on the OST?&lt;/p&gt;</comment>
                            <comment id="193968" author="gerrit" created="Fri, 28 Apr 2017 22:20:48 +0000"  >&lt;p&gt;Alex Zhuravlev (alexey.zhuravlev@intel.com) uploaded a new patch: &lt;a href=&quot;https://review.whamcloud.com/26893&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/26893&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9394&quot; title=&quot;lu_object_find_try - kernel NULL pointer dereference&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9394&quot;&gt;&lt;del&gt;LU-9394&lt;/del&gt;&lt;/a&gt; osd: __osd_obj2dnode() to return negative errors&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: c795f910ecbf2e955c94d761225010ccf541a6f8&lt;/p&gt;</comment>
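<!--
Editorial note, not part of the original ticket. One plausible reading of the traces above: the failed lookup reports rc = 17 (a positive EEXIST), the faulting address is 0x11 (decimal 17), and the patch subject asks for negative errors. The sketch below models the Linux kernel's ERR_PTR/IS_ERR convention in Python to show why a positive return code would slip past an error-pointer check. The helper names err_ptr and is_err are illustrative only, and whether lu_object_find_try performs exactly this check at the crash site is an assumption, not something the ticket states.

```python
# Minimal model of the Linux error-pointer convention (include/linux/err.h).
# Assumption: 64-bit pointers, MAX_ERRNO = 4095 as in the mainline kernel.
MAX_ERRNO = 4095
WORD_MASK = (1 << 64) - 1  # emulate a 64-bit unsigned long


def err_ptr(error: int) -> int:
    """Encode an errno as a pointer-sized value, in the spirit of ERR_PTR()."""
    return error & WORD_MASK


def is_err(ptr: int) -> bool:
    """Mirror IS_ERR(): only the top 4095 addresses count as errors."""
    return ptr >= ((-MAX_ERRNO) & WORD_MASK)


EEXIST = 17

negative = err_ptr(-EEXIST)  # the form an error-pointer check recognises
positive = err_ptr(EEXIST)   # what a positive rc becomes when cast to a pointer

print(hex(negative), is_err(negative))  # 0xffffffffffffffef True
print(hex(positive), is_err(positive))  # 0x11 False
```

Under this model the positive code passes the check as if it were a valid pointer, and dereferencing it faults at address 0x11, which matches the value reported in CR2 and RAX.
-->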
                            <comment id="193976" author="cliffw" created="Fri, 28 Apr 2017 22:38:56 +0000"  >&lt;p&gt;Yes, ZFS on OST, ldiskfs on MDT.&lt;/p&gt;</comment>
                            <comment id="194001" author="cliffw" created="Mon, 1 May 2017 14:52:17 +0000"  >&lt;p&gt;Ran the patch for 24 hours without lfsck, no issues.&lt;br/&gt;
Enabled lfsck, restarted soak.&lt;br/&gt;
The first MDS failover happened on soak-8; after soak-8 recovered, lfsck was triggered:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;2017-04-30 19:39:52,529:fsmgmt.fsmgmt:INFO     lfsck started on soak-8
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;System appears to have immediately wedged.&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;Apr 30 19:40:19 soak-8 kernel: NMI watchdog: BUG: soft lockup - CPU#2 stuck &lt;span class=&quot;code-keyword&quot;&gt;for&lt;/span&gt; 22s! [OI_scrub:4251]
Apr 30 19:40:19 soak-8 kernel: Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) zfs(POE) zunicode(POE) zavl(POE) zcommon(POE) znvpair(POE) spl(OE) zlib_deflate 8021q garp mrp stp llc rpcrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm mlx4_ib ib_core intel_powerclamp coretemp intel_rapl iosf_mbi kvm irqbypass crc32_pclmul dm_round_robin ghash_clmulni_intel aesni_intel ipmi_ssif ipmi_devintf sb_edac mei_me sg lrw gf128mul glue_helper ablk_helper cryptd ipmi_si iTCO_wdt edac_core ipmi_msghandler i2c_i801 mei iTCO_vendor_support lpc_ich pcspkr
Apr 30 19:40:19 soak-8 kernel: shpchp ioatdma dm_multipath dm_mod wmi nfsd nfs_acl lockd auth_rpcgss grace sunrpc ip_tables ext4 mbcache jbd2 sd_mod crc_t10dif crct10dif_generic mlx4_en mgag200 drm_kms_helper syscopyarea sysfillrect isci sysimgblt fb_sys_fops igb crct10dif_pclmul ahci ttm crct10dif_common mpt2sas ptp libahci libsas crc32c_intel raid_class pps_core drm mlx4_core scsi_transport_sas dca libata i2c_algo_bit ntb i2c_core devlink fjes
Apr 30 19:40:19 soak-8 kernel: CPU: 2 PID: 4251 Comm: OI_scrub Tainted: P           OE  ------------   3.10.0-514.16.1.el7_lustre.x86_64 #1
Apr 30 19:40:19 soak-8 kernel: Hardware name: Intel Corporation S2600GZ ........../S2600GZ, BIOS SE5C600.86B.01.08.0003.022620131521 02/26/2013
Apr 30 19:40:19 soak-8 kernel: task: ffff880418a22f10 ti: ffff8807ef304000 task.ti: ffff8807ef304000
Apr 30 19:40:19 soak-8 kernel: RIP: 0010:[&amp;lt;ffffffffa117779a&amp;gt;]  [&amp;lt;ffffffffa117779a&amp;gt;] ldiskfs_itable_unused_count+0x1a/0x30 [ldiskfs]
Apr 30 19:40:19 soak-8 kernel: RSP: 0018:ffff8807ef307d10  EFLAGS: 00000297
Apr 30 19:40:19 soak-8 kernel: RAX: 0000000000000000 RBX: ffff8807ef307d98 RCX: ffff88082b4e7800
Apr 30 19:40:19 soak-8 kernel: RDX: 00000000000072ce RSI: ffff880825d38a00 RDI: ffff88082b4e7000
Apr 30 19:40:19 soak-8 kernel: RBP: ffff8807ef307d10 R08: ffff8807ef307d57 R09: 0000000000000004
Apr 30 19:40:19 soak-8 kernel: R10: 00000000eaf1dc01 R11: ffffea001fabc740 R12: 0000000000000001
Apr 30 19:40:19 soak-8 kernel: R13: 0000000000000018 R14: ffff8807ef307c80 R15: 0000000000000202
Apr 30 19:40:19 soak-8 kernel: FS:  0000000000000000(0000) GS:ffff88042e080000(0000) knlGS:0000000000000000
Apr 30 19:40:19 soak-8 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 30 19:40:19 soak-8 kernel: CR2: 00007fb7eb77de00 CR3: 00000000019be000 CR4: 00000000000407e0
Apr 30 19:40:19 soak-8 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 30 19:40:19 soak-8 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Apr 30 19:40:19 soak-8 kernel: Stack:
Apr 30 19:40:19 soak-8 kernel: ffff8807ef307df0 ffffffffa1234ead ffffffffa1233880 ffff8804b960d46c
Apr 30 19:40:19 soak-8 kernel: ffffffffa122f7b0 ffff880420000000 ffff88082b4e7000 ffff8804b960d468
Apr 30 19:40:19 soak-8 kernel: 010000000000000c 0000000000000000 0000000000000000 ffff8804b960c000
Apr 30 19:40:19 soak-8 kernel: Call Trace:
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffffa1234ead&amp;gt;] osd_inode_iteration+0x21d/0xd90 [osd_ldiskfs]
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffffa1233880&amp;gt;] ? osd_ios_ROOT_scan+0x300/0x300 [osd_ldiskfs]
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffffa122f7b0&amp;gt;] ? osd_preload_next+0xc0/0xc0 [osd_ldiskfs]
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffffa1236380&amp;gt;] osd_scrub_main+0x960/0xf30 [osd_ldiskfs]
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffff810c54c0&amp;gt;] ? wake_up_state+0x20/0x20
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffffa1235a20&amp;gt;] ? osd_inode_iteration+0xd90/0xd90 [osd_ldiskfs]
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffff810b0a4f&amp;gt;] kthread+0xcf/0xe0
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffff810b0980&amp;gt;] ? kthread_create_on_node+0x140/0x140
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffff81697318&amp;gt;] ret_from_fork+0x58/0x90
Apr 30 19:40:19 soak-8 kernel: [&amp;lt;ffffffff810b0980&amp;gt;] ? kthread_create_on_node+0x140/0x140
Apr 30 19:40:19 soak-8 kernel: Code: 30 c1 e0 10 09 d0 5d c3 66 0f 1f 84 00 00 00 00 00 66 66 66 66 90 55 48 8b 8f a8 03 00 00 31 c0 0f b7 56 1c 48 89 e5 48 83 39 3f &amp;lt;76&amp;gt; 07 0f b7 46 32 c1 e0 10 09 d0 5d c3 66 0f 1f 84 00 00 00 00
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;soak-8 is now completely wedged and is available for examination.&lt;/p&gt;</comment>
                            <comment id="194149" author="yong.fan" created="Tue, 2 May 2017 14:21:27 +0000"  >&lt;blockquote&gt;
&lt;p&gt;Apr 30 19:40:19 soak-8 kernel: NMI watchdog: BUG: soft lockup - CPU#2 stuck for 22s! &lt;span class=&quot;error&quot;&gt;&amp;#91;OI_scrub:4251&amp;#93;&lt;/span&gt;&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;Cliff, had you applied the patch &lt;a href=&quot;https://review.whamcloud.com/#/c/26751/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/#/c/26751/&lt;/a&gt; when you hit the problem above?&lt;/p&gt;</comment>
                            <comment id="194156" author="cliffw" created="Tue, 2 May 2017 15:02:58 +0000"  >&lt;p&gt;No, I was only testing the single patch above.&lt;/p&gt;</comment>
                            <comment id="194162" author="cliffw" created="Tue, 2 May 2017 15:29:25 +0000"  >&lt;p&gt;Soak-8 system log attached&lt;/p&gt;</comment>
                            <comment id="194694" author="cliffw" created="Fri, 5 May 2017 15:59:27 +0000"  >&lt;p&gt;Moved this to tip of master. Same failure on soak-8&lt;/p&gt;</comment>
                            <comment id="195018" author="gerrit" created="Tue, 9 May 2017 03:45:48 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;https://review.whamcloud.com/26893/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://review.whamcloud.com/26893/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9394&quot; title=&quot;lu_object_find_try - kernel NULL pointer dereference&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9394&quot;&gt;&lt;del&gt;LU-9394&lt;/del&gt;&lt;/a&gt; osd: __osd_obj2dnode() to return negative errors&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: f2cba082ed605c2f69a254fb4ddb5a2787c1c085&lt;/p&gt;</comment>
                            <comment id="195042" author="pjones" created="Tue, 9 May 2017 04:17:10 +0000"  >&lt;p&gt;Landed for 2.10&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="45915">LU-9465</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="26454" name="soak-2.vmcore-dmesg.txt" size="240361" author="cliffw" created="Mon, 24 Apr 2017 21:22:30 +0000"/>
                            <attachment id="26549" name="soak-8.log.gz" size="169605" author="cliffw" created="Tue, 2 May 2017 15:29:12 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzzb3b:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>