<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:24:59 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
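<!--
The header comment above describes the 'field' request parameter. As a minimal
sketch of how such a URL could be built, assuming a hypothetical export URL
(the real request path is not given in this document) and a helper name,
restrict_fields, that is purely illustrative:

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def restrict_fields(url, fields):
    """Return url with one 'field=<name>' query pair appended per requested field."""
    parts = urlsplit(url)
    # Preserve any query parameters the URL already carries.
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.extend(("field", name) for name in fields)
    return urlunsplit(parts._replace(query=urlencode(query)))


# Hypothetical URL, used only for illustration.
print(restrict_fields("https://jira.example.invalid/issue.xml", ["key", "summary"]))
```

The parameter is repeated once per field, matching the 'field=key&field=summary'
form quoted in the header comment.
-->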
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-9304] BUG: Bad page state in process ll_ost_io01_013  pfn:1a01bcd kernel BUG at include/linux/scatterlist.h:65! </title>
                <link>https://jira.whamcloud.com/browse/LU-9304</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Running 4 Lustre clients, 2 OSS nodes (each with 1 zpool), and 1 MDS.&lt;br/&gt;
Pool status on this OSS node:&lt;/p&gt;
&lt;p&gt;# zpool status -v&lt;br/&gt;
  pool: ost0&lt;br/&gt;
 state: ONLINE&lt;br/&gt;
  scan: none requested&lt;br/&gt;
config:&lt;/p&gt;


&lt;p&gt;	NAME            STATE     READ WRITE CKSUM&lt;br/&gt;
	ost0            ONLINE       0     0     0&lt;br/&gt;
	  draid1-0 {any}  ONLINE       0     0     0&lt;br/&gt;
	    mpathaj     ONLINE       0     0     0&lt;br/&gt;
	    mpathai     ONLINE       0     0     0&lt;br/&gt;
	    mpathah     ONLINE       0     0     0&lt;br/&gt;
	    mpathag     ONLINE       0     0     0&lt;br/&gt;
	    mpathaq     ONLINE       0     0     0&lt;br/&gt;
	    mpathap     ONLINE       0     0     0&lt;br/&gt;
	    mpathak     ONLINE       0     0     0&lt;br/&gt;
	    mpathz      ONLINE       0     0     0&lt;br/&gt;
	    mpatham     ONLINE       0     0     0&lt;br/&gt;
	    mpathal     ONLINE       0     0     0&lt;br/&gt;
	    mpathao     ONLINE       0     0     0&lt;br/&gt;
	spares&lt;br/&gt;
	  $draid1-0-s0  AVAIL   &lt;/p&gt;

&lt;p&gt;errors: No known data errors&lt;/p&gt;

&lt;p&gt;This build of ZFS was from the coral-prototype branch, and Lustre was built from master as of Dec 1st.&lt;/p&gt;

&lt;p&gt;We were running our file system aging utility, FileAger.py (1-2 copies on each of the 4 client nodes), alongside an IOR run: mpirun -wdir /mnt/lustre/ -np 4 -rr -machinefile hosts -env I_MPI_EXTRA_FILESYSTEM=on -env I_MPI_EXTRA_FILESYSTEM_LIST=lustre /home/johnsali/wolf-3/ior/src/ior -a POSIX -F -N 4 -d 2 -i 1 -s 20000 -b 16MB -t 16MB -k -w -r&lt;/p&gt;

&lt;p&gt;While this was running, we hit the following failure.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;159898.950714&amp;#93;&lt;/span&gt; BUG: Bad page state in process ll_ost_io01_013  pfn:1a01bcd&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159898.960045&amp;#93;&lt;/span&gt; page:ffffea006806f340 count:-1 mapcount:0 mapping:          (null) index:0x0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159898.970667&amp;#93;&lt;/span&gt; page flags: 0x6fffff00000000()&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159898.976808&amp;#93;&lt;/span&gt; page dumped because: nonzero _count&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159898.983412&amp;#93;&lt;/span&gt; Modules linked in: nfsv3 nfs_acl raid10 osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) zfs(OE) zunicode(OE) zavl(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) sha512_generic crypto_null rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache xprtrdma sunrpc ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses dm_service_time enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd mpt3sas ipmi_devintf ipmi_ssif ipmi_si&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.072452&amp;#93;&lt;/span&gt;  raid_class sb_edac iTCO_wdt iTCO_vendor_support scsi_transport_sas sg edac_core pcspkr ipmi_msghandler wmi ioatdma mei_me mei lpc_ich shpchp i2c_i801 mfd_core acpi_pad acpi_power_meter dm_multipath dm_mod ip_tables ext4 mbcache jbd2 mlx4_ib mlx4_en ib_sa vxlan ib_mad ip6_udp_tunnel udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper crct10dif_pclmul igb crct10dif_common ttm ptp crc32c_intel ahci pps_core drm mlx4_core libahci dca i2c_algo_bit libata i2c_core &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: zunicode&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.135473&amp;#93;&lt;/span&gt; CPU: 57 PID: 98747 Comm: ll_ost_io01_013 Tainted: G          IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.149461&amp;#93;&lt;/span&gt; Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.162801&amp;#93;&lt;/span&gt;  ffffea006806f340 00000000424e76b3 ffff880f9e233908 ffffffff81636431&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.172821&amp;#93;&lt;/span&gt;  ffff880f9e233930 ffffffff81631645 ffffea006806f340 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.182870&amp;#93;&lt;/span&gt;  000fffff00000000 ffff880f9e233978 ffffffff811714dd fff00000fe000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.192895&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.197269&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81636431&amp;gt;&amp;#93;&lt;/span&gt; dump_stack+0x19/0x1b&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.204667&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81631645&amp;gt;&amp;#93;&lt;/span&gt; bad_page.part.59+0xdf/0xfc&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.212639&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff811714dd&amp;gt;&amp;#93;&lt;/span&gt; free_pages_prepare+0x16d/0x190&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.220965&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81171e21&amp;gt;&amp;#93;&lt;/span&gt; free_hot_cold_page+0x31/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.229171&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117200f&amp;gt;&amp;#93;&lt;/span&gt; __free_pages+0x3f/0x60&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.236690&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa100bad3&amp;gt;&amp;#93;&lt;/span&gt; osd_bufs_put+0x123/0x1f0 &lt;span class=&quot;error&quot;&gt;&amp;#91;osd_zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.245372&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa118284a&amp;gt;&amp;#93;&lt;/span&gt; ofd_commitrw_write+0xea/0x1c20 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.254234&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa1186f2d&amp;gt;&amp;#93;&lt;/span&gt; ofd_commitrw+0x51d/0xa40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.262551&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d538d5&amp;gt;&amp;#93;&lt;/span&gt; obd_commitrw+0x2ec/0x32f &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.271488&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d2bf71&amp;gt;&amp;#93;&lt;/span&gt; tgt_brw_write+0xea1/0x1640 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.280509&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810c15cc&amp;gt;&amp;#93;&lt;/span&gt; ? update_curr+0xcc/0x150&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.288372&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810be46e&amp;gt;&amp;#93;&lt;/span&gt; ? account_entity_dequeue+0xae/0xd0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.297010&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0c82560&amp;gt;&amp;#93;&lt;/span&gt; ? target_send_reply_msg+0x170/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.306746&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d28225&amp;gt;&amp;#93;&lt;/span&gt; tgt_request_handle+0x915/0x1320 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.316058&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd41ab&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_server_handle_request+0x21b/0xa90 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.326348&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0967128&amp;gt;&amp;#93;&lt;/span&gt; ? lc_watchdog_touch+0x68/0x180 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.335679&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd1d68&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_wait_event+0x98/0x340 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.345029&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810b8952&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x12/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.353394&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810af0b8&amp;gt;&amp;#93;&lt;/span&gt; ? __wake_up_common+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.361264&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd8260&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xaa0/0x1de0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.369596&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd77c0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_register_service+0xe40/0xe40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.379160&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5b8f&amp;gt;&amp;#93;&lt;/span&gt; kthread+0xcf/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.385881&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.394413&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81646a98&amp;gt;&amp;#93;&lt;/span&gt; ret_from_fork+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.401653&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;159899.410157&amp;#93;&lt;/span&gt; Disabling lock debugging due to kernel taint&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163012.964891&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.8@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000406:0x3c5:0x0&amp;#93;&lt;/span&gt; object 0x0:44785 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;67108864-80752639&amp;#93;&lt;/span&gt;: client csum 7f08fe36, server csum f8fbfe4c&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163012.990138&amp;#93;&lt;/span&gt; LustreError: Skipped 2 previous similar messages&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163020.008131&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.8@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000406:0x3d6:0x0&amp;#93;&lt;/span&gt; object 0x0:44794 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;83886080-100270079&amp;#93;&lt;/span&gt;: client csum 886feb33, server csum ccc0eb4a&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163042.829796&amp;#93;&lt;/span&gt; ------------[ cut here ]------------&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163042.837389&amp;#93;&lt;/span&gt; kernel BUG at include/linux/scatterlist.h:65!&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163042.845758&amp;#93;&lt;/span&gt; invalid opcode: 0000 &amp;#91;#1&amp;#93; SMP &lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163042.852645&amp;#93;&lt;/span&gt; Modules linked in: nfsv3 nfs_acl raid10 osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) zfs(OE) zunicode(OE) zavl(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) sha512_generic crypto_null rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache xprtrdma sunrpc ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses dm_service_time enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd mpt3sas ipmi_devintf ipmi_ssif ipmi_si&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163042.944819&amp;#93;&lt;/span&gt;  raid_class sb_edac iTCO_wdt iTCO_vendor_support scsi_transport_sas sg edac_core pcspkr ipmi_msghandler wmi ioatdma mei_me mei lpc_ich shpchp i2c_i801 mfd_core acpi_pad acpi_power_meter dm_multipath dm_mod ip_tables ext4 mbcache jbd2 mlx4_ib mlx4_en ib_sa vxlan ib_mad ip6_udp_tunnel udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper crct10dif_pclmul igb crct10dif_common ttm ptp crc32c_intel ahci pps_core drm mlx4_core libahci dca i2c_algo_bit libata i2c_core &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: zunicode&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.010335&amp;#93;&lt;/span&gt; CPU: 12 PID: 84956 Comm: ll_ost_io00_002 Tainted: G    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.025057&amp;#93;&lt;/span&gt; Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.038989&amp;#93;&lt;/span&gt; task: ffff880fc52bc500 ti: ffff880fc55bc000 task.ti: ffff880fc55bc000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.049639&amp;#93;&lt;/span&gt; RIP: 0010:&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0960fef&amp;gt;&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0960fef&amp;gt;&amp;#93;&lt;/span&gt; cfs_crypto_hash_update_page+0x9f/0xb0 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.063453&amp;#93;&lt;/span&gt; RSP: 0018:ffff880fc55bfab8  EFLAGS: 00010202&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.071687&amp;#93;&lt;/span&gt; RAX: 0000000000000002 RBX: ffff8810f6db9b80 RCX: 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.081918&amp;#93;&lt;/span&gt; RDX: 0000000000000020 RSI: 0000000000000000 RDI: ffff880fc55bfad8&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.092095&amp;#93;&lt;/span&gt; RBP: ffff880fc55bfb00 R08: 00000000000195a0 R09: ffff880fc55bfab8&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.103441&amp;#93;&lt;/span&gt; R10: ffff88103e807900 R11: 0000000000000001 R12: 3635343332313036&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.113462&amp;#93;&lt;/span&gt; R13: 0000000033323130 R14: 0000000000000534 R15: 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.123487&amp;#93;&lt;/span&gt; FS:  0000000000000000(0000) GS:ffff88103ef00000(0000) knlGS:0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.134599&amp;#93;&lt;/span&gt; CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.143101&amp;#93;&lt;/span&gt; CR2: 00007fce5afab000 CR3: 000000000194a000 CR4: 00000000001407e0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.153184&amp;#93;&lt;/span&gt; DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.163242&amp;#93;&lt;/span&gt; DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.173280&amp;#93;&lt;/span&gt; Stack:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.177580&amp;#93;&lt;/span&gt;  0000000000000002 0000000000000000 0000000000000000 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.188354&amp;#93;&lt;/span&gt;  00000000f43b381e 0000000000000000 ffff880fcc7d1301 ffff880e73ecc200&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.199140&amp;#93;&lt;/span&gt;  0000000000000000 ffff880fc55bfb68 ffffffffa0d5345c ffff88202563f0a8&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.209907&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.215455&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d5345c&amp;gt;&amp;#93;&lt;/span&gt; tgt_checksum_bulk.isra.33+0x35a/0x4e7 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.226242&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d2c21d&amp;gt;&amp;#93;&lt;/span&gt; tgt_brw_write+0x114d/0x1640 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.235986&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810c15cc&amp;gt;&amp;#93;&lt;/span&gt; ? update_curr+0xcc/0x150&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.244558&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810be46e&amp;gt;&amp;#93;&lt;/span&gt; ? account_entity_dequeue+0xae/0xd0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.254271&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0c82560&amp;gt;&amp;#93;&lt;/span&gt; ? target_send_reply_msg+0x170/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.264858&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d28225&amp;gt;&amp;#93;&lt;/span&gt; tgt_request_handle+0x915/0x1320 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.275043&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd41ab&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_server_handle_request+0x21b/0xa90 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.286074&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0967128&amp;gt;&amp;#93;&lt;/span&gt; ? lc_watchdog_touch+0x68/0x180 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.296175&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd1d68&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_wait_event+0x98/0x340 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.306194&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810b8952&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x12/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.315553&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810af0b8&amp;gt;&amp;#93;&lt;/span&gt; ? __wake_up_common+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.324714&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd8260&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xaa0/0x1de0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.334070&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0cd77c0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_register_service+0xe40/0xe40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.344635&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5b8f&amp;gt;&amp;#93;&lt;/span&gt; kthread+0xcf/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.352181&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.361606&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81646a98&amp;gt;&amp;#93;&lt;/span&gt; ret_from_fork+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.369571&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.378772&amp;#93;&lt;/span&gt; Code: 89 43 38 48 8b 43 20 ff 50 c0 48 8b 55 d8 65 48 33 14 25 28 00 00 00 75 0d 48 83 c4 28 5b 41 5c 41 5d 41 5e 5d c3 e8 61 a0 71 e0 &amp;lt;0f&amp;gt; 0b 66 66 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 &lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.406113&amp;#93;&lt;/span&gt; RIP  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0960fef&amp;gt;&amp;#93;&lt;/span&gt; cfs_crypto_hash_update_page+0x9f/0xb0 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;163043.416991&amp;#93;&lt;/span&gt;  RSP &amp;lt;ffff880fc55bfab8&amp;gt; &lt;/p&gt;

&lt;p&gt;This happened fairly quickly.  After this run I restarted the system and it happened again almost immediately.  &lt;/p&gt;</description>
                <environment>[root@wolf-3 10.8.1.3-2017-04-06-19:44:09]# rpm -qa |grep -i lustre &lt;br/&gt;
kmod-lustre-tests-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
lustre-tests-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
lustre-osd-zfs-mount-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
lustre-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
lustre-iokit-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
kmod-lustre-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
kmod-lustre-osd-zfs-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
lustre-debuginfo-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
[root@wolf-3 10.8.1.3-2017-04-06-19:44:09]# rpm -qa |grep -i zfs&lt;br/&gt;
libzfs2-0.7.0-rc3_29_g48659df.el7.centos.x86_64&lt;br/&gt;
kmod-zfs-0.7.0-rc3_29_g48659df.el7.centos.x86_64&lt;br/&gt;
zfs-debuginfo-0.7.0-rc3_29_g48659df.el7.centos.x86_64&lt;br/&gt;
lustre-osd-zfs-mount-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
zfs-0.7.0-rc3_29_g48659df.el7.centos.x86_64&lt;br/&gt;
zfs-test-0.7.0-rc3_29_g48659df.el7.centos.x86_64&lt;br/&gt;
kmod-lustre-osd-zfs-2.9.0_dirty-1.el7.centos.x86_64&lt;br/&gt;
zfs-kmod-debuginfo-0.7.0-rc3_29_g48659df.el7.centos.x86_64&lt;br/&gt;
&amp;nbsp;&lt;br/&gt;
4 Clients over IB to 2 OSS and 1 MDS. &lt;br/&gt;
&lt;br/&gt;
OSS each have 1 OST: &lt;br/&gt;
quick_oss1.sh:zpool create -f -o ashift=12 -o cachefile=none -O recordsize=16MB ost0 draid2 cfg=test_2_5_4_18_draidcfg.nvl mpathaa mpathab mpathac mpathad mpathae mpathaf mpathag mpathah mpathai mpathaj mpathak mpathal mpatham mpathan mpathao mpathap mpathaq mpathar &lt;br/&gt;
quick_oss1.sh:zpool status -v ost0&lt;br/&gt;
quick_oss1.sh:zpool feature@large_blocks=enabled ost0&lt;br/&gt;
quick_oss1.sh:zpool get all ost0 |grep large_blocks&lt;br/&gt;
quick_oss2.sh:zpool create -f -o ashift=12 -o cachefile=none -O recordsize=16MB ost1 draid2 cfg=test_2_5_4_18_draidcfg.nvl mpatha mpathb mpathc mpathd mpathe mpathf mpathg mpathh mpathi mpathj mpathk mpathl mpathm mpathn mpatho mpathp mpathq mpathr&lt;br/&gt;
quick_oss2.sh:zpool status -v ost1&lt;br/&gt;
quick_oss2.sh:zpool feature@large_blocks=enabled ost1&lt;br/&gt;
quick_oss2.sh:zpool get all ost1 |grep large_blocks</environment>
        <key id="42281">LU-9304</key>
            <summary>BUG: Bad page state in process ll_ost_io01_013  pfn:1a01bcd kernel BUG at include/linux/scatterlist.h:65! </summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="3">Duplicate</resolution>
                                        <assignee username="utopiabound">Nathaniel Clark</assignee>
                                    <reporter username="jsalians_intel">John Salinas</reporter>
                        <labels>
                            <label>LS_RZ</label>
                            <label>prod</label>
                    </labels>
                <created>Wed, 7 Dec 2016 19:35:33 +0000</created>
                <updated>Fri, 21 Apr 2017 19:12:38 +0000</updated>
                            <resolved>Fri, 21 Apr 2017 19:11:50 +0000</resolved>
                                                    <fixVersion>Lustre 2.10.0</fixVersion>
                                        <due>Fri, 24 Mar 2017 00:00:00 +0000</due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="176925" author="jsalians_intel" created="Wed, 7 Dec 2016 19:44:59 +0000"  >&lt;p&gt; 3.10.0-327.36.3.el7.x86_64&lt;/p&gt;

&lt;pre&gt;
/**
 * sg_assign_page - Assign a given page to an SG entry
 * @sg:             SG entry
 * @page:           The page
 *
 * Description:
 *   Assign page to sg entry. Also see sg_set_page(), the most commonly used
 *   variant.
 *
 **/
static inline void sg_assign_page(struct scatterlist *sg, struct page *page)
{
        unsigned long page_link = sg-&amp;gt;page_link &amp;amp; 0x3;

        /*
         * In order for the low bit stealing approach to work, pages
         * must be aligned at a 32-bit boundary as a minimum.
         */
        BUG_ON((unsigned long) page &amp;amp; 0x03);     /* &amp;lt;== here is line 65 */
#ifdef CONFIG_DEBUG_SG
        BUG_ON(sg-&amp;gt;sg_magic != SG_MAGIC);
        BUG_ON(sg_is_chain(sg));
#endif
        sg-&amp;gt;page_link = page_link | (unsigned long) page;
&lt;/pre&gt;
</comment>
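<!--
A rough illustration of the alignment check quoted in the comment above, written
in Python rather than kernel C. The BUG_ON at scatterlist.h:65 fires when either
of the two low bits of the page pointer is set, because page_link reuses those
bits as flags. The helper names and sample addresses below are hypothetical and
are not taken from the crash dump.

```python
LOW_BITS_MASK = 0x3  # the two low bits that page_link reuses as flags


def can_assign_page(page_addr):
    """Mirror BUG_ON((unsigned long) page & 0x03): True when the check passes."""
    return (page_addr & LOW_BITS_MASK) == 0


def assign_page(page_link, page_addr):
    """Keep the flag bits of page_link and store the page address in the rest."""
    if not can_assign_page(page_addr):
        # The kernel would hit BUG_ON here; this sketch raises instead.
        raise ValueError(f"page address {page_addr:#x} is not 4-byte aligned")
    return (page_link & LOW_BITS_MASK) | page_addr


# Sample values are made up for illustration.
print(can_assign_page(0x1000))        # aligned address passes the check
print(can_assign_page(0x1002))        # a set low bit would trip the BUG_ON
print(hex(assign_page(0x2, 0x1000)))  # existing flag bit 0x2 is preserved
```

A page pointer that fails this check suggests the pointer itself was invalid or
overwritten before it reached the scatterlist code, which is one reading of the
trace above, not a confirmed diagnosis.
-->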
                            <comment id="176927" author="jsalians_intel" created="Wed, 7 Dec 2016 19:46:40 +0000"  >&lt;p&gt;Dump is here: /scratch/dumps/wolf-3.wolf.hpdd.intel.com/10.8.1.3-2016-12-07-17:00:41&lt;/p&gt;</comment>
                            <comment id="185998" author="kalyana" created="Thu, 23 Feb 2017 17:49:33 +0000"  >&lt;p&gt;John Salinas to retest with new stack. &lt;/p&gt;</comment>
                            <comment id="191116" author="jsalians_intel" created="Fri, 7 Apr 2017 01:44:03 +0000"  >&lt;p&gt;Hit this again with Lustre 2.9.0 and ZFS RC3 + dRAID/Metadata Segregation: &lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.591117&amp;#93;&lt;/span&gt; BUG: Bad page state in process ll_ost_io01_020  pfn:1c17405&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.598572&amp;#93;&lt;/span&gt; page:ffffea00705d0140 count:-1 mapcount:0 mapping:          (null) index:0x0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.607674&amp;#93;&lt;/span&gt; page flags: 0x6fffff00000000()&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.612314&amp;#93;&lt;/span&gt; page dumped because: nonzero _count&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.617415&amp;#93;&lt;/span&gt; Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate ses dm_service_time enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas lrw gf128mul glue_helper ablk_helper cryptd raid_class scsi_transport_sas mei_me iTCO_wdt iTCO_vendor_support lpc_ich mei sg shpchp ipmi_ssif ipmi_devintf&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.698570&amp;#93;&lt;/span&gt;  sb_edac ioatdma ipmi_si edac_core mfd_core pcspkr i2c_i801 ipmi_msghandler wmi acpi_power_meter acpi_pad dm_multipath dm_mod nfsd nfs_acl lockd grace binfmt_misc auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt igb drm_kms_helper crct10dif_pclmul ttm crct10dif_common ptp crc32c_intel ahci pps_core mlx4_core drm libahci dca i2c_algo_bit libata i2c_core&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.753391&amp;#93;&lt;/span&gt; CPU: 21 PID: 62915 Comm: ll_ost_io01_020 Tainted: G          IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.767182&amp;#93;&lt;/span&gt; Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.780218&amp;#93;&lt;/span&gt;  ffffea00705d0140 00000000be615ee9 ffff880d1ee53908 ffffffff81636431&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.789972&amp;#93;&lt;/span&gt;  ffff880d1ee53930 ffffffff81631645 ffffea00705d0140 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.799723&amp;#93;&lt;/span&gt;  000fffff00000000 ffff880d1ee53978 ffffffff811714dd fff00000fe000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.809454&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.813567&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81636431&amp;gt;&amp;#93;&lt;/span&gt; dump_stack+0x19/0x1b&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.820677&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81631645&amp;gt;&amp;#93;&lt;/span&gt; bad_page.part.59+0xdf/0xfc&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.828342&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff811714dd&amp;gt;&amp;#93;&lt;/span&gt; free_pages_prepare+0x16d/0x190&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.836431&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81171e21&amp;gt;&amp;#93;&lt;/span&gt; free_hot_cold_page+0x31/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.844392&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117200f&amp;gt;&amp;#93;&lt;/span&gt; __free_pages+0x3f/0x60&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.851674&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0fbead3&amp;gt;&amp;#93;&lt;/span&gt; osd_bufs_put+0x123/0x1f0 &lt;span class=&quot;error&quot;&gt;&amp;#91;osd_zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.860120&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa10b884a&amp;gt;&amp;#93;&lt;/span&gt; ofd_commitrw_write+0xea/0x1c20 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.868709&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa10bcf2d&amp;gt;&amp;#93;&lt;/span&gt; ofd_commitrw+0x51d/0xa40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.876752&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0e2c8d2&amp;gt;&amp;#93;&lt;/span&gt; obd_commitrw+0x2ec/0x32f &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.885066&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0e04f71&amp;gt;&amp;#93;&lt;/span&gt; tgt_brw_write+0xea1/0x1640 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.893554&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810c15cc&amp;gt;&amp;#93;&lt;/span&gt; ? update_curr+0xcc/0x150&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.900939&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810be46e&amp;gt;&amp;#93;&lt;/span&gt; ? account_entity_dequeue+0xae/0xd0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.909327&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d5b560&amp;gt;&amp;#93;&lt;/span&gt; ? target_send_reply_msg+0x170/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.918682&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0e01225&amp;gt;&amp;#93;&lt;/span&gt; tgt_request_handle+0x915/0x1320 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.927668&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0dad1ab&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_server_handle_request+0x21b/0xa90 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.937769&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa09f2128&amp;gt;&amp;#93;&lt;/span&gt; ? lc_watchdog_touch+0x68/0x180 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.946877&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0daad68&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_wait_event+0x98/0x340 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.955969&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810b8952&amp;gt;&amp;#93;&lt;/span&gt; ? default_wake_function+0x12/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.964579&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810af0b8&amp;gt;&amp;#93;&lt;/span&gt; ? __wake_up_common+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.972520&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0db1260&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xaa0/0x1de0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.980768&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0db07c0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_register_service+0xe40/0xe40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.990532&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5b8f&amp;gt;&amp;#93;&lt;/span&gt; kthread+0xcf/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35316.997352&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35317.005890&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81646a98&amp;gt;&amp;#93;&lt;/span&gt; ret_from_fork+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35317.013064&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;35317.021494&amp;#93;&lt;/span&gt; Disabling lock debugging due to kernel taint&lt;/p&gt;

&lt;p&gt;/scratch/dumps/wolf-3.wolf.hpdd.intel.com/10.8.1.3-2017-04-06-19:44:09&lt;/p&gt;
</comment>
                            <comment id="191202" author="jsalians_intel" created="Fri, 7 Apr 2017 16:32:15 +0000"  >&lt;p&gt;Here is another one: &lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;85463.960467&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0xc8:0x0&amp;#93;&lt;/span&gt; object 0x0:493 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;50331648-66977791&amp;#93;&lt;/span&gt;: client csum 26eef72b, server csum 6a2afc80&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;85538.710838&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0x130:0x0&amp;#93;&lt;/span&gt; object 0x0:545 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;68812800-83886079&amp;#93;&lt;/span&gt;: client csum 7f41af68, server csum f877af67&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;85629.615262&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0x30e:0x0&amp;#93;&lt;/span&gt; object 0x0:783 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;67108864-82313215&amp;#93;&lt;/span&gt;: client csum bd02b56a, server csum 8f588935&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;85680.448461&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0x3df:0x0&amp;#93;&lt;/span&gt; object 0x0:887 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;67108864-81018879&amp;#93;&lt;/span&gt;: client csum 54933a67, server csum 31bca8f7&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87381.228273&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0x6d3:0x0&amp;#93;&lt;/span&gt; object 0x0:1265 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;83886080-100007935&amp;#93;&lt;/span&gt;: client csum c62adf42, server csum 47f2df45&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.618291&amp;#93;&lt;/span&gt; BUG: Bad page state in process ll_ost_io01_018  pfn:1fef99b&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.627834&amp;#93;&lt;/span&gt; page:ffffea007fbe66c0 count:-1 mapcount:0 mapping:          (null) index:0x0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.639074&amp;#93;&lt;/span&gt; page flags: 0x6fffff00000000()&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.645680&amp;#93;&lt;/span&gt; page dumped because: nonzero _count&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.652779&amp;#93;&lt;/span&gt; Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) sha512_ssse3 sha512_generic crypto_null rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache xprtrdma sunrpc ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure dm_service_time intel_powerclamp coretemp intel_rapl kvm_intel mpt3sas kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper sb_edac cryptd iTCO_wdt edac_core ipmi_devintf&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.743972&amp;#93;&lt;/span&gt;  ipmi_ssif mei_me raid_class sg iTCO_vendor_support scsi_transport_sas pcspkr mei ipmi_si ipmi_msghandler ioatdma shpchp lpc_ich i2c_i801 wmi mfd_core acpi_pad acpi_power_meter dm_multipath dm_mod ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt igb drm_kms_helper crct10dif_pclmul ahci crct10dif_common ttm ptp crc32c_intel libahci pps_core drm mlx4_core dca libata i2c_algo_bit i2c_core &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: zunicode&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.805273&amp;#93;&lt;/span&gt; CPU: 21 PID: 124934 Comm: ll_ost_io01_018 Tainted: G          IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.819123&amp;#93;&lt;/span&gt; Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.832223&amp;#93;&lt;/span&gt;  ffffea007fbe66c0 00000000140992fa ffff8800354cf908 ffffffff81636431&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.842024&amp;#93;&lt;/span&gt;  ffff8800354cf930 ffffffff81631645 ffffea007fbe66c0 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.851816&amp;#93;&lt;/span&gt;  000fffff00000000 ffff8800354cf978 ffffffff811714dd fff00000fe000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.861609&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.865810&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81636431&amp;gt;&amp;#93;&lt;/span&gt; dump_stack+0x19/0x1b&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.873020&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81631645&amp;gt;&amp;#93;&lt;/span&gt; bad_page.part.59+0xdf/0xfc&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.880805&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff811714dd&amp;gt;&amp;#93;&lt;/span&gt; free_pages_prepare+0x16d/0x190&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.888959&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81171e21&amp;gt;&amp;#93;&lt;/span&gt; free_hot_cold_page+0x31/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.897005&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8117200f&amp;gt;&amp;#93;&lt;/span&gt; __free_pages+0x3f/0x60&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.904375&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa13c0ad3&amp;gt;&amp;#93;&lt;/span&gt; osd_bufs_put+0x123/0x1f0 &lt;span class=&quot;error&quot;&gt;&amp;#91;osd_zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.912902&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa153d84a&amp;gt;&amp;#93;&lt;/span&gt; ofd_commitrw_write+0xea/0x1c20 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.921600&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa1541f2d&amp;gt;&amp;#93;&lt;/span&gt; ofd_commitrw+0x51d/0xa40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ofd&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.929762&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0da08d2&amp;gt;&amp;#93;&lt;/span&gt; obd_commitrw+0x2ec/0x32f &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.938190&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d78f71&amp;gt;&amp;#93;&lt;/span&gt; tgt_brw_write+0xea1/0x1640 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.946742&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810c15cc&amp;gt;&amp;#93;&lt;/span&gt; ? update_curr+0xcc/0x150&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.954201&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810be46e&amp;gt;&amp;#93;&lt;/span&gt; ? account_entity_dequeue+0xae/0xd0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.962643&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0ccf560&amp;gt;&amp;#93;&lt;/span&gt; ? target_send_reply_msg+0x170/0x170 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.972101&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d75225&amp;gt;&amp;#93;&lt;/span&gt; tgt_request_handle+0x915/0x1320 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.981134&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d211ab&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_server_handle_request+0x21b/0xa90 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87450.991008&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa09a4128&amp;gt;&amp;#93;&lt;/span&gt; ? lc_watchdog_touch+0x68/0x180 &lt;span class=&quot;error&quot;&gt;&amp;#91;libcfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.000321&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d1ed68&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_wait_event+0x98/0x340 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.009495&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d25260&amp;gt;&amp;#93;&lt;/span&gt; ptlrpc_main+0xaa0/0x1de0 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.018091&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa0d247c0&amp;gt;&amp;#93;&lt;/span&gt; ? ptlrpc_register_service+0xe40/0xe40 &lt;span class=&quot;error&quot;&gt;&amp;#91;ptlrpc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.027944&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5b8f&amp;gt;&amp;#93;&lt;/span&gt; kthread+0xcf/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.034889&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.043631&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81646a98&amp;gt;&amp;#93;&lt;/span&gt; ret_from_fork+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.051138&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;87451.059821&amp;#93;&lt;/span&gt; Disabling lock debugging due to kernel taint&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;88135.004640&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0x9c5:0x0&amp;#93;&lt;/span&gt; object 0x0:1640 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;67108864-83230719&amp;#93;&lt;/span&gt;: client csum d48fdf40, server csum 7834d05f&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;88167.103209&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0x9f5:0x0&amp;#93;&lt;/span&gt; object 0x0:1664 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;100663296-108920831&amp;#93;&lt;/span&gt;: client csum f45b7896, server csum 796e789a&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;88372.104154&amp;#93;&lt;/span&gt; LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode &lt;span class=&quot;error&quot;&gt;&amp;#91;0x200000405:0xae9:0x0&amp;#93;&lt;/span&gt; object 0x0:1785 extent &lt;span class=&quot;error&quot;&gt;&amp;#91;67108864-83099647&amp;#93;&lt;/span&gt;: client csum 63d944, server csum 990a54d0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.783421&amp;#93;&lt;/span&gt; ------------[ cut here ]------------&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.790964&amp;#93;&lt;/span&gt; WARNING: at lib/list_debug.c:59 __list_del_entry+0xa1/0xd0()&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.800675&amp;#93;&lt;/span&gt; list_del corruption. prev-&amp;gt;next should be ffffc906a3d0c010, but was 3635343332313036&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.812702&amp;#93;&lt;/span&gt; Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) sha512_ssse3 sha512_generic crypto_null rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache xprtrdma sunrpc ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure dm_service_time intel_powerclamp coretemp intel_rapl kvm_intel mpt3sas kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper sb_edac cryptd iTCO_wdt edac_core ipmi_devintf&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.906561&amp;#93;&lt;/span&gt;  ipmi_ssif mei_me raid_class sg iTCO_vendor_support scsi_transport_sas pcspkr mei ipmi_si ipmi_msghandler ioatdma shpchp lpc_ich i2c_i801 wmi mfd_core acpi_pad acpi_power_meter dm_multipath dm_mod ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt igb drm_kms_helper crct10dif_pclmul ahci crct10dif_common ttm ptp crc32c_intel libahci pps_core drm mlx4_core dca libata i2c_algo_bit i2c_core &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: zunicode&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.971373&amp;#93;&lt;/span&gt; CPU: 22 PID: 47821 Comm: z_wr_int_7 Tainted: G    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.985319&amp;#93;&lt;/span&gt; Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89192.999116&amp;#93;&lt;/span&gt;  ffff880fd3713bc8 00000000c561abf7 ffff880fd3713b80 ffffffff81636431&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.009585&amp;#93;&lt;/span&gt;  ffff880fd3713bb8 ffffffff8107b260 ffffc906a3d0c010 ffff88202372a660&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.020080&amp;#93;&lt;/span&gt;  0000000000000010 0000000000000000 ffff882013de9800 ffff880fd3713c20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.030560&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.035444&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81636431&amp;gt;&amp;#93;&lt;/span&gt; dump_stack+0x19/0x1b&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.043527&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8107b260&amp;gt;&amp;#93;&lt;/span&gt; warn_slowpath_common+0x70/0xb0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.052574&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8107b2fc&amp;gt;&amp;#93;&lt;/span&gt; warn_slowpath_fmt+0x5c/0x80&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.061337&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8130c6a1&amp;gt;&amp;#93;&lt;/span&gt; __list_del_entry+0xa1/0xd0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.069975&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8130c6dd&amp;gt;&amp;#93;&lt;/span&gt; list_del+0xd/0x30&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.077745&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f056d&amp;gt;&amp;#93;&lt;/span&gt; __spl_cache_flush+0xed/0x150 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.087183&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f0696&amp;gt;&amp;#93;&lt;/span&gt; spl_cache_flush+0x36/0x50 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.096324&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f15a2&amp;gt;&amp;#93;&lt;/span&gt; spl_kmem_cache_free+0x1c2/0x1d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.106221&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa11254fa&amp;gt;&amp;#93;&lt;/span&gt; zio_buf_free+0x5a/0x60 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.115119&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa104bba9&amp;gt;&amp;#93;&lt;/span&gt; abd_free+0x249/0x270 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.123765&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81013588&amp;gt;&amp;#93;&lt;/span&gt; ? __switch_to+0xf8/0x4b0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.133434&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa10db5f4&amp;gt;&amp;#93;&lt;/span&gt; vdev_raidz_map_free+0x34/0xd0 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.142998&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa10db6e9&amp;gt;&amp;#93;&lt;/span&gt; vdev_raidz_map_free_vsd+0x29/0x30 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.152927&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa11265ed&amp;gt;&amp;#93;&lt;/span&gt; zio_vdev_io_assess+0x4d/0x250 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.162466&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa112622c&amp;gt;&amp;#93;&lt;/span&gt; zio_execute+0x9c/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.171271&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f2ed6&amp;gt;&amp;#93;&lt;/span&gt; taskq_thread+0x246/0x470 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.180262&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810b8940&amp;gt;&amp;#93;&lt;/span&gt; ? wake_up_state+0x20/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.188773&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f2c90&amp;gt;&amp;#93;&lt;/span&gt; ? taskq_thread_spawn+0x60/0x60 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.198360&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5b8f&amp;gt;&amp;#93;&lt;/span&gt; kthread+0xcf/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.206072&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.215629&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81646a98&amp;gt;&amp;#93;&lt;/span&gt; ret_from_fork+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.223914&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.233417&amp;#93;&lt;/span&gt; ---[ end trace c1da4e4c37ad9549 ]---&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.409308&amp;#93;&lt;/span&gt; general protection fault: 0000 &amp;#91;#1&amp;#93; SMP &lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.416842&amp;#93;&lt;/span&gt; Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) sha512_ssse3 sha512_generic crypto_null rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache xprtrdma sunrpc ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure dm_service_time intel_powerclamp coretemp intel_rapl kvm_intel mpt3sas kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper sb_edac cryptd iTCO_wdt edac_core ipmi_devintf&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.509290&amp;#93;&lt;/span&gt;  ipmi_ssif mei_me raid_class sg iTCO_vendor_support scsi_transport_sas pcspkr mei ipmi_si ipmi_msghandler ioatdma shpchp lpc_ich i2c_i801 wmi mfd_core acpi_pad acpi_power_meter dm_multipath dm_mod ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt igb drm_kms_helper crct10dif_pclmul ahci crct10dif_common ttm ptp crc32c_intel libahci pps_core drm mlx4_core dca libata i2c_algo_bit i2c_core &lt;span class=&quot;error&quot;&gt;&amp;#91;last unloaded: zunicode&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.573354&amp;#93;&lt;/span&gt; CPU: 37 PID: 86386 Comm: z_wr_int_7 Tainted: G    B   W IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.587115&amp;#93;&lt;/span&gt; Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.600690&amp;#93;&lt;/span&gt; task: ffff881ecece7300 ti: ffff88176c4c4000 task.ti: ffff88176c4c4000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.610926&amp;#93;&lt;/span&gt; RIP: 0010:&lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8130c54f&amp;gt;&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8130c54f&amp;gt;&amp;#93;&lt;/span&gt; __list_add+0xf/0xc0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.621652&amp;#93;&lt;/span&gt; RSP: 0018:ffff88176c4c7c30  EFLAGS: 00010086&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.629539&amp;#93;&lt;/span&gt; RAX: 0000000000380000 RBX: ffffc906a8127000 RCX: 0000000000000004&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.639440&amp;#93;&lt;/span&gt; RDX: 3130363534333231 RSI: ffffc906a8127020 RDI: ffffc906a9d2f018&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.649298&amp;#93;&lt;/span&gt; RBP: ffff88176c4c7c48 R08: 0000000000000000 R09: 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.659138&amp;#93;&lt;/span&gt; R10: 0000000000000007 R11: 0000000000000000 R12: 3130363534333231&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.668948&amp;#93;&lt;/span&gt; R13: ffffc906a8127020 R14: 0000000000000000 R15: ffff882013de9800&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.678753&amp;#93;&lt;/span&gt; FS:  0000000000000000(0000) GS:ffff88103f0c0000(0000) knlGS:0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.689643&amp;#93;&lt;/span&gt; CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.697898&amp;#93;&lt;/span&gt; CR2: 00007f2413fce000 CR3: 000000000194a000 CR4: 00000000001407e0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.707726&amp;#93;&lt;/span&gt; DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.717569&amp;#93;&lt;/span&gt; DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.727403&amp;#93;&lt;/span&gt; Stack:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.731485&amp;#93;&lt;/span&gt;  ffffc906a8127000 ffff8810252800c0 0000000000000010 ffff88176c4c7c98&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.741719&amp;#93;&lt;/span&gt;  ffffffffa04f0535 0000000200a3c286 0000003e862d74d4 ffff882013de98a0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.751956&amp;#93;&lt;/span&gt;  ffff882013de98b8 ffff882013de9800 ffff8810252800c0 0000000000000002&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.762205&amp;#93;&lt;/span&gt; Call Trace:&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.766851&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f0535&amp;gt;&amp;#93;&lt;/span&gt; __spl_cache_flush+0xb5/0x150 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.775877&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f0696&amp;gt;&amp;#93;&lt;/span&gt; spl_cache_flush+0x36/0x50 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.784617&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f15a2&amp;gt;&amp;#93;&lt;/span&gt; spl_kmem_cache_free+0x1c2/0x1d0 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.793997&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa11254fa&amp;gt;&amp;#93;&lt;/span&gt; zio_buf_free+0x5a/0x60 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.802468&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa104bba9&amp;gt;&amp;#93;&lt;/span&gt; abd_free+0x249/0x270 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.810746&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81013588&amp;gt;&amp;#93;&lt;/span&gt; ? __switch_to+0xf8/0x4b0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.819797&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa10db5f4&amp;gt;&amp;#93;&lt;/span&gt; vdev_raidz_map_free+0x34/0xd0 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.828971&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa10db6e9&amp;gt;&amp;#93;&lt;/span&gt; vdev_raidz_map_free_vsd+0x29/0x30 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.838527&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa11265ed&amp;gt;&amp;#93;&lt;/span&gt; zio_vdev_io_assess+0x4d/0x250 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.847696&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa112622c&amp;gt;&amp;#93;&lt;/span&gt; zio_execute+0x9c/0x100 &lt;span class=&quot;error&quot;&gt;&amp;#91;zfs&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.856147&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f2ed6&amp;gt;&amp;#93;&lt;/span&gt; taskq_thread+0x246/0x470 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.864781&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810b8940&amp;gt;&amp;#93;&lt;/span&gt; ? wake_up_state+0x20/0x20&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.872946&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffffa04f2c90&amp;gt;&amp;#93;&lt;/span&gt; ? taskq_thread_spawn+0x60/0x60 &lt;span class=&quot;error&quot;&gt;&amp;#91;spl&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.882186&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5b8f&amp;gt;&amp;#93;&lt;/span&gt; kthread+0xcf/0xe0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.889553&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.898764&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff81646a98&amp;gt;&amp;#93;&lt;/span&gt; ret_from_fork+0x58/0x90&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.906707&amp;#93;&lt;/span&gt;  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff810a5ac0&amp;gt;&amp;#93;&lt;/span&gt; ? kthread_create_on_node+0x140/0x140&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.915871&amp;#93;&lt;/span&gt; Code: 48 89 df e8 f4 45 eb ff b8 f4 ff ff ff e9 4a ff ff ff b8 f4 ff ff ff e9 40 ff ff ff 55 48 89 e5 41 55 49 89 f5 41 54 49 89 d4 53 &amp;lt;4c&amp;gt; 8b 42 08 48 89 fb 49 39 f0 75 2a 4d 8b 45 00 4d 39 c4 75 68 &lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.942003&amp;#93;&lt;/span&gt; RIP  &lt;span class=&quot;error&quot;&gt;&amp;#91;&amp;lt;ffffffff8130c54f&amp;gt;&amp;#93;&lt;/span&gt; __list_add+0xf/0xc0&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;89193.949893&amp;#93;&lt;/span&gt;  RSP &amp;lt;ffff88176c4c7c30&amp;gt; &lt;/p&gt;

&lt;p&gt;/scratch/dumps/wolf-4.wolf.hpdd.intel.com/10.8.1.4-2017-04-06-14:08:00&lt;/p&gt;</comment>
                            <comment id="191909" author="jgmitter" created="Thu, 13 Apr 2017 17:40:12 +0000"  >&lt;p&gt;Hi Nate,&lt;/p&gt;

&lt;p&gt;Can you please look into this one?  We thought on the triage call that this could be a duplicate of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9279&quot; title=&quot;coral-beta-combined build 124 kernel BUG at include/linux/scatterlist.h:65! invalid opcode: 0000 [#1] SMP&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9279&quot;&gt;&lt;del&gt;LU-9279&lt;/del&gt;&lt;/a&gt;.  Do you agree?&lt;/p&gt;

&lt;p&gt;Thanks.&lt;br/&gt;
Joe&lt;/p&gt;</comment>
                            <comment id="191984" author="utopiabound" created="Thu, 13 Apr 2017 21:22:59 +0000"  >&lt;p&gt;Yes,  looking at this, I would assume they come from the same root cause.&lt;/p&gt;</comment>
                            <comment id="191986" author="jsalians_intel" created="Thu, 13 Apr 2017 21:24:12 +0000"  >&lt;p&gt;Oh good, we have a dump for this one!&lt;/p&gt;</comment>
                            <comment id="192084" author="utopiabound" created="Fri, 14 Apr 2017 12:18:10 +0000"  >&lt;p&gt;How can I get a copy?  I don&apos;t have a login to wolf currently.&lt;/p&gt;</comment>
                            <comment id="192089" author="jsalians_intel" created="Fri, 14 Apr 2017 12:35:48 +0000"  >&lt;p&gt;Which clusters do you have a login for? I will copy it over to NFS on that cluster.&lt;/p&gt;</comment>
                            <comment id="192090" author="utopiabound" created="Fri, 14 Apr 2017 13:14:19 +0000"  >&lt;p&gt;I have logins to Onyx and Lola.&lt;/p&gt;</comment>
                            <comment id="192102" author="jsalians_intel" created="Fri, 14 Apr 2017 15:09:23 +0000"  >&lt;p&gt;On Onyx:  $ ls -lart /scratch/johnsali/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9304&quot; title=&quot;BUG: Bad page state in process ll_ost_io01_013  pfn:1a01bcd kernel BUG at include/linux/scatterlist.h:65! &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9304&quot;&gt;&lt;del&gt;LU-9304&lt;/del&gt;&lt;/a&gt;.tgz &lt;br/&gt;
-rwxr-xr-x 1 johnsali johnsali 815773487 Apr 14 07:28 /scratch/johnsali/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9304&quot; title=&quot;BUG: Bad page state in process ll_ost_io01_013  pfn:1a01bcd kernel BUG at include/linux/scatterlist.h:65! &quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9304&quot;&gt;&lt;del&gt;LU-9304&lt;/del&gt;&lt;/a&gt;.tgz&lt;/p&gt;</comment>
                            <comment id="192145" author="utopiabound" created="Fri, 14 Apr 2017 21:42:36 +0000"  >&lt;p&gt;Could the initial dump of &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9279&quot; title=&quot;coral-beta-combined build 124 kernel BUG at include/linux/scatterlist.h:65! invalid opcode: 0000 [#1] SMP&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9279&quot;&gt;&lt;del&gt;LU-9279&lt;/del&gt;&lt;/a&gt; be truncated, and could there be a double free prior to the bad page pointer?  That would actually make more sense for a failure scenario.&lt;/p&gt;</comment>
                            <comment id="192148" author="jsalians_intel" created="Fri, 14 Apr 2017 22:03:43 +0000"  >&lt;p&gt;I don&apos;t remember seeing &quot;BUG: Bad page state in process&quot; in &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-9279&quot; title=&quot;coral-beta-combined build 124 kernel BUG at include/linux/scatterlist.h:65! invalid opcode: 0000 [#1] SMP&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-9279&quot;&gt;&lt;del&gt;LU-9279&lt;/del&gt;&lt;/a&gt;, but it was a month ago, so anything is possible.  &lt;/p&gt;

&lt;p&gt;None of the traces here look ZFS related &amp;#8211; can you give us any hint on where to look?&lt;/p&gt;
</comment>
                            <comment id="192179" author="jsalians_intel" created="Sat, 15 Apr 2017 13:59:27 +0000"  >&lt;p&gt;Lustre 2.9.0 + 0.7.0 RC3 (none of our patches), record size 1M on OST0 and 16M on OST1, brw_size=16 on both raidz &amp;#8211; messages but no crash. Manual dumps: 10.8.1.4-2017-04-15-00:26:17  10.8.1.4-2017-04-15-01:47:43  10.8.1.3-2017-04-15-13:22:45 10.8.1.4-2017-04-15-13:22:47&lt;/p&gt;

&lt;p&gt;wolf-4 OSS&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[  163.434692] Lustre: lsdraid-OST0001: Recovery over after 0:06, of 5 clients 5 recovered and 0 were evicted.
[  163.480746] Lustre: lsdraid-OST0001: deleting orphan objects from 0x0:720 to 0x0:1025
[  370.631336] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x3b0:0x0] object 0x0:1225 extent [83886080-92680191]: client csum d5f42113, server csum 1a89e99c
[  480.339896] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x49c:0x0] object 0x0:4041 extent [33554432-47890431]: client csum e47bcdcb, server csum 86becdcf
[  488.890964] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x4ae:0x0] object 0x0:5107 extent [67108864-73793535]: client csum b74b30df, server csum 20c030ec
[  509.914190] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x52f:0x0] object 0x0:6348 extent [33554432-43007999]: client csum cbc76f28, server csum 4b241635
[  539.505532] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x5be:0x0] object 0x0:7700 extent [67108864-78381055]: client csum b6e2021c, server csum c5ce4f88
[  560.736133] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x5f1:0x0] object 0x0:8747 extent [67108864-81104895]: client csum ddc22e54, server csum 894f5e1a
[  618.743576] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x6d0:0x0] object 0x0:11762 extent [67108864-81694719]: client csum 734e4939, server csum 175394a5
[  618.764867] LustreError: Skipped 1 previous similar message
[ 1080.395798] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x7fa:0x0] object 0x0:14839 extent [40140800-50331647]: client csum 937c50bf, server csum f71e2e65
[ 1080.417120] LustreError: Skipped 2 previous similar messages
[ 3001.142322] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0xd10:0x0] object 0x0:49284 extent [100663296-108527615]: client csum ab9466a8, server csum 10b4e228
[ 3400.563954] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0xfb0:0x0] object 0x0:54388 extent [67108864-82837503]: client csum 71e8cd52, server csum 35becd53
[ 3461.970072] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x1052:0x0] object 0x0:55534 extent [67108864-74973183]: client csum c0a766ab, server csum ab5a66bb
[ 3762.672549] BUG: Bad page state in process ll_ost_io01_003  pfn:182ec6d
[ 3762.680002] page:ffffea0060bb1b40 count:-1 mapcount:0 mapping:          (null) index:0x0
[ 3762.689091] page flags: 0x6fffff00000000()
[ 3762.693727] page dumped because: nonzero _count
[ 3762.700757] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm zfs(POE) zunicode(POE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) zlib_deflate ses dm_service_time enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas lrw gf128mul glue_helper ablk_helper cryptd raid_class scsi_transport_sas mei_me iTCO_wdt ipmi_ssif iTCO_vendor_support mei ipmi_devintf sb_edac sg
[ 3762.790920]  ioatdma lpc_ich shpchp edac_core pcspkr i2c_i801 ipmi_si mfd_core ipmi_msghandler acpi_pad acpi_power_meter wmi nfsd dm_multipath dm_mod auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb crct10dif_pclmul ptp crct10dif_common ttm ahci crc32c_intel pps_core mlx4_core libahci drm dca i2c_algo_bit libata i2c_core
[ 3762.850233] CPU: 31 PID: 9096 Comm: ll_ost_io01_003 Tainted: P          IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[ 3762.864178] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[ 3762.877501]  ffffea0060bb1b40 000000006cbfa991 ffff880fd6a47908 ffffffff81636431
[ 3762.887516]  ffff880fd6a47930 ffffffff81631645 ffffea0060bb1b40 0000000000000000
[ 3762.897491]  000fffff00000000 ffff880fd6a47978 ffffffff811714dd fff00000fe000000
[ 3762.907458] Call Trace:
[ 3762.912046]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[ 3762.919394]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[ 3762.927333]  [&amp;lt;ffffffff811714dd&amp;gt;] free_pages_prepare+0x16d/0x190
[ 3762.935630]  [&amp;lt;ffffffff81171e21&amp;gt;] free_hot_cold_page+0x31/0x140
[ 3762.943790]  [&amp;lt;ffffffff8117200f&amp;gt;] __free_pages+0x3f/0x60
[ 3762.951264]  [&amp;lt;ffffffffa0fa1ad3&amp;gt;] osd_bufs_put+0x123/0x1f0 [osd_zfs]
[ 3762.959874]  [&amp;lt;ffffffffa109b84a&amp;gt;] ofd_commitrw_write+0xea/0x1c20 [ofd]
[ 3762.968646]  [&amp;lt;ffffffffa109ff2d&amp;gt;] ofd_commitrw+0x51d/0xa40 [ofd]
[ 3762.976868]  [&amp;lt;ffffffffa0e0f8d2&amp;gt;] obd_commitrw+0x2ec/0x32f [ptlrpc]
[ 3762.985338]  [&amp;lt;ffffffffa0de7f71&amp;gt;] tgt_brw_write+0xea1/0x1640 [ptlrpc]
[ 3762.993957]  [&amp;lt;ffffffffa0d3e560&amp;gt;] ? target_send_reply_msg+0x170/0x170 [ptlrpc]
[ 3763.003453]  [&amp;lt;ffffffffa0de4225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[ 3763.012530]  [&amp;lt;ffffffffa0d901ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[ 3763.022429]  [&amp;lt;ffffffffa0a33128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[ 3763.031354]  [&amp;lt;ffffffffa0d8dd68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[ 3763.040220]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[ 3763.048476]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[ 3763.056267]  [&amp;lt;ffffffffa0d94260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[ 3763.064562]  [&amp;lt;ffffffffa0d937c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[ 3763.074037]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[ 3763.080685]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 3763.089162]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[ 3763.096349]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 3855.476573] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x12e3:0x0] object 0x0:58439 extent [67108864-82837503]: client csum 71e8cd52, server csum 14e5cd5e
[ 3923.650281] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x13bc:0x0] object 0x0:59171 extent [33554432-48742399]: client csum 9005f4a9, server csum db87ac4c
[ 5698.551136] perf interrupt took too long (2521 &amp;gt; 2500), lowering kernel.perf_event_max_sample_rate to 50000
[ 5904.311835] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x1734:0x0] object 0x0:66681 extent [67108864-80281599]: client csum 1eaa58ca, server csum 44a378f0
[ 8708.045614] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x1d31:0x0] object 0x0:67733 extent [121729024-134217727]: client csum 99efe98c, server csum e23d22e1
[ 9738.442312] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x2051:0x0] object 0x0:68278 extent [100663296-116666367]: client csum d42f69dc, server csum 8732074f
[10448.854337] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x237a:0x0] object 0x0:68809 extent [100663296-112549887]: client csum 7a8b3e1a, server csum 1dbd0291
[10480.902373] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x2396:0x0] object 0x0:68834 extent [85426176-100663295]: client csum f43a36f0, server csum 9d10e702
[11720.767365] BUG: Bad page state in process ll_ost_io01_001  pfn:15d132f
[11720.777259] page:ffffea005744cbc0 count:-1 mapcount:0 mapping:          (null) index:0x0
[11720.788693] page flags: 0x6fffff00000000()
[11720.795463] page dumped because: nonzero _count
[11720.802596] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm zfs(POE) zunicode(POE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) zlib_deflate ses dm_service_time enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas lrw gf128mul glue_helper ablk_helper cryptd raid_class scsi_transport_sas mei_me iTCO_wdt ipmi_ssif iTCO_vendor_support mei ipmi_devintf sb_edac sg
[11720.893130]  ioatdma lpc_ich shpchp edac_core pcspkr i2c_i801 ipmi_si mfd_core ipmi_msghandler acpi_pad acpi_power_meter wmi nfsd dm_multipath dm_mod auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb crct10dif_pclmul ptp crct10dif_common ttm ahci crc32c_intel pps_core mlx4_core libahci drm dca i2c_algo_bit libata i2c_core
[11720.951749] CPU: 35 PID: 8509 Comm: ll_ost_io01_001 Tainted: P    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[11720.965393] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[11720.978463]  ffffea005744cbc0 00000000a971f860 ffff880fdb6bf908 ffffffff81636431
[11720.988249]  ffff880fdb6bf930 ffffffff81631645 ffffea005744cbc0 0000000000000000
[11720.998053]  000fffff00000000 ffff880fdb6bf978 ffffffff811714dd fff00000fe000000
[11721.007838] Call Trace:
[11721.012009]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[11721.019195]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[11721.026948]  [&amp;lt;ffffffff811714dd&amp;gt;] free_pages_prepare+0x16d/0x190
[11721.035167]  [&amp;lt;ffffffff81171e21&amp;gt;] free_hot_cold_page+0x31/0x140
[11721.043294]  [&amp;lt;ffffffff8117200f&amp;gt;] __free_pages+0x3f/0x60
[11721.050752]  [&amp;lt;ffffffffa0fa1ad3&amp;gt;] osd_bufs_put+0x123/0x1f0 [osd_zfs]
[11721.059372]  [&amp;lt;ffffffffa109b84a&amp;gt;] ofd_commitrw_write+0xea/0x1c20 [ofd]
[11721.068157]  [&amp;lt;ffffffffa109ff2d&amp;gt;] ofd_commitrw+0x51d/0xa40 [ofd]
[11721.076424]  [&amp;lt;ffffffffa0e0f8d2&amp;gt;] obd_commitrw+0x2ec/0x32f [ptlrpc]
[11721.085001]  [&amp;lt;ffffffffa0de7f71&amp;gt;] tgt_brw_write+0xea1/0x1640 [ptlrpc]
[11721.094001]  [&amp;lt;ffffffff81632d15&amp;gt;] ? __slab_free+0x10e/0x277
[11721.101706]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[11721.109340]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[11721.117924]  [&amp;lt;ffffffffa0d3e560&amp;gt;] ? target_send_reply_msg+0x170/0x170 [ptlrpc]
[11721.127469]  [&amp;lt;ffffffffa0de4225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[11721.136601]  [&amp;lt;ffffffffa0d901ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[11721.146564]  [&amp;lt;ffffffffa0a33128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[11721.155726]  [&amp;lt;ffffffffa0d8dd68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[11721.164815]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[11721.173099]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[11721.180948]  [&amp;lt;ffffffffa0d94260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[11721.189287]  [&amp;lt;ffffffffa0d937c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[11721.198828]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[11721.205490]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[11721.214017]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[11721.221178]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[11906.409714] perf interrupt took too long (5056 &amp;gt; 5000), lowering kernel.perf_event_max_sample_rate to 25000
[12369.576466] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x28dd:0x0] object 0x0:69605 extent [100663296-115441663]: client csum 34b2200, server csum 5f29220d
[12574.297235] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x2a16:0x0] object 0x0:69767 extent [100663296-114409471]: client csum c953b2e4, server csum f3b9a3f5
[12583.154014] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0001 from 12345-192.168.1.6@o2ib inode [0x200000405:0x2a22:0x0] object 0x0:69773 extent [100663296-117309439]: client csum fa39f722, server csum 17548bac
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;


&lt;p&gt;wolf-3 OSS&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[  702.495373] Lustre: lsdraid-OST0000: Connection restored to 5dd53d1b-72ff-64c0-86f7-b4ab04036f55 (at 192.168.1.6@o2ib)
[  712.111566] LustreError: 35894:0:(ofd_grant.c:641:ofd_grant_check()) lsdraid-OST0000: cli 5dd53d1b-72ff-64c0-86f7-b4ab04036f55 claims 17432576 GRANT, real grant 0
[  712.629997] LustreError: 39491:0:(ofd_grant.c:641:ofd_grant_check()) lsdraid-OST0000: cli 5dd53d1b-72ff-64c0-86f7-b4ab04036f55 claims 17432576 GRANT, real grant 0
[  712.649481] LustreError: 39491:0:(ofd_grant.c:641:ofd_grant_check()) Skipped 8 previous similar messages
[  713.660785] LustreError: 38266:0:(ofd_grant.c:641:ofd_grant_check()) lsdraid-OST0000: cli 5dd53d1b-72ff-64c0-86f7-b4ab04036f55 claims 17432576 GRANT, real grant 0
[  713.679875] LustreError: 38266:0:(ofd_grant.c:641:ofd_grant_check()) Skipped 5 previous similar messages
[  715.665680] LustreError: 38165:0:(ofd_grant.c:641:ofd_grant_check()) lsdraid-OST0000: cli 5dd53d1b-72ff-64c0-86f7-b4ab04036f55 claims 17432576 GRANT, real grant 0
[  715.685499] LustreError: 38165:0:(ofd_grant.c:641:ofd_grant_check()) Skipped 48 previous similar messages
[  835.423369] Lustre: lsdraid-OST0000: Connection restored to 4e5e1424-c5a7-dbfe-ccf8-a041ec520cb5 (at 192.168.1.9@o2ib)
[  835.437468] Lustre: Skipped 2 previous similar messages
[11228.546836] perf interrupt took too long (2506 &amp;gt; 2500), lowering kernel.perf_event_max_sample_rate to 50000
[28193.720410] LNet: Service thread pid 91775 was inactive for 200.29s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
[28193.743765] Pid: 91775, comm: ll_ost00_010
[28193.750363] 
Call Trace:
[28193.758633]  [&amp;lt;ffffffff8163bb39&amp;gt;] schedule+0x29/0x70
[28193.765982]  [&amp;lt;ffffffffa05cb2fd&amp;gt;] cv_wait_common+0x10d/0x130 [spl]
[28193.774687]  [&amp;lt;ffffffff810a6b80&amp;gt;] ? autoremove_wake_function+0x0/0x40
[28193.783567]  [&amp;lt;ffffffffa05cb335&amp;gt;] __cv_wait+0x15/0x20 [spl]
[28193.791608]  [&amp;lt;ffffffffa1439c23&amp;gt;] txg_wait_open+0xb3/0xf0 [zfs]
[28193.799877]  [&amp;lt;ffffffffa13e264d&amp;gt;] dmu_free_long_range+0x25d/0x3d0 [zfs]
[28193.808919]  [&amp;lt;ffffffffa1092468&amp;gt;] osd_unlinked_object_free+0x28/0x280 [osd_zfs]
[28193.818586]  [&amp;lt;ffffffffa10927d3&amp;gt;] osd_unlinked_list_emptify+0x63/0xa0 [osd_zfs]
[28193.828178]  [&amp;lt;ffffffffa1094dba&amp;gt;] osd_trans_stop+0x31a/0x5b0 [osd_zfs]
[28193.836927]  [&amp;lt;ffffffffa119516f&amp;gt;] ofd_trans_stop+0x1f/0x60 [ofd]
[28193.845026]  [&amp;lt;ffffffffa1198d82&amp;gt;] ofd_object_destroy+0x2b2/0x890 [ofd]
[28193.853770]  [&amp;lt;ffffffffa1191987&amp;gt;] ofd_destroy_by_fid+0x307/0x510 [ofd]
[28193.862440]  [&amp;lt;ffffffffa0cdcbe0&amp;gt;] ? ldlm_blocking_ast+0x0/0x170 [ptlrpc]
[28193.871264]  [&amp;lt;ffffffffa0cd71f0&amp;gt;] ? ldlm_completion_ast+0x0/0x910 [ptlrpc]
[28193.880161]  [&amp;lt;ffffffffa1181627&amp;gt;] ofd_destroy_hdl+0x267/0xa50 [ofd]
[28193.888454]  [&amp;lt;ffffffffa0d6b225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[28193.897329]  [&amp;lt;ffffffffa0d171ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[28193.907053]  [&amp;lt;ffffffffa09c7128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[28193.915785]  [&amp;lt;ffffffffa0d14d68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[28193.924476]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[28193.932565]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[28193.940211]  [&amp;lt;ffffffffa0d1b260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[28193.948394]  [&amp;lt;ffffffffa0d1a7c0&amp;gt;] ? ptlrpc_main+0x0/0x1de0 [ptlrpc]
[28193.956493]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[28193.963027]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread+0x0/0xe0
[28193.969635]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[28193.976729]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread+0x0/0xe0

[28193.985950] LustreError: dumping log to /tmp/lustre-log.1492246924.91775
[28199.712751] LNet: Service thread pid 91775 completed after 206.29s. This indicates the system was overloaded (too many service threads, or there were not enough hardware resources).
[31329.310375] perf interrupt took too long (5002 &amp;gt; 5000), lowering kernel.perf_event_max_sample_rate to 25000
[root@wolf-3 10.8.1.3-2017-04-14-22:46:09]# 

&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;root@wolf-4 combined&amp;#93;&lt;/span&gt;# ps aux |grep 9096&lt;br/&gt;
root       9096  0.6  0.0      0     0 ?        S    01:55   4:21 &lt;span class=&quot;error&quot;&gt;&amp;#91;ll_ost_io01_003&amp;#93;&lt;/span&gt;&lt;br/&gt;
root      77386  0.0  0.0 112656   976 pts/0    S+   12:56   0:00 grep --color=auto 9096&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;root@wolf-4 combined&amp;#93;&lt;/span&gt;# man ps&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;root@wolf-4 combined&amp;#93;&lt;/span&gt;# ps aux |grep 8509&lt;br/&gt;
root       8509  4.3  0.0      0     0 ?        D    01:55  28:56 &lt;span class=&quot;error&quot;&gt;&amp;#91;ll_ost_io01_001&amp;#93;&lt;/span&gt;&lt;br/&gt;
root      84813  0.0  0.0 112656   976 pts/0    S+   12:57   0:00 grep --color=auto 8509&lt;/p&gt;

&lt;p&gt;[root@wolf-4 combined]# cat /proc/9096/stack&lt;br/&gt;
[&amp;lt;ffffffffa0d8dff5&amp;gt;] ptlrpc_wait_event+0x325/0x340 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa0d93fcb&amp;gt;] ptlrpc_main+0x80b/0x1de0 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0&lt;br/&gt;
[&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90&lt;br/&gt;
[&amp;lt;ffffffffffffffff&amp;gt;] 0xffffffffffffffff&lt;br/&gt;
[root@wolf-4 combined]# cat /proc/8509/stack&lt;br/&gt;
[&amp;lt;ffffffff8108c04f&amp;gt;] usleep_range+0x4f/0x70&lt;br/&gt;
[&amp;lt;ffffffffa269c99a&amp;gt;] dmu_tx_wait+0x33a/0x360 [zfs]&lt;br/&gt;
[&amp;lt;ffffffffa269ca45&amp;gt;] dmu_tx_assign+0x85/0x3f0 [zfs]&lt;br/&gt;
[&amp;lt;ffffffffa0f94fea&amp;gt;] osd_trans_start+0xaa/0x3c0 [osd_zfs]&lt;br/&gt;
[&amp;lt;ffffffffa10960db&amp;gt;] ofd_trans_start+0x6b/0xe0 [ofd]&lt;br/&gt;
[&amp;lt;ffffffffa109c0a3&amp;gt;] ofd_commitrw_write+0x943/0x1c20 [ofd]&lt;br/&gt;
[&amp;lt;ffffffffa109ff2d&amp;gt;] ofd_commitrw+0x51d/0xa40 [ofd]&lt;br/&gt;
[&amp;lt;ffffffffa0e0f8d2&amp;gt;] obd_commitrw+0x2ec/0x32f [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa0de7f71&amp;gt;] tgt_brw_write+0xea1/0x1640 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa0de4225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa0d901ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffffa0d94260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]&lt;br/&gt;
[&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0&lt;br/&gt;
[&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90&lt;br/&gt;
[&amp;lt;ffffffffffffffff&amp;gt;] 0xffffffffffffffff&lt;/p&gt;</comment>
                            <comment id="192181" author="jsalians_intel" created="Sat, 15 Apr 2017 14:04:03 +0000"  >&lt;p&gt;Yesterday I tried the following combinations: &lt;br/&gt;
Lustre 2.9.0 + latest coral_beta_combined, record size 16M, brw_size=16, draid, zfs_abd_scatter_enabled=0, max_pages_per_rpc=4096 &amp;#8211; crash 10.8.1.3-2017-04-14-22:46:09&lt;br/&gt;
Lustre 2.9.0 + latest coral_beta_combined, record size 16M, brw_size=16, draid, zfs_abd_scatter_enabled=0, max_pages_per_rpc=256 &amp;#8211; crash 10.8.1.3-2017-04-15-00:39:07&lt;/p&gt;

&lt;p&gt;wolf-3 OSS 10.8.1.3-2017-04-14-22:46:09&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[147931.299899] Lustre: lsdraid-OST0000: new disk, initializing
[147931.307239] Lustre: srv-lsdraid-OST0000: No data found on store. Initialize space
[147936.355608] Lustre: lsdraid-OST0000: Connection restored to lsdraid-MDT0000-mdtlov_UUID (at 192.168.1.5@o2ib)
[147963.624729] Lustre: lsdraid-OST0000: Connection restored to bd4f4e40-dbac-a829-f1fd-3c4450a08dcb (at 192.168.1.6@o2ib)
[147970.995882] Lustre: lsdraid-OST0000: Connection restored to b9fbce4c-a90b-3f7f-770e-f9863c38efb5 (at 192.168.1.8@o2ib)
[147975.210049] Lustre: lsdraid-OST0000: Connection restored to 862f84d1-bf42-0dd3-ba54-1e1a9568317e (at 192.168.1.7@o2ib)
[147975.223042] Lustre: Skipped 1 previous similar message
[148306.620448] Lustre: lsdraid-OST0000: Connection restored to b9fbce4c-a90b-3f7f-770e-f9863c38efb5 (at 192.168.1.8@o2ib)
[148306.633674] Lustre: Skipped 1 previous similar message
[233987.779195] perf interrupt took too long (10163 &amp;gt; 9615), lowering kernel.perf_event_max_sample_rate to 13000
[414188.327658] Lustre: 83697:0:(client.c:2111:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1492208717/real 1492208717]  req@ffff880f11ac8300 x1564414877971952/t0(0) o39-&amp;gt;lsdraid-MDT0000-lwp-OST0000@192.168.1.5@o2ib:12/10 lens 224/224 e 0 to 1 dl 1492208723 ref 2 fl Rpc:XN/0/ffffffff rc 0/-1
[414188.364839] Lustre: Failing over lsdraid-OST0000
[414192.689319] Lustre: 118209:0:(client.c:2111:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1492208721/real 1492208721]  req@ffff8815f6846f00 x1564414877971968/t0(0) o400-&amp;gt;MGC192.168.1.5@o2ib@192.168.1.5@o2ib:26/25 lens 224/224 e 0 to 1 dl 1492208728 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
[414194.373337] Lustre: 83697:0:(client.c:2111:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1492208723/real 1492208723]  req@ffff880f11ac8300 x1564414877972032/t0(0) o251-&amp;gt;MGC192.168.1.5@o2ib@192.168.1.5@o2ib:26/25 lens 224/224 e 0 to 1 dl 1492208729 ref 2 fl Rpc:XN/0/ffffffff rc 0/-1
[414194.411850] Lustre: server umount lsdraid-OST0000 complete
[414368.256969] Lustre: lsdraid-OST0000: new disk, initializing
[414368.265405] Lustre: srv-lsdraid-OST0000: No data found on store. Initialize space
[414375.147139] Lustre: lsdraid-OST0000: Connection restored to lsdraid-MDT0000-mdtlov_UUID (at 192.168.1.5@o2ib)
[414533.259382] Lustre: Failing over lsdraid-OST0000
[414533.276260] Lustre: server umount lsdraid-OST0000 complete
[414724.001373] Lustre: lsdraid-OST0000: Imperative Recovery enabled, recovery window shrunk from 300-900 down to 150-450
[414725.696637] Lustre: lsdraid-OST0000: Will be in recovery for at least 2:30, or until 1 client reconnects
[414725.709414] Lustre: lsdraid-OST0000: Connection restored to lsdraid-MDT0000-mdtlov_UUID (at 192.168.1.5@o2ib)
[414725.874431] Lustre: lsdraid-OST0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted.
[415336.132350] Lustre: lsdraid-OST0000: Connection restored to bd4f4e40-dbac-a829-f1fd-3c4450a08dcb (at 192.168.1.6@o2ib)
[415406.632740] ------------[ cut here ]------------
[415406.633861] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000401:0x7a:0x0] object 0x0:88 extent [50331648-57343999]: client csum 41b33fd5, server csum 649d3feb
[415406.665939] kernel BUG at include/linux/scatterlist.h:65!
[415406.674352] invalid opcode: 0000 [#1] SMP 
[415406.681344] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate lustre(OE) lmv(OE) mdc(OE) lov(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) libcfs(OE) sha512_generic crypto_null xfs libcrc32c rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm dm_service_time ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper mpt3sas ablk_helper cryptd raid_class scsi_transport_sas ipmi_devintf ipmi_ssif iTCO_wdt
[415406.776798]  sg pcspkr iTCO_vendor_support ipmi_si ipmi_msghandler mei_me sb_edac acpi_power_meter ioatdma lpc_ich edac_core acpi_pad shpchp mei wmi i2c_i801 mfd_core dm_multipath dm_mod nfsd auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb crct10dif_pclmul ttm crct10dif_common ptp crc32c_intel ahci pps_core drm mlx4_core libahci dca i2c_algo_bit libata i2c_core [last unloaded: zunicode]
[415406.848441] CPU: 29 PID: 89865 Comm: ll_ost_io01_000 Tainted: G          IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[415406.863708] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[415406.878344] task: ffff8817d96e5c00 ti: ffff881a6b35c000 task.ti: ffff881a6b35c000
[415406.889651] RIP: 0010:[&amp;lt;ffffffffa0c0cfef&amp;gt;]  [&amp;lt;ffffffffa0c0cfef&amp;gt;] cfs_crypto_hash_update_page+0x9f/0xb0 [libcfs]
[415406.903951] RSP: 0018:ffff881a6b35fab8  EFLAGS: 00010202
[415406.912870] RAX: 0000000000000002 RBX: ffff8820050b5900 RCX: 0000000000000000
[415406.923849] RDX: 0000000000000020 RSI: 0000000000000000 RDI: ffff881a6b35fad8
[415406.934787] RBP: ffff881a6b35fb00 R08: 00000000000195a0 R09: ffff881a6b35fab8
[415406.945693] R10: ffff88103e807900 R11: 0000000000000001 R12: 3534333231303635
[415406.956568] R13: 0000000032313036 R14: 0000000000000433 R15: 0000000000000000
[415406.967407] FS:  0000000000000000(0000) GS:ffff88203e6c0000(0000) knlGS:0000000000000000
[415406.979287] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[415406.988490] CR2: 00007fc89400b008 CR3: 000000000194a000 CR4: 00000000001407e0
[415406.999227] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[415407.009940] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[415407.020607] Stack:
[415407.025494]  0000000000000002 0000000000000000 0000000000000000 0000000000000000
[415407.036487]  00000000ced088e5 0000000000000000 ffff882024772701 ffff880db7053000
[415407.047418]  0000000000000000 ffff881a6b35fb68 ffffffffa0f8e459 ffff8819d6ea98a8
[415407.058319] Call Trace:
[415407.063640]  [&amp;lt;ffffffffa0f8e459&amp;gt;] tgt_checksum_bulk.isra.33+0x35a/0x4e7 [ptlrpc]
[415407.074501]  [&amp;lt;ffffffffa0f6721d&amp;gt;] tgt_brw_write+0x114d/0x1640 [ptlrpc]
[415407.084323]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[415407.092958]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[415407.102588]  [&amp;lt;ffffffffa0ebd560&amp;gt;] ? target_send_reply_msg+0x170/0x170 [ptlrpc]
[415407.113192]  [&amp;lt;ffffffffa0f63225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[415407.123952]  [&amp;lt;ffffffffa0f0f1ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[415407.135575]  [&amp;lt;ffffffffa0c13128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[415407.146329]  [&amp;lt;ffffffffa0f0cd68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[415407.156963]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[415407.166363]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[415407.175301]  [&amp;lt;ffffffffa0f13260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[415407.184635]  [&amp;lt;ffffffff81013588&amp;gt;] ? __switch_to+0xf8/0x4b0
[415407.193114]  [&amp;lt;ffffffffa0f127c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[415407.204113]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[415407.212374]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[415407.222423]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[415407.231187]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[415407.241105] Code: 89 43 38 48 8b 43 20 ff 50 c0 48 8b 55 d8 65 48 33 14 25 28 00 00 00 75 0d 48 83 c4 28 5b 41 5c 41 5d 41 5e 5d c3 e8 61 e0 46 e0 &amp;lt;0f&amp;gt; 0b 0f 1f 44 00 00 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 
[415407.268624] RIP  [&amp;lt;ffffffffa0c0cfef&amp;gt;] cfs_crypto_hash_update_page+0x9f/0xb0 [libcfs]
[415407.279914]  RSP &amp;lt;ffff881a6b35fab8&amp;gt;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;wolf-3 OSS 10.8.1.3-2017-04-15-00:39:07&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[ 6415.538534] Lustre: lsdraid-OST0000: Imperative Recovery enabled, recovery window shrunk from 300-900 down to 150-450
[ 6422.155237] Lustre: lsdraid-OST0000: Will be in recovery for at least 2:30, or until 1 client reconnects
[ 6422.165992] Lustre: lsdraid-OST0000: Connection restored to lsdraid-MDT0000-mdtlov_UUID (at 192.168.1.5@o2ib)
[ 6422.291438] Lustre: lsdraid-OST0000: Recovery over after 0:01, of 1 clients 1 recovered and 0 were evicted.
[ 6422.301549] Lustre: lsdraid-OST0000: deleting orphan objects from 0x0:91 to 0x0:129
[ 6474.856831] Lustre: lsdraid-OST0000: Connection restored to  (at 192.168.1.8@o2ib)
[ 6565.960924] BUG: Bad page state in process ll_ost_io01_007  pfn:18eecce
[ 6565.961668] BUG: Bad page state in process ll_ost_io01_006  pfn:18eecca
[ 6565.961672] page:ffffea0063bb3280 count:-1 mapcount:0 mapping:          (null) index:0x0
[ 6565.961674] page flags: 0x6fffff00000000()
[ 6565.961675] page dumped because: nonzero _count
[ 6565.961726] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate dm_service_time xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas raid_class lrw gf128mul glue_helper ablk_helper cryptd scsi_transport_sas iTCO_wdt iTCO_vendor_support mei_me ipmi_devintf ipmi_ssif lpc_ich sb_edac ipmi_si
[ 6565.961778]  edac_core sg ipmi_msghandler mei shpchp pcspkr ioatdma mfd_core i2c_i801 wmi acpi_pad acpi_power_meter nfsd dm_multipath dm_mod nfs_acl lockd binfmt_misc grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en vxlan ip6_udp_tunnel udp_tunnel mlx4_ib ib_sa ib_mad ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb ttm ptp crct10dif_pclmul pps_core crct10dif_common ahci drm crc32c_intel dca libahci mlx4_core i2c_algo_bit libata i2c_core
[ 6565.961782] CPU: 31 PID: 10886 Comm: ll_ost_io01_006 Tainted: G          IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[ 6565.961784] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[ 6565.961792]  ffffea0063bb3280 000000008d05d0f2 ffff88202236f6f8 ffffffff81636431
[ 6565.961797]  ffff88202236f720 ffffffff81631645 ffff88203e759c68 0000000000003735
[ 6565.961803]  0000000000000001 ffff88202236f828 ffffffff81173028 ffff881022e59370
[ 6565.961804] Call Trace:
[ 6565.961819]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[ 6565.961824]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[ 6565.961833]  [&amp;lt;ffffffff81173028&amp;gt;] get_page_from_freelist+0x848/0x9b0
[ 6565.961844]  [&amp;lt;ffffffffa06cadaa&amp;gt;] ? spl_kmem_free+0x2a/0x40 [spl]
[ 6565.961848]  [&amp;lt;ffffffff81173327&amp;gt;] __alloc_pages_nodemask+0x197/0xba0
[ 6565.961862]  [&amp;lt;ffffffffa01f9f02&amp;gt;] ? mlx4_ib_post_send+0x4e2/0xb20 [mlx4_ib]
[ 6565.961910]  [&amp;lt;ffffffffa0b68f8d&amp;gt;] ? lu_obj_hop_keycmp+0x1d/0x30 [obdclass]
[ 6565.961927]  [&amp;lt;ffffffffa081d717&amp;gt;] ? cfs_hash_bd_lookup_intent+0x57/0x160 [libcfs]
[ 6565.961935]  [&amp;lt;ffffffff811b4afa&amp;gt;] alloc_pages_current+0xaa/0x170
[ 6565.961952]  [&amp;lt;ffffffffa0d5786b&amp;gt;] osd_bufs_get+0x4cb/0xba0 [osd_zfs]
[ 6565.961970]  [&amp;lt;ffffffffa10ade3d&amp;gt;] ofd_preprw_write.isra.29+0x1bd/0xcd0 [ofd]
[ 6565.961980]  [&amp;lt;ffffffffa10af13a&amp;gt;] ofd_preprw+0x7ea/0x10c0 [ofd]
[ 6565.962092]  [&amp;lt;ffffffffa0e8fce7&amp;gt;] tgt_brw_write+0xc17/0x1640 [ptlrpc]
[ 6565.962098]  [&amp;lt;ffffffff81632d15&amp;gt;] ? __slab_free+0x10e/0x277
[ 6565.962105]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[ 6565.962110]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[ 6565.962115]  [&amp;lt;ffffffff81639d72&amp;gt;] ? mutex_lock+0x12/0x2f
[ 6565.962178]  [&amp;lt;ffffffffa0e8c225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[ 6565.962234]  [&amp;lt;ffffffffa0e381ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[ 6565.962249]  [&amp;lt;ffffffffa081a128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[ 6565.962302]  [&amp;lt;ffffffffa0e35d68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[ 6565.962311]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[ 6565.962315]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[ 6565.962368]  [&amp;lt;ffffffffa0e3c260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[ 6565.962377]  [&amp;lt;ffffffff81013588&amp;gt;] ? __switch_to+0xf8/0x4b0
[ 6565.962428]  [&amp;lt;ffffffffa0e3b7c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[ 6565.962436]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[ 6565.962441]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6565.962449]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[ 6565.962454]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6565.962456] Disabling lock debugging due to kernel taint
[ 6565.962539] BUG: Bad page state in process ll_ost_io01_006  pfn:18eecc5
[ 6565.962541] page:ffffea0063bb3140 count:-1 mapcount:0 mapping:          (null) index:0x0
[ 6565.962542] page flags: 0x6fffff00000000()
[ 6565.962543] page dumped because: nonzero _count
[ 6565.962576] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate dm_service_time xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas raid_class lrw gf128mul glue_helper ablk_helper cryptd scsi_transport_sas iTCO_wdt iTCO_vendor_support mei_me ipmi_devintf ipmi_ssif lpc_ich sb_edac ipmi_si
[ 6565.962601]  edac_core sg ipmi_msghandler mei shpchp pcspkr ioatdma mfd_core i2c_i801 wmi acpi_pad acpi_power_meter nfsd dm_multipath dm_mod nfs_acl lockd binfmt_misc grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en vxlan ip6_udp_tunnel udp_tunnel mlx4_ib ib_sa ib_mad ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb ttm ptp crct10dif_pclmul pps_core crct10dif_common ahci drm crc32c_intel dca libahci mlx4_core i2c_algo_bit libata i2c_core
[ 6565.962604] CPU: 31 PID: 10886 Comm: ll_ost_io01_006 Tainted: G    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[ 6565.962605] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[ 6565.962612]  ffffea0063bb3140 000000008d05d0f2 ffff88202236f6f8 ffffffff81636431
[ 6565.962619]  ffff88202236f720 ffffffff81631645 ffff88203e759c68 0000000000003735
[ 6565.962625]  0000000000000001 ffff88202236f828 ffffffff81173028 ffff881022e59370
[ 6565.962626] Call Trace:
[ 6565.962632]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[ 6565.962636]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[ 6565.962641]  [&amp;lt;ffffffff81173028&amp;gt;] get_page_from_freelist+0x848/0x9b0
[ 6565.962650]  [&amp;lt;ffffffffa06cadaa&amp;gt;] ? spl_kmem_free+0x2a/0x40 [spl]
[ 6565.962655]  [&amp;lt;ffffffff81173327&amp;gt;] __alloc_pages_nodemask+0x197/0xba0
[ 6565.962669]  [&amp;lt;ffffffffa01f9f02&amp;gt;] ? mlx4_ib_post_send+0x4e2/0xb20 [mlx4_ib]
[ 6565.962711]  [&amp;lt;ffffffffa0b68f8d&amp;gt;] ? lu_obj_hop_keycmp+0x1d/0x30 [obdclass]
[ 6565.962727]  [&amp;lt;ffffffffa081d717&amp;gt;] ? cfs_hash_bd_lookup_intent+0x57/0x160 [libcfs]
[ 6565.962733]  [&amp;lt;ffffffff811b4afa&amp;gt;] alloc_pages_current+0xaa/0x170
[ 6565.962745]  [&amp;lt;ffffffffa0d5786b&amp;gt;] osd_bufs_get+0x4cb/0xba0 [osd_zfs]
[ 6565.962767]  [&amp;lt;ffffffffa10ade3d&amp;gt;] ofd_preprw_write.isra.29+0x1bd/0xcd0 [ofd]
[ 6565.962781]  [&amp;lt;ffffffffa10af13a&amp;gt;] ofd_preprw+0x7ea/0x10c0 [ofd]
[ 6565.962855]  [&amp;lt;ffffffffa0e8fce7&amp;gt;] tgt_brw_write+0xc17/0x1640 [ptlrpc]
[ 6565.962861]  [&amp;lt;ffffffff81632d15&amp;gt;] ? __slab_free+0x10e/0x277
[ 6565.962866]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[ 6565.962870]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[ 6565.962875]  [&amp;lt;ffffffff81639d72&amp;gt;] ? mutex_lock+0x12/0x2f
[ 6565.962949]  [&amp;lt;ffffffffa0e8c225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[ 6565.963019]  [&amp;lt;ffffffffa0e381ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[ 6565.963034]  [&amp;lt;ffffffffa081a128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[ 6565.963103]  [&amp;lt;ffffffffa0e35d68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[ 6565.963109]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[ 6565.963112]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[ 6565.963181]  [&amp;lt;ffffffffa0e3c260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[ 6565.963187]  [&amp;lt;ffffffff81013588&amp;gt;] ? __switch_to+0xf8/0x4b0
[ 6565.963256]  [&amp;lt;ffffffffa0e3b7c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[ 6565.963262]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[ 6565.963267]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6565.963273]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[ 6565.963278]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6565.963280] BUG: Bad page state in process ll_ost_io01_006  pfn:18eecc6
[ 6565.963282] page:ffffea0063bb3180 count:-1 mapcount:0 mapping:          (null) index:0x0
[ 6565.963284] page flags: 0x6fffff00000000()
[ 6565.963285] page dumped because: nonzero _count
[ 6565.963320] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate dm_service_time xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas raid_class lrw gf128mul glue_helper ablk_helper cryptd scsi_transport_sas iTCO_wdt iTCO_vendor_support mei_me ipmi_devintf ipmi_ssif lpc_ich sb_edac ipmi_si
[ 6565.963346]  edac_core sg ipmi_msghandler mei shpchp pcspkr ioatdma mfd_core i2c_i801 wmi acpi_pad acpi_power_meter nfsd dm_multipath dm_mod nfs_acl lockd binfmt_misc grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en vxlan ip6_udp_tunnel udp_tunnel mlx4_ib ib_sa ib_mad ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb ttm ptp crct10dif_pclmul pps_core crct10dif_common ahci drm crc32c_intel dca libahci mlx4_core i2c_algo_bit libata i2c_core
[ 6565.963349] CPU: 31 PID: 10886 Comm: ll_ost_io01_006 Tainted: G    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[ 6565.963350] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[ 6565.963358]  ffffea0063bb3180 000000008d05d0f2 ffff88202236f6f8 ffffffff81636431
[ 6565.963365]  ffff88202236f720 ffffffff81631645 ffff88203e759c68 0000000000003735
[ 6565.963372]  0000000000000001 ffff88202236f828 ffffffff81173028 ffff881022e59370
[ 6565.963372] Call Trace:
[ 6565.963378]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[ 6565.963383]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[ 6565.963388]  [&amp;lt;ffffffff81173028&amp;gt;] get_page_from_freelist+0x848/0x9b0
[ 6565.963397]  [&amp;lt;ffffffffa06cadaa&amp;gt;] ? spl_kmem_free+0x2a/0x40 [spl]
[ 6565.963403]  [&amp;lt;ffffffff81173327&amp;gt;] __alloc_pages_nodemask+0x197/0xba0
[ 6565.963416]  [&amp;lt;ffffffffa01f9f02&amp;gt;] ? mlx4_ib_post_send+0x4e2/0xb20 [mlx4_ib]
[ 6565.963458]  [&amp;lt;ffffffffa0b68f8d&amp;gt;] ? lu_obj_hop_keycmp+0x1d/0x30 [obdclass]
[ 6565.963473]  [&amp;lt;ffffffffa081d717&amp;gt;] ? cfs_hash_bd_lookup_intent+0x57/0x160 [libcfs]
[ 6565.963479]  [&amp;lt;ffffffff811b4afa&amp;gt;] alloc_pages_current+0xaa/0x170
[ 6565.963491]  [&amp;lt;ffffffffa0d5786b&amp;gt;] osd_bufs_get+0x4cb/0xba0 [osd_zfs]
[ 6565.963506]  [&amp;lt;ffffffffa10ade3d&amp;gt;] ofd_preprw_write.isra.29+0x1bd/0xcd0 [ofd]
[ 6565.963519]  [&amp;lt;ffffffffa10af13a&amp;gt;] ofd_preprw+0x7ea/0x10c0 [ofd]
[ 6565.963593]  [&amp;lt;ffffffffa0e8fce7&amp;gt;] tgt_brw_write+0xc17/0x1640 [ptlrpc]
[ 6565.963599]  [&amp;lt;ffffffff81632d15&amp;gt;] ? __slab_free+0x10e/0x277
[ 6565.963603]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[ 6565.963607]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[ 6565.963612]  [&amp;lt;ffffffff81639d72&amp;gt;] ? mutex_lock+0x12/0x2f
[ 6565.963686]  [&amp;lt;ffffffffa0e8c225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[ 6565.963756]  [&amp;lt;ffffffffa0e381ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[ 6565.963778]  [&amp;lt;ffffffffa081a128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[ 6565.963847]  [&amp;lt;ffffffffa0e35d68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[ 6565.963853]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[ 6565.963856]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[ 6565.963925]  [&amp;lt;ffffffffa0e3c260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[ 6565.963931]  [&amp;lt;ffffffff81013588&amp;gt;] ? __switch_to+0xf8/0x4b0
[ 6565.964000]  [&amp;lt;ffffffffa0e3b7c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[ 6565.964006]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[ 6565.964011]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6565.964016]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[ 6565.964021]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6567.436859] page:ffffea0063bb3380 count:-1 mapcount:0 mapping:          (null) index:0x0
[ 6567.447916] page flags: 0x6fffff00000000()
[ 6567.454287] page dumped because: nonzero _count
[ 6567.461107] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate dm_service_time xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas raid_class lrw gf128mul glue_helper ablk_helper cryptd scsi_transport_sas iTCO_wdt iTCO_vendor_support mei_me ipmi_devintf ipmi_ssif lpc_ich sb_edac ipmi_si
[ 6567.549458]  edac_core sg ipmi_msghandler mei shpchp pcspkr ioatdma mfd_core i2c_i801 wmi acpi_pad acpi_power_meter nfsd dm_multipath dm_mod nfs_acl lockd binfmt_misc grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en vxlan ip6_udp_tunnel udp_tunnel mlx4_ib ib_sa ib_mad ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb ttm ptp crct10dif_pclmul pps_core crct10dif_common ahci drm crc32c_intel dca libahci mlx4_core i2c_algo_bit libata i2c_core
[ 6567.606553] CPU: 19 PID: 11266 Comm: ll_ost_io01_007 Tainted: G    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[ 6567.619967] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[ 6567.632682]  ffffea0063bb3380 0000000029637c7c ffff880f32283908 ffffffff81636431
[ 6567.642074]  ffff880f32283930 ffffffff81631645 ffffea0063bb3380 0000000000000000
[ 6567.651459]  000fffff00000000 ffff880f32283978 ffffffff811714dd fff00000fe000000
[ 6567.660857] Call Trace:
[ 6567.664645]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[ 6567.671441]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[ 6567.678829]  [&amp;lt;ffffffff811714dd&amp;gt;] free_pages_prepare+0x16d/0x190
[ 6567.686591]  [&amp;lt;ffffffff81171e21&amp;gt;] free_hot_cold_page+0x31/0x140
[ 6567.694250]  [&amp;lt;ffffffff8117200f&amp;gt;] __free_pages+0x3f/0x60
[ 6567.701235]  [&amp;lt;ffffffffa0d56ad3&amp;gt;] osd_bufs_put+0x123/0x1f0 [osd_zfs]
[ 6567.709381]  [&amp;lt;ffffffffa10ab84a&amp;gt;] ofd_commitrw_write+0xea/0x1c20 [ofd]
[ 6567.717717]  [&amp;lt;ffffffffa10aff2d&amp;gt;] ofd_commitrw+0x51d/0xa40 [ofd]
[ 6567.725522]  [&amp;lt;ffffffffa0eb78d2&amp;gt;] obd_commitrw+0x2ec/0x32f [ptlrpc]
[ 6567.733604]  [&amp;lt;ffffffffa0e8ff71&amp;gt;] tgt_brw_write+0xea1/0x1640 [ptlrpc]
[ 6567.741863]  [&amp;lt;ffffffffa0de6560&amp;gt;] ? target_send_reply_msg+0x170/0x170 [ptlrpc]
[ 6567.751008]  [&amp;lt;ffffffffa0e8c225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[ 6567.760002]  [&amp;lt;ffffffffa0e381ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[ 6567.769852]  [&amp;lt;ffffffffa081a128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[ 6567.778843]  [&amp;lt;ffffffffa0e35d68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[ 6567.787757]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[ 6567.796038]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[ 6567.803866]  [&amp;lt;ffffffffa0e3c260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[ 6567.812150]  [&amp;lt;ffffffff81013588&amp;gt;] ? __switch_to+0xf8/0x4b0
[ 6567.819620]  [&amp;lt;ffffffffa0e3b7c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[ 6567.828951]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[ 6567.835460]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6567.843817]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[ 6567.850900]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6591.647844] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000402:0x38:0x0] object 0x0:151 extent [67108864-74711039]: client csum 10225ab5, server csum d83f5ab1
[ 6602.366408] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000402:0x46:0x0] object 0x0:158 extent [67108864-82968575]: client csum df6bd34a, server csum a629d34d
[ 6611.821644] general protection fault: 0000 [#1] SMP 
[ 6611.829518] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache zfs(OE) zunicode(OE) zavl(OE) icp(OE) zcommon(OE) znvpair(OE) spl(OE) zlib_deflate dm_service_time xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel mpt3sas raid_class lrw gf128mul glue_helper ablk_helper cryptd scsi_transport_sas iTCO_wdt iTCO_vendor_support mei_me ipmi_devintf ipmi_ssif lpc_ich sb_edac ipmi_si
[ 6611.923714]  edac_core sg ipmi_msghandler mei shpchp pcspkr ioatdma mfd_core i2c_i801 wmi acpi_pad acpi_power_meter nfsd dm_multipath dm_mod nfs_acl lockd binfmt_misc grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en vxlan ip6_udp_tunnel udp_tunnel mlx4_ib ib_sa ib_mad ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb ttm ptp crct10dif_pclmul pps_core crct10dif_common ahci drm crc32c_intel dca libahci mlx4_core i2c_algo_bit libata i2c_core
[ 6611.985416] CPU: 55 PID: 9668 Comm: ll_ost_io01_000 Tainted: G    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[ 6611.999894] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[ 6612.013786] task: ffff880fd957e780 ti: ffff880fdb368000 task.ti: ffff880fdb368000
[ 6612.024361] RIP: 0010:[&amp;lt;ffffffffa0814e30&amp;gt;]  [&amp;lt;ffffffffa0814e30&amp;gt;] adler32_update+0x70/0x250 [libcfs]
[ 6612.036764] RSP: 0018:ffff880fdb36b990  EFLAGS: 00010212
[ 6612.044902] RAX: 0000000000000cce RBX: 0000000000000cce RCX: 3433323130363534
[ 6612.055097] RDX: 0000000000000cce RSI: 0cd1944c0d8d4332 RDI: 0cd1944c0d8d4332
[ 6612.065272] RBP: ffff880fdb36b9f8 R08: 00000000000195a0 R09: 0000000000000cce
[ 6612.075453] R10: ffff88103e807900 R11: 0000000000000001 R12: 3433323130363534
[ 6612.085641] R13: 0000000031303635 R14: ffffffffa0834410 R15: 0000000000000001
[ 6612.095830] FS:  0000000000000000(0000) GS:ffff88203e8c0000(0000) knlGS:0000000000000000
[ 6612.107119] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6612.115792] CR2: 00007f19c6c7c000 CR3: 000000000194a000 CR4: 00000000001407e0
[ 6612.126030] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 6612.136265] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 6612.146492] Stack:
[ 6612.150994]  ffff881d1a0d1cd0 00000ccedb36b9c8 0cd1944c0d8d4332 0000000000000000
[ 6612.161627]  00000cce00000000 ffffffffa0834410 ffff882027752a08 ffff880fdb36b9f0
[ 6612.172284]  0cd1944c0d8d4332 3433323130363534 0000000031303635 ffffffffa0834410
[ 6612.182948] Call Trace:
[ 6612.187988]  [&amp;lt;ffffffff812b1a78&amp;gt;] crypto_shash_update+0x38/0x100
[ 6612.197017]  [&amp;lt;ffffffff812b1d6e&amp;gt;] shash_ahash_update+0x3e/0x70
[ 6612.205854]  [&amp;lt;ffffffff812b1db2&amp;gt;] shash_async_update+0x12/0x20
[ 6612.214676]  [&amp;lt;ffffffffa0813fce&amp;gt;] cfs_crypto_hash_update_page+0x7e/0xb0 [libcfs]
[ 6612.225344]  [&amp;lt;ffffffffa0eb7459&amp;gt;] tgt_checksum_bulk.isra.33+0x35a/0x4e7 [ptlrpc]
[ 6612.236606]  [&amp;lt;ffffffffa0e9021d&amp;gt;] tgt_brw_write+0x114d/0x1640 [ptlrpc]
[ 6612.246831]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[ 6612.255910]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[ 6612.265910]  [&amp;lt;ffffffffa0de6560&amp;gt;] ? target_send_reply_msg+0x170/0x170 [ptlrpc]
[ 6612.276879]  [&amp;lt;ffffffffa0e8c225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[ 6612.287460]  [&amp;lt;ffffffffa0e381ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[ 6612.298869]  [&amp;lt;ffffffffa081a128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[ 6612.309312]  [&amp;lt;ffffffffa0e35d68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[ 6612.319759]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[ 6612.329610]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[ 6612.338955]  [&amp;lt;ffffffffa0e3c260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[ 6612.348565]  [&amp;lt;ffffffff81013588&amp;gt;] ? __switch_to+0xf8/0x4b0
[ 6612.357360]  [&amp;lt;ffffffffa0e3b7c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[ 6612.368146]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[ 6612.376092]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6612.385802]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[ 6612.394179]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[ 6612.403767] Code: 44 00 00 8b 5d b8 b8 b0 15 00 00 81 fb b0 15 00 00 0f 46 c3 29 45 b8 83 f8 0f 89 45 a4 0f 8e f8 00 00 00 48 8b 7d a8 89 45 bc 90 &amp;lt;44&amp;gt; 0f b6 2f 44 0f b6 77 01 48 83 c7 10 44 0f b6 67 f2 0f b6 5f 
[ 6612.430647] RIP  [&amp;lt;ffffffffa0814e30&amp;gt;] adler32_update+0x70/0x250 [libcfs]
[ 6612.440428]  RSP &amp;lt;ffff880fdb36b990&amp;gt;
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="192182" author="jsalians_intel" created="Sat, 15 Apr 2017 15:32:48 +0000"  >&lt;p&gt;The above dumps are now on onyx: &lt;br/&gt;
/scratch/johnsali/&lt;br/&gt;
-rw-r--r--  1 johnsali johnsali 430M Apr 15 08:24 10.8.1.3-2017-04-14-224609.tgz&lt;br/&gt;
-rw-r--r--  1 johnsali johnsali 142M Apr 15 08:24 10.8.1.3-2017-04-15-003907.tgz&lt;br/&gt;
-rw-r--r--  1 johnsali johnsali 707M Apr 15 08:26 10.8.1.3-2017-04-15-132245.tgz&lt;br/&gt;
-rw-r--r--  1 johnsali johnsali 274M Apr 15 08:27 10.8.1.4-2017-04-15-002617.tgz&lt;br/&gt;
-rw-r--r--  1 johnsali johnsali 485M Apr 15 08:29 10.8.1.4-2017-04-15-014743.tgz&lt;br/&gt;
-rw-r--r--  1 johnsali johnsali 782M Apr 15 08:31 10.8.1.4-2017-04-15-132247.tgz&lt;/p&gt;</comment>
                            <comment id="192231" author="utopiabound" created="Mon, 17 Apr 2017 12:44:25 +0000"  >&lt;p&gt;Looking at &quot;wolf-3 OSS 10.8.1.3-2017-04-15-00:39:07&quot;:&lt;/p&gt;

&lt;p&gt;Dump the backtrace with stack:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; bt -f
...
 #9 [ffff880fdb36bab0] cfs_crypto_hash_update_page at ffffffffa0813fce [libcfs]
    ffff880fdb36bab8: 3433323130363536 3130363500000332 
    ffff880fdb36bac8: 0000000000000000 0000000000000000 
    ffff880fdb36bad8: 00000000b643a6e1 0000000000000000 
    ffff880fdb36bae8: ffff882027752a01 ffff881dc0c9a600 
    ffff880fdb36baf8: 0000000000000000 ffff880fdb36bb68 
    ffff880fdb36bb08: ffffffffa0eb7459 
#10 [ffff880fdb36bb08] tgt_checksum_bulk at ffffffffa0eb7459 [ptlrpc]
    ffff880fdb36bb10: ffff881969ae18a8 ffff880fd957e780 
    ffff880fdb36bb20: 00000004810b8940 ffff881d1a0d1c80 
    ffff880fdb36bb30: dead000000200200 00000000b643a6e1 
    ffff880fdb36bb40: ffff8817ffd40050 ffff882027752a80 
    ffff880fdb36bb50: ffff881f26180000 0000000000000000 
    ffff880fdb36bb60: ffff8818dee4cc00 ffff880fdb36bcd0 
    ffff880fdb36bb70: ffffffffa0e9021d 
...
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Hunt until we find the &lt;tt&gt;ptlrpc_bulk_desc&lt;/tt&gt;:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; struct ptlrpc_bulk_desc ffff881dc0c9a600
struct ptlrpc_bulk_desc {
  bd_failure = 0, 
  bd_registered = 0, 
  bd_lock = {
    {
      rlock = {
        raw_lock = {
          {
            head_tail = 4587590, 
            tickets = {
              head = 70, 
              tail = 70
            }
          }
        }
      }
    }
  }, 
  bd_import_generation = 0, 
  bd_type = 41, 
  bd_portal = 8, 
  bd_export = 0xffff8818dee4cc00, 
  bd_import = 0x0, 
  bd_req = 0xffff8817ffd40050, 
  bd_frag_ops = 0xffffffffa0ec16a0 &amp;lt;ptlrpc_bulk_kiov_nopin_ops&amp;gt;, 
  bd_waitq = {
    lock = {
      {
        rlock = {
          raw_lock = {
            {
              head_tail = 393222, 
              tickets = {
                head = 6, 
                tail = 6
              }
            }
          }
        }
      }
    }, 
    task_list = {
      next = 0xffff881dc0c9a640, 
      prev = 0xffff881dc0c9a640
    }
  }, 
  bd_iov_count = 3872, 
  bd_max_iov = 3872, 
  bd_nob = 15859712, 
  bd_nob_transferred = 15859712, 
  bd_last_mbits = 0, 
  bd_cbid = {
    cbid_fn = 0xffffffffa0e30b80 &amp;lt;reply_out_callback+736&amp;gt;, 
    cbid_arg = 0xffff881dc0c9a600
  }, 
  bd_sender = 1407378115789062, 
  bd_md_count = 0, 
  bd_md_max_brw = 16, 
  bd_mds = {{
      cookie = 237797
    }, {
      cookie = 237805
    }, {
      cookie = 237813
    }, {
      cookie = 237821
    }, {
      cookie = 237829
    }, {
      cookie = 237837
    }, {
      cookie = 237845
    }, {
      cookie = 237853
    }, {
      cookie = 237861
    }, {
      cookie = 237869
    }, {
      cookie = 237877
    }, {
      cookie = 237885
    }, {
      cookie = 237893
    }, {
      cookie = 237901
    }, {
      cookie = 237909
    }, {
      cookie = 237917
    }}, 
  bd_u = {
    bd_kiov = {
      bd_enc_vec = 0x0, 
      bd_vec = 0xffff881be3ca0000
    }, 
    bd_kvec = {
      bd_enc_kvec = 0x0, 
      bd_kvec = 0xffff881be3ca0000
    }
  }
}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
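
&lt;p&gt;As a rough cross-check of the descriptor above, the sketch below divides &lt;tt&gt;bd_nob&lt;/tt&gt; by &lt;tt&gt;bd_iov_count&lt;/tt&gt; and compares &lt;tt&gt;bd_nob&lt;/tt&gt; with the byte range of one of the BAD WRITE CHECKSUM console lines quoted earlier in this ticket. All numbers are copied from the ticket; that the console line belongs to the same request as this descriptor is an assumption.&lt;/p&gt;

```python
# Values copied from the ptlrpc_bulk_desc dump above.
bd_iov_count = 3872
bd_nob = 15859712

# Inclusive byte range from a BAD WRITE CHECKSUM console line earlier in the
# ticket (assumed, not confirmed, to describe the same write as this descriptor).
extent_start, extent_end = 67108864, 82968575

MIB = 1024 * 1024
bytes_per_kiov = bd_nob // bd_iov_count
extent_len = extent_end - extent_start + 1

print("bytes per kiov entry:", bytes_per_kiov)
print("bulk size in MiB:", bd_nob / MIB)
print("extent length equals bd_nob:", extent_len == bd_nob)
```

&lt;p&gt;The descriptor itself looks sane by this arithmetic (one page per kiov entry, just over 15 MiB in total), which would be consistent with the damage being in the &lt;tt&gt;bd_vec&lt;/tt&gt; array it points to rather than in the descriptor.&lt;/p&gt;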

&lt;p&gt;Since we know the page pointer in the &lt;tt&gt;bd_vec&lt;/tt&gt; is the issue:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;crash&amp;gt; lnet_kiov_t ffff881be3ca0000
struct lnet_kiov_t {
  kiov_page = 0x3433323130363534, 
  kiov_len = 825243189, 
  kiov_offset = 892613426
}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
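
&lt;p&gt;One way to look at these three values is to lay them out as they would sit in memory on little-endian x86_64 and read the bytes as ASCII. The sketch below does that with the numbers from the dump above. The field widths (8 + 4 + 4 bytes) are an assumption about the &lt;tt&gt;lnet_kiov_t&lt;/tt&gt; layout, and the comparison string &lt;tt&gt;0123456&lt;/tt&gt; is the fill pattern from the test case posted later in this ticket; whether that script was the workload running when this dump was taken is also an assumption.&lt;/p&gt;

```python
# Field values copied from the lnet_kiov_t dump above.
kiov_page = 0x3433323130363534
kiov_len = 825243189
kiov_offset = 892613426

# Assumed layout: one 8-byte pointer followed by two 4-byte unsigned ints,
# stored little-endian as on x86_64.
raw = (
    kiov_page.to_bytes(8, "little")
    + kiov_len.to_bytes(4, "little")
    + kiov_offset.to_bytes(4, "little")
)
text = raw.decode("ascii")
print(text)

# Fill pattern used by the test case later in this ticket (repeated so that a
# 16-byte window can be searched for).
pattern = "0123456" * 4
print("looks like file data:", text in pattern)
```

&lt;p&gt;The same values also appear in the registers of the general protection fault earlier in the ticket (RCX/R12 = 3433323130363534, R13 = 0000000031303635), so the entry appears to hold printable digits rather than a page pointer and lengths.&lt;/p&gt;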

&lt;p&gt;The &lt;tt&gt;kiov_page&lt;/tt&gt; isn&apos;t even remotely a valid kernel pointer.  I&apos;ll work on tracking down where the bad values could have come from.&lt;/p&gt;</comment>
                            <comment id="192252" author="jsalians_intel" created="Mon, 17 Apr 2017 14:37:34 +0000"  >&lt;p&gt;I did a mostly successful run with Lustre 2.9.0 + 0.7.0 RC3 (none of our patches), using raidz and a record size of 1M on both OSS nodes, which forces a max RPC size of 1MB.  With this combination I did not see &quot;Bad page state in process XXX&quot;.  Based on these runs, my assumption is that the ZFS record size and associated brw size of 16MB are the issue.  &lt;/p&gt;</comment>
                            <comment id="192266" author="utopiabound" created="Mon, 17 Apr 2017 15:24:48 +0000"  >&lt;p&gt;Do you want me to keep looking into this from the Lustre side?&lt;/p&gt;</comment>
                            <comment id="192270" author="jsalians_intel" created="Mon, 17 Apr 2017 15:40:30 +0000"  >&lt;p&gt;Do you have any ideas why, in the one case, the OSS node hits the &quot;Bad page state in process&quot; message in stock RC3 but the node does not crash/reboot?  Is that just different configuration options (like panic on this), or is it a symptom that something additional is wrong in beta-coral-combined?   &lt;/p&gt;

&lt;p&gt;Is there any way to isolate the memory for ptlrpc?   I am not sure how to figure out what is stepping on these values.  I can run these same tests directly on ZFS without issues &amp;#8211; figuring out the interactions between ZFS and Lustre is a bit challenging. &lt;/p&gt;
</comment>
                            <comment id="192283" author="jsalians_intel" created="Mon, 17 Apr 2017 16:19:23 +0000"  >&lt;p&gt;For the dumps in 0.7.0 RC3: after an OST process takes a &quot;BUG: Bad page state in process ll_ost_io01&quot; but the OSS node doesn&apos;t crash, is this process &quot;hung&quot; / dead forever?&lt;/p&gt;</comment>
                            <comment id="192594" author="jsalians_intel" created="Wed, 19 Apr 2017 01:18:17 +0000"  >&lt;p&gt;Here is a test case:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;#!/usr/bin/python3.4

import os
import uuid
import tempfile

def WriteFileBasic(multiplysize, blocksize, writechar, path):
    &quot;&quot;&quot;
    Writes a file sequentially, filling it with writechar. Each write is a
    chunk of writechar * multiplysize, and blocksize such chunks are written.
    &quot;&quot;&quot;
    writethis = writechar * multiplysize
    unique_filename = uuid.uuid4()
    filetowrite = path + &apos;/basic-&apos; + str(unique_filename)

    fd = open(filetowrite, &apos;wb&apos;)
    for x in range(blocksize):
        fd.write(bytes(writethis, &apos;UTF-8&apos;))
    fd.close()


directory = tempfile.mkdtemp(dir=&quot;/mnt/lustre&quot;)
if not os.path.exists(directory):
    os.makedirs(directory)

print(&quot;Writing files to: {0}&quot;.format(directory))
for i in range(100):
    for x in range(2):
        WriteFileBasic(multiplysize=204800, blocksize=1024, writechar=&apos;0123456&apos;, path=directory)
        WriteFileBasic(multiplysize=204800, blocksize=1024, writechar=&apos;0123456&apos;, path=directory)
        WriteFileBasic(multiplysize=204800, blocksize=128, writechar=&apos;0&apos;, path=directory)
        WriteFileBasic(multiplysize=204800, blocksize=128, writechar=&apos;0&apos;, path=directory)
        WriteFileBasic(multiplysize=204800, blocksize=1024, writechar=&apos;0123456&apos;, path=directory)
        WriteFileBasic(multiplysize=124, blocksize=1024, writechar=&apos;0&apos;, path=directory)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
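
&lt;p&gt;For reference, the sizes this test case produces can be worked out from its arguments. The sketch below only redoes that arithmetic for the three distinct argument sets used in the loop; the helper name &lt;tt&gt;file_sizes&lt;/tt&gt; is made up for this illustration and is not part of the test case.&lt;/p&gt;

```python
# (multiplysize, blocksize, writechar) for the three distinct calls in the test case.
CALLS = [
    (204800, 1024, "0123456"),
    (204800, 128, "0"),
    (124, 1024, "0"),
]

def file_sizes(calls):
    """Return (bytes per fd.write() call, bytes per file) for each argument set."""
    sizes = []
    for multiplysize, blocksize, writechar in calls:
        chunk = len(writechar) * multiplysize  # bytes handed to each fd.write()
        total = chunk * blocksize              # fd.write() is called blocksize times
        sizes.append((chunk, total))
    return sizes

sizes = file_sizes(CALLS)
for (chunk, total) in sizes:
    print(chunk, "bytes per write,", total, "bytes per file,",
          round(total / 2**20, 2), "MiB")
```

&lt;p&gt;So each pass writes files of about 1400 MiB, 25 MiB and 124 KiB.&lt;/p&gt;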

&lt;p&gt;I start multiple copies of that on one client node: at least 2, but 4 sometimes seems to make this come out a little quicker.&lt;/p&gt;

&lt;p&gt;Here is an example from tonight running this.  &lt;br/&gt;
mpirun -np 4 -wdir /mnt/lustre -machinefile hosts -env I_MPI_EXTRA_FILESYSTEM=on -env I_MPI_EXTRA_FILESYSTEM_LIST=lustre /home/johnsali/wolf-3/ior/src/ior -a POSIX -F -N 4 -d 2 -i 1 -s 20480 -b 8m -t 8m&lt;br/&gt;
./testcase_pagestate.py &amp;amp;  &lt;br/&gt;
./testcase_pagestate.py &amp;amp;  &lt;/p&gt;

&lt;p&gt;almost immediately:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[103132.581173] LustreError: 168-f: BAD WRITE CHECKSUM: lsdraid-OST0000 from 12345-192.168.1.6@o2ib inode [0x200000bd0:0xa98:0x0] object 0x0:1018 extent [83886080-92979199]: client csum 3e2f59b2, server csum cf13a5a5
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
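
&lt;p&gt;Assuming the checksums in these messages are standard Adler-32 (the general protection fault earlier in the ticket is in &lt;tt&gt;adler32_update&lt;/tt&gt;), each 32-bit value is two 16-bit halves: the low half A is 1 plus the sum of all bytes modulo 65521, and the high half B is the running sum of the A values. The sketch below splits the client and server checksums from the three BAD WRITE CHECKSUM lines in this ticket and prints how far apart the A halves are. The helper name &lt;tt&gt;split_adler32&lt;/tt&gt; is made up for this illustration.&lt;/p&gt;

```python
import zlib

def split_adler32(csum_hex):
    """Return (B, A): the high and low 16-bit halves of an Adler-32 value."""
    return divmod(int(csum_hex, 16), 0x10000)

# (client csum, server csum) copied from the BAD WRITE CHECKSUM lines in this ticket.
pairs = [
    ("10225ab5", "d83f5ab1"),
    ("df6bd34a", "a629d34d"),
    ("3e2f59b2", "cf13a5a5"),
]

diffs = []
for client, server in pairs:
    _, client_a = split_adler32(client)
    _, server_a = split_adler32(server)
    diffs.append(client_a - server_a)
    print(client, server, "A difference:", diffs[-1])

# Sanity check of the assumed A/B layout against zlib on a small sample buffer.
sample = b"0123456" * 10
sample_b, sample_a = split_adler32(format(zlib.adler32(sample), "08x"))
assert sample_a == (1 + sum(sample)) % 65521
```

&lt;p&gt;In the first two pairs the A halves differ by only a few units, while in the third they are far apart. This is only an observation about the logged values; it does not by itself show where the data changed.&lt;/p&gt;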

&lt;p&gt;But it took several minutes to get:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;[103332.411485] BUG: Bad page state in process ll_ost_io01_000  pfn:171cd2c
[103332.420695] page:ffffea005c734b00 count:-1 mapcount:0 mapping:          (null) index:0x0
[103332.431396] page flags: 0x6fffff00000000()
[103332.437504] page dumped because: nonzero _count
[103332.444053] Modules linked in: osp(OE) ofd(OE) lfsck(OE) ost(OE) mgc(OE) osd_zfs(OE) lquota(OE) fid(OE) fld(OE) ko2iblnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) sha512_generic crypto_null libcfs(OE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache zfs(POE) zunicode(POE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) zlib_deflate dm_service_time xprtrdma ib_isert iscsi_target_mod ib_iser libiscsi scsi_transport_iscsi ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ses enclosure intel_powerclamp coretemp intel_rapl kvm_intel kvm crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd iTCO_wdt mpt3sas iTCO_vendor_support lpc_ich ipmi_ssif sb_edac ipmi_devintf mei_me raid_class scsi_transport_sas sg mei edac_core i2c_i801 mfd_core pcspkr ipmi_si ioatdma shpchp ipmi_msghandler acpi_pad acpi_power_meter wmi dm_multipath dm_mod binfmt_misc nfsd nfs_acl lockd grace auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 mlx4_en mlx4_ib vxlan ib_sa ip6_udp_tunnel ib_mad udp_tunnel ib_core ib_addr sd_mod crc_t10dif crct10dif_generic mgag200 syscopyarea sysfillrect sysimgblt drm_kms_helper igb crct10dif_pclmul ttm crct10dif_common ahci ptp crc32c_intel libahci pps_core drm mlx4_core dca libata i2c_algo_bit i2c_core
[103332.588503] CPU: 58 PID: 7806 Comm: ll_ost_io01_000 Tainted: P    B     IOE  ------------   3.10.0-327.36.3.el7.x86_64 #1
[103332.601978] Hardware name: Intel Corporation S2600WT2/S2600WT2, BIOS SE5C610.86B.01.01.0008.021120151325 02/11/2015
[103332.614871]  ffffea005c734b00 000000002ff1b307 ffff880ff5e47908 ffffffff81636431
[103332.624444]  ffff880ff5e47930 ffffffff81631645 ffffea005c734b00 0000000000000000
[103332.634042]  000fffff00000000 ffff880ff5e47978 ffffffff811714dd fff00000fe000000
[103332.643624] Call Trace:
[103332.647573]  [&amp;lt;ffffffff81636431&amp;gt;] dump_stack+0x19/0x1b
[103332.654528]  [&amp;lt;ffffffff81631645&amp;gt;] bad_page.part.59+0xdf/0xfc
[103332.662089]  [&amp;lt;ffffffff811714dd&amp;gt;] free_pages_prepare+0x16d/0x190
[103332.670015]  [&amp;lt;ffffffff81171e21&amp;gt;] free_hot_cold_page+0x31/0x140
[103332.677872]  [&amp;lt;ffffffff8117200f&amp;gt;] __free_pages+0x3f/0x60
[103332.685071]  [&amp;lt;ffffffffa0aeead3&amp;gt;] osd_bufs_put+0x123/0x1f0 [osd_zfs]
[103332.693421]  [&amp;lt;ffffffffa0b5c84a&amp;gt;] ofd_commitrw_write+0xea/0x1c20 [ofd]
[103332.701942]  [&amp;lt;ffffffffa0b60f2d&amp;gt;] ofd_commitrw+0x51d/0xa40 [ofd]
[103332.709944]  [&amp;lt;ffffffffa0ece8d2&amp;gt;] obd_commitrw+0x2ec/0x32f [ptlrpc]
[103332.718231]  [&amp;lt;ffffffffa0ea6f71&amp;gt;] tgt_brw_write+0xea1/0x1640 [ptlrpc]
[103332.726897]  [&amp;lt;ffffffff810c15cc&amp;gt;] ? update_curr+0xcc/0x150
[103332.734244]  [&amp;lt;ffffffff810be46e&amp;gt;] ? account_entity_dequeue+0xae/0xd0
[103332.742599]  [&amp;lt;ffffffffa0dfd560&amp;gt;] ? target_send_reply_msg+0x170/0x170 [ptlrpc]
[103332.751928]  [&amp;lt;ffffffffa0ea3225&amp;gt;] tgt_request_handle+0x915/0x1320 [ptlrpc]
[103332.761126]  [&amp;lt;ffffffffa0e4f1ab&amp;gt;] ptlrpc_server_handle_request+0x21b/0xa90 [ptlrpc]
[103332.770928]  [&amp;lt;ffffffffa09c1128&amp;gt;] ? lc_watchdog_touch+0x68/0x180 [libcfs]
[103332.779767]  [&amp;lt;ffffffffa0e4cd68&amp;gt;] ? ptlrpc_wait_event+0x98/0x340 [ptlrpc]
[103332.788569]  [&amp;lt;ffffffff810b8952&amp;gt;] ? default_wake_function+0x12/0x20
[103332.796803]  [&amp;lt;ffffffff810af0b8&amp;gt;] ? __wake_up_common+0x58/0x90
[103332.804581]  [&amp;lt;ffffffffa0e53260&amp;gt;] ptlrpc_main+0xaa0/0x1de0 [ptlrpc]
[103332.812826]  [&amp;lt;ffffffffa0e527c0&amp;gt;] ? ptlrpc_register_service+0xe40/0xe40 [ptlrpc]
[103332.822297]  [&amp;lt;ffffffff810a5b8f&amp;gt;] kthread+0xcf/0xe0
[103332.828979]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
[103332.837491]  [&amp;lt;ffffffff81646a98&amp;gt;] ret_from_fork+0x58/0x90
[103332.844738]  [&amp;lt;ffffffff810a5ac0&amp;gt;] ? kthread_create_on_node+0x140/0x140
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The case is basically writing a large file, a large file, then a very small file (or a large file, then a very small file) while streaming IO from ior is going on.&lt;/p&gt;</comment>
                            <comment id="193069" author="jay" created="Fri, 21 Apr 2017 19:11:50 +0000"  >&lt;p&gt;memory corruption.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10010">
                    <name>Duplicate</name>
                                            <outwardlinks description="duplicates">
                                        <issuelink>
            <issuekey id="40183">LU-9305</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                            <outwardlinks description="is related to ">
                                        <issuelink>
            <issuekey id="44749">LU-9279</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="26232" name="debug_info.20170406_143409_48420_wolf-3.wolf.hpdd.intel.com.tgz" size="3616388" author="jsalians_intel" created="Fri, 7 Apr 2017 14:48:21 +0000"/>
                            <attachment id="26231" name="wolf-6_client.tgz" size="5941047" author="jsalians_intel" created="Fri, 7 Apr 2017 14:54:20 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzyxrr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>