<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 02:10:28 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-7619] USE_LU_REF/lu_ref feature broken after some REFASSERT()s have been added without lu_ref::lf_guard protection</title>
                <link>https://jira.whamcloud.com/browse/LU-7619</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Some fairly old patches introduced several REFASSERT()s to strengthen the lu_ref feature&apos;s checks, but without the required lu_ref::lf_guard spin-lock protection.&lt;br/&gt;
This can cause unexpected &quot;self&quot; deadlocks (instead of the expected LBUG/ASSERT) when a lu_ref failure is detected.&lt;br/&gt;
I found this while using the feature myself to track down and debug a missing-unreference bug.&lt;/p&gt;</description>
                <environment></environment>
        <key id="33893">LU-7619</key>
            <summary>USE_LU_REF/lu_ref feature broken after some REFASSERT()s have been added without lu_ref::lf_guard protection</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="4" iconUrl="https://jira.whamcloud.com/images/icons/priorities/minor.svg">Minor</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="bfaccini">Bruno Faccini</assignee>
                                    <reporter username="bfaccini">Bruno Faccini</reporter>
                        <labels>
                    </labels>
                <created>Wed, 30 Dec 2015 16:20:09 +0000</created>
                <updated>Wed, 13 Jul 2016 18:31:43 +0000</updated>
                            <resolved>Wed, 13 Jul 2016 18:31:43 +0000</resolved>
                                                    <fixVersion>Lustre 2.9.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                            <comments>
                            <comment id="137652" author="bfaccini" created="Wed, 30 Dec 2015 16:51:13 +0000"  >&lt;p&gt;For information, the specific deadlock caused by this bug, which I encountered while using lu_ref failure detection, had the following signature/stack:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;&amp;lt;1&amp;gt;LustreError: dumping log to /tmp/lustre-log.1451225788.113113
&amp;lt;0&amp;gt;BUG: soft lockup - CPU#1 stuck for 67s! [ldlm_bl_03:113432]
&amp;lt;4&amp;gt;Modules linked in: lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) mdd(U) mgs(U) osd_ldiskfs(U) ldiskfs(U) lquota(U) lfsck(U) jbd2 obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) ksocklnd(U) lnet(U) sha512_generic crc32c_intel libcfs(U) nfsd exportfs nfs lockd fscache auth_rpcgss nfs_acl sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm sg joydev microcode iTCO_wdt iTCO_vendor_support igb i2c_algo_bit sb_edac edac_core i2c_i801 i2c_core lpc_ich mfd_core ioatdma dca shpchp ext3 jbd mbcache sd_mod crc_t10dif isci libsas scsi_transport_sas mlx4_ib ib_sa ib_mad ib_core ib_addr ipv6 mlx4_en ptp pps_core mlx4_core ahci wmi dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
&amp;lt;4&amp;gt;CPU 1
&amp;lt;4&amp;gt;Modules linked in: lustre(U) ofd(U) osp(U) lod(U) ost(U) mdt(U) mdd(U) mgs(U) osd_ldiskfs(U) ldiskfs(U) lquota(U) lfsck(U) jbd2 obdecho(U) mgc(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) ksocklnd(U) lnet(U) sha512_generic crc32c_intel libcfs(U) nfsd exportfs nfs lockd fscache auth_rpcgss nfs_acl sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm sg joydev microcode iTCO_wdt iTCO_vendor_support igb i2c_algo_bit sb_edac edac_core i2c_i801 i2c_core lpc_ich mfd_core ioatdma dca shpchp ext3 jbd mbcache sd_mod crc_t10dif isci libsas scsi_transport_sas mlx4_ib ib_sa ib_mad ib_core ib_addr ipv6 mlx4_en ptp pps_core mlx4_core ahci wmi dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
&amp;lt;4&amp;gt;
&amp;lt;4&amp;gt;Pid: 113432, comm: ldlm_bl_03 Not tainted 2.6.32.573.8.1.el6_lustre #1 Intel Corporation S2600GZ/S2600GZ
&amp;lt;4&amp;gt;RIP: 0010:[&amp;lt;ffffffff8153cdfc&amp;gt;]  [&amp;lt;ffffffff8153cdfc&amp;gt;] _spin_lock+0x1c/0x30
&amp;lt;4&amp;gt;RSP: 0018:ffff88030307ba30  EFLAGS: 00000297
&amp;lt;4&amp;gt;RAX: 000000000000059c RBX: ffff88030307ba30 RCX: 0000000000000000
&amp;lt;4&amp;gt;RDX: 000000000000059b RSI: 0000000000000000 RDI: ffffffffa071db68
&amp;lt;4&amp;gt;RBP: ffffffff8100bc0e R08: 00000000fffffffe R09: 0000000000000000
&amp;lt;4&amp;gt;R10: 000000000000000f R11: 000000000000000f R12: ffff8802fac8f970
&amp;lt;4&amp;gt;R13: ffff8803030df060 R14: 0000000000000097 R15: 00000000fffffffc
&amp;lt;4&amp;gt;FS:  0000000000000000(0000) GS:ffff880038620000(0000) knlGS:0000000000000000
&amp;lt;4&amp;gt;CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
&amp;lt;4&amp;gt;CR2: 0000000002527c1c CR3: 0000000001a8d000 CR4: 00000000000407e0
&amp;lt;4&amp;gt;DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
&amp;lt;4&amp;gt;DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
&amp;lt;4&amp;gt;Process ldlm_bl_03 (pid: 113432, threadinfo ffff880303078000, task ffff8803f1eb4040)
&amp;lt;4&amp;gt;Stack:
&amp;lt;4&amp;gt; ffff88030307ba50 ffffffffa05fb78a ffff8803030df060 ffff8803030df060
&amp;lt;4&amp;gt;&amp;lt;d&amp;gt; ffff88030307ba70 ffffffffa05fb9f2 ffff88030307ba70 ffff8803030df000
&amp;lt;4&amp;gt;&amp;lt;d&amp;gt; ffff88030307bab0 ffffffffa05f3955 ffff8802fae12888 ffff8803030df000
&amp;lt;4&amp;gt;Call Trace:
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05fb78a&amp;gt;] ? lu_ref_print_all+0x1a/0x80 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05fb9f2&amp;gt;] ? lu_ref_fini+0x82/0x170 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05f3955&amp;gt;] ? cl_page_free+0xe5/0x540 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05f3f5c&amp;gt;] ? cl_page_put+0x1ac/0x3e0 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa05fc179&amp;gt;] ? lu_ref_del+0x109/0x2c0 [obdclass]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0aa709c&amp;gt;] ? osc_page_gang_lookup+0x1dc/0x380 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0aa6b80&amp;gt;] ? discard_cb+0x0/0x190 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0aa7384&amp;gt;] ? osc_lock_discard_pages+0x144/0x240 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0a9d8b5&amp;gt;] ? osc_lock_flush+0x55/0x260 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0aa6b80&amp;gt;] ? discard_cb+0x0/0x190 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0a9d8eb&amp;gt;] ? osc_lock_flush+0x8b/0x260 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0a9dd68&amp;gt;] ? osc_ldlm_blocking_ast+0x2a8/0x3c0 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa07c34bc&amp;gt;] ? ldlm_cancel_callback+0x6c/0x1a0 [ptlrpc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa07de40a&amp;gt;] ? ldlm_cli_cancel_local+0x8a/0x470 [ptlrpc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa07e2fac&amp;gt;] ? ldlm_cli_cancel+0x7c/0x380 [ptlrpc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa0a9db9b&amp;gt;] ? osc_ldlm_blocking_ast+0xdb/0x3c0 [osc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa04889f1&amp;gt;] ? libcfs_debug_msg+0x41/0x50 [libcfs]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa07e6ea0&amp;gt;] ? ldlm_handle_bl_callback+0x130/0x400 [ptlrpc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa07e81a1&amp;gt;] ? ldlm_bl_thread_main+0x271/0x3f0 [ptlrpc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff810672b0&amp;gt;] ? default_wake_function+0x0/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffffa07e7f30&amp;gt;] ? ldlm_bl_thread_main+0x0/0x3f0 [ptlrpc]
&amp;lt;4&amp;gt; [&amp;lt;ffffffff810a0fce&amp;gt;] ? kthread+0x9e/0xc0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c28a&amp;gt;] ? child_rip+0xa/0x20
&amp;lt;4&amp;gt; [&amp;lt;ffffffff810a0f30&amp;gt;] ? kthread+0x0/0xc0
&amp;lt;4&amp;gt; [&amp;lt;ffffffff8100c280&amp;gt;] ? child_rip+0x0/0x20
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="137653" author="gerrit" created="Wed, 30 Dec 2015 16:52:29 +0000"  >&lt;p&gt;Faccini Bruno (bruno.faccini@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/17756&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17756&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7619&quot; title=&quot;USE_LU_REF/lu_ref feature broken after some REFASSERT()s have been added without lu_ref::lf_guard protection&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7619&quot;&gt;&lt;del&gt;LU-7619&lt;/del&gt;&lt;/a&gt; obdclass: protect REFASSERT() with lu_ref::lf_guard&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 71ab9f599d2035aac6d906be3402e7b0f8aa9044&lt;/p&gt;</comment>
                            <comment id="158417" author="gerrit" created="Mon, 11 Jul 2016 23:54:12 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/17756/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/17756/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-7619&quot; title=&quot;USE_LU_REF/lu_ref feature broken after some REFASSERT()s have been added without lu_ref::lf_guard protection&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-7619&quot;&gt;&lt;del&gt;LU-7619&lt;/del&gt;&lt;/a&gt; obdclass: protect REFASSERT() with lu_ref::lf_guard&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: 838514369ac245d2dcfdcda7715a2798fe9ee755&lt;/p&gt;</comment>
                            <comment id="158669" author="jgmitter" created="Wed, 13 Jul 2016 18:31:43 +0000"  >&lt;p&gt;Patch landed to master for 2.9.0&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzxx1z:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>