<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 01:55:21 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-5885] LFSCK 3: &#8216;lctl lfsck_start -t namespace&#8217; Not Progressing Under Remove Workload</title>
                <link>https://jira.whamcloud.com/browse/LU-5885</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;While running the LFSCK Phase 3 test plan, I created 10,000 objects (files, remote directories, local directories, and links) and then ran:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# lctl lfsck_start -A -M scratch-MDT0000 -r -t namespace -c -C
Started LFSCK on the device scratch-MDT0000: scrub namespace
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;On the client, I then deleted all files and directories in the file system. At some point LFSCK hung; &#8216;lctl lfsck_stop&#8217; does not stop it and itself appears to hang. LFSCK progresses to a certain point and then stalls: the time counters keep advancing, but none of the other counters increase, and the status stays at &#8220;scanning-phase1&#8221;.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;# cat /proc/fs/lustre/mdd/scratch-MDT0000/lfsck_namespace 
name: lfsck_namespace
magic: 0xa0629d03
version: 2
status: scanning-phase1
flags:
param: all_targets,create_ostobj,
time_since_last_completed: 59865 seconds
time_since_latest_start: 8714 seconds
time_since_last_checkpoint: N/A
latest_start_position: 77, N/A, N/A
last_checkpoint_position: N/A, N/A, N/A
first_failure_position: N/A, N/A, N/A
checked_phase1: 3347202
checked_phase2: 0
updated_phase1: 0
updated_phase2: 0
failed_phase1: 0
failed_phase2: 0
directories: 182634
dirent_repaired: 0
linkea_repaired: 0
nlinks_repaired: 0
multiple_linked_checked: 0
multiple_linked_repaired: 0
unknown_inconsistency: 0
unmatched_pairs_repaired: 0
dangling_repaired: 0
multiple_referenced_repaired: 0
bad_file_type_repaired: 0
lost_dirent_repaired: 0
local_lost_found_scanned: 0
local_lost_found_moved: 0
local_lost_found_skipped: 0
local_lost_found_failed: 0
striped_dirs_scanned: 0
striped_dirs_repaired: 0
striped_dirs_failed: 0
striped_dirs_disabled: 0
striped_dirs_skipped: 0
striped_shards_scanned: 1560
striped_shards_repaired: 0
striped_shards_failed: 0
striped_shards_skipped: 0
name_hash_repaired: 0
success_count: 23
run_time_phase1: 8714 seconds
run_time_phase2: 0 seconds
average_speed_phase1: 384 items/sec
average_speed_phase2: N/A
real_time_speed_phase1: 384 items/sec
real_time_speed_phase2: N/A
current_position: 180358673, N/A, N/A
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
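
&lt;p&gt;A minimal Python sketch of the symptom above, assuming two snapshots of the lfsck_namespace file taken some time apart: if run_time_phase1 advances while checked_phase1 stays the same, the scan is not progressing. The helper names (parse_status, leading_int, average_speed, is_stalled) and the second snapshot are hypothetical; they are not part of Lustre or of the original test run.&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;
```python
def parse_status(text):
    """Parse 'key: value' lines from lfsck_namespace output into a dict."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def leading_int(value):
    """Return the leading integer of a value such as '8714 seconds'."""
    return int(value.split()[0])


def average_speed(status):
    """Whole items per second over phase 1 (0 if no run time was recorded)."""
    seconds = leading_int(status["run_time_phase1"])
    if seconds == 0:
        return 0
    return leading_int(status["checked_phase1"]) // seconds


def is_stalled(before, after):
    """True when run time advanced between snapshots but no item was checked."""
    time_moved = (leading_int(after["run_time_phase1"])
                  != leading_int(before["run_time_phase1"]))
    items_moved = (leading_int(after["checked_phase1"])
                   != leading_int(before["checked_phase1"]))
    return time_moved and not items_moved


# Values copied from the status output in this report.
first = parse_status("""\
checked_phase1: 3347202
run_time_phase1: 8714 seconds
""")
# Hypothetical later snapshot: one more minute of run time, nothing new checked.
second = dict(first, run_time_phase1="8774 seconds")

print(average_speed(first))       # 384
print(is_stalled(first, second))  # True
```
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;For the first snapshot the sketch returns 384 items/sec (3347202 items over 8714 seconds), which agrees with the reported average_speed_phase1.&lt;/p&gt;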

&lt;p&gt;On the MDT with index 0, dmesg contains the following hung-task report, logged repeatedly:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;INFO: task lfsck_namespace:1210 blocked for more than 120 seconds.
      Not tainted 2.6.32-431.29.2.el6_lustre.g8fab48a.x86_64 #1
&quot;echo 0 &amp;gt; /proc/sys/kernel/hung_task_timeout_secs&quot; disables this message.
lfsck_namespa D 0000000000000001     0  1210      2 0x00000080
 ffff880485cfbac0 0000000000000046 0000000000000000 ffff88050b8c13e0
 ffff88050b8c13e0 ffff881023077000 ffff880485cfbac0 ffffffffa06d4e39
 ffff88047443c638 ffff880485cfbfd8 000000000000fbc8 ffff88047443c638
Call Trace:
 [&amp;lt;ffffffffa06d4e39&amp;gt;] ? lu_object_find_try+0x99/0x2b0 [obdclass]
 [&amp;lt;ffffffffa06d5085&amp;gt;] lu_object_find_at+0x35/0x100 [obdclass]
 [&amp;lt;ffffffff81061d00&amp;gt;] ? default_wake_function+0x0/0x20
 [&amp;lt;ffffffffa04f14b3&amp;gt;] ? ldiskfs_mark_inode_dirty+0x83/0x1f0 [ldiskfs]
 [&amp;lt;ffffffffa06d518f&amp;gt;] lu_object_find_slice+0x1f/0x80 [obdclass]
 [&amp;lt;ffffffffa0f8f958&amp;gt;] lfsck_namespace_handle_striped_master+0x118/0xb10 [lfsck]
 [&amp;lt;ffffffffa0b5de4c&amp;gt;] ? fld_local_lookup+0x6c/0x290 [fld]
 [&amp;lt;ffffffffa0f5d23f&amp;gt;] lfsck_namespace_assistant_handler_p1+0x5bf/0x1f40 [lfsck]
 [&amp;lt;ffffffffa06d3743&amp;gt;] ? lu_object_free+0x113/0x1a0 [obdclass]
 [&amp;lt;ffffffffa057b482&amp;gt;] ? cfs_hash_bd_from_key+0x42/0xd0 [libcfs]
 [&amp;lt;ffffffff81283a85&amp;gt;] ? _atomic_dec_and_lock+0x55/0x80
 [&amp;lt;ffffffffa0f4d197&amp;gt;] lfsck_assistant_engine+0x497/0x1c50 [lfsck]
 [&amp;lt;ffffffff81061d00&amp;gt;] ? default_wake_function+0x0/0x20
 [&amp;lt;ffffffffa0f4cd00&amp;gt;] ? lfsck_assistant_engine+0x0/0x1c50 [lfsck]
 [&amp;lt;ffffffff8109abf6&amp;gt;] kthread+0x96/0xa0
 [&amp;lt;ffffffff8100c20a&amp;gt;] child_rip+0xa/0x20
 [&amp;lt;ffffffff8109ab60&amp;gt;] ? kthread+0x0/0xa0
 [&amp;lt;ffffffff8100c200&amp;gt;] ? child_rip+0x0/0x20
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Similar stack traces appear on the second MDS/MDT, which is also stuck in &#8220;scanning-phase1&#8221;.&lt;/p&gt;</description>
                <environment>OpenSFS cluster with two MDSs with one MDT each, three OSSs and three clients. Lustre tag 2.6.54 build 2725</environment>
        <key id="27510">LU-5885</key>
            <summary>LFSCK 3: &#8216;lctl lfsck_start -t namespace&#8217; Not Progressing Under Remove Workload</summary>
                <type id="1" iconUrl="https://jira.whamcloud.com/secure/viewavatar?size=xsmall&amp;avatarId=11303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.whamcloud.com/images/icons/priorities/blocker.svg">Blocker</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="1">Fixed</resolution>
                                        <assignee username="yong.fan">nasf</assignee>
                                    <reporter username="jamesanunez">James Nunez</reporter>
                        <labels>
                            <label>MB</label>
                            <label>lfsck</label>
                    </labels>
                <created>Fri, 7 Nov 2014 19:49:42 +0000</created>
                <updated>Wed, 23 Dec 2015 19:00:00 +0000</updated>
                            <resolved>Wed, 10 Dec 2014 23:43:25 +0000</resolved>
                                    <version>Lustre 2.7.0</version>
                                    <fixVersion>Lustre 2.7.0</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                            <comments>
                            <comment id="98782" author="jlevi" created="Mon, 10 Nov 2014 18:13:35 +0000"  >&lt;p&gt;Fan Yong,&lt;br/&gt;
Could you take a look at this one?&lt;br/&gt;
Thank you!&lt;/p&gt;</comment>
                            <comment id="99023" author="jamesanunez" created="Thu, 13 Nov 2014 03:15:54 +0000"  >&lt;p&gt;I ran this test again on lustre-master tag 2.6.90 build #2734 and reproduced the issue very quickly. The workload was similar to the one described above: I ran test 3.3.3, creating about 130 directories with 10,000 objects each, ran the same workload in a different directory, started LFSCK on both MDSs, and then removed the directories/objects created by test 3.3.3.&lt;/p&gt;

&lt;p&gt;I captured kernel logs on both the MDSs. They are at uploads/&lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5885&quot; title=&quot;LFSCK 3: &#8216;lctl lfsck_start -t namespace&#8217; Not Progressing Under Remove Workload&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5885&quot;&gt;&lt;del&gt;LU-5885&lt;/del&gt;&lt;/a&gt;/lfsck_log_1.txt (MDS0) and lfsck_log_2.txt (MDS1)&lt;/p&gt;

&lt;p&gt;Looking at lfsck_namespace, there might be something wrong with the real-time timers that calculate the object scanning rate: real_time_speed_phase1 never decreases, while average_speed_phase1 does. When LFSCK appears to hang, meaning it is no longer scanning objects, I&#8217;d expect real_time_speed_phase1 to decrease, but it just keeps growing:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;real_time_speed_phase1: 21441823787665 items/sec
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="99298" author="gerrit" created="Sun, 16 Nov 2014 06:46:25 +0000"  >&lt;p&gt;Fan Yong (fan.yong@intel.com) uploaded a new patch: &lt;a href=&quot;http://review.whamcloud.com/12741&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/12741&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5885&quot; title=&quot;LFSCK 3: &#8216;lctl lfsck_start -t namespace&#8217; Not Progressing Under Remove Workload&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5885&quot;&gt;&lt;del&gt;LU-5885&lt;/del&gt;&lt;/a&gt; lfsck: deadlock when remove striped dir&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: 1&lt;br/&gt;
Commit: 4ab1b1b15835879a145002221bb4cc492e57c791&lt;/p&gt;</comment>
                            <comment id="99299" author="yong.fan" created="Sun, 16 Nov 2014 06:48:34 +0000"  >&lt;p&gt;James, would you please verify the patch &lt;a href=&quot;http://review.whamcloud.com/#/c/12741/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/12741/&lt;/a&gt;? Thanks!&lt;/p&gt;</comment>
                            <comment id="99545" author="jamesanunez" created="Wed, 19 Nov 2014 04:33:07 +0000"  >&lt;p&gt;With your patch, &lt;a href=&quot;http://review.whamcloud.com/#/c/12741/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/#/c/12741/&lt;/a&gt;, I can run the remove workload and create files/directories/etc., and LFSCK does not hang. I&apos;ve tried this four times and cannot get LFSCK to hang, so this patch fixes the LFSCK hang problem.&lt;/p&gt;</comment>
                            <comment id="99546" author="yong.fan" created="Wed, 19 Nov 2014 04:55:09 +0000"  >&lt;p&gt;Thanks James for the verification!&lt;/p&gt;</comment>
                            <comment id="101262" author="gerrit" created="Wed, 10 Dec 2014 23:36:27 +0000"  >&lt;p&gt;Oleg Drokin (oleg.drokin@intel.com) merged in patch &lt;a href=&quot;http://review.whamcloud.com/12741/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://review.whamcloud.com/12741/&lt;/a&gt;&lt;br/&gt;
Subject: &lt;a href=&quot;https://jira.whamcloud.com/browse/LU-5885&quot; title=&quot;LFSCK 3: &#8216;lctl lfsck_start -t namespace&#8217; Not Progressing Under Remove Workload&quot; class=&quot;issue-link&quot; data-issue-key=&quot;LU-5885&quot;&gt;&lt;del&gt;LU-5885&lt;/del&gt;&lt;/a&gt; lfsck: deadlock when remove striped dir&lt;br/&gt;
Project: fs/lustre-release&lt;br/&gt;
Branch: master&lt;br/&gt;
Current Patch Set: &lt;br/&gt;
Commit: f0137d89fd40ae66aa1d3a180e4e5a6240009dcc&lt;/p&gt;</comment>
                            <comment id="101264" author="yong.fan" created="Wed, 10 Dec 2014 23:43:25 +0000"  >&lt;p&gt;The patch has landed on master.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10011">
                    <name>Related</name>
                                                                <inwardlinks description="is related to">
                                        <issuelink>
            <issuekey id="27118">LU-5774</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                                            <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|hzx0hz:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>16456</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10060" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Severity</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10022"><![CDATA[3]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>