[LU-5165] Test failure on test suite recovery-small, subtest test_29a Created: 09/Jun/14  Updated: 08/Nov/16

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Related
is related to LU-8806 LFSCK hangs on MDT - osp_precreate_cl... Resolved
Severity: 3
Rank (Obsolete): 14238

 Description   

This issue was created by maloo for Nathaniel Clark <nathaniel.l.clark@intel.com>

This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/7e006fa2-ee1e-11e3-8360-52540035b04c.

The sub-test test_29a failed with the following error:

test failed to respond and timed out

Info required for matching: recovery-small 29a

OST Console Log:

18:49:28:INFO: task txg_quiesce:3824 blocked for more than 120 seconds.
18:49:28:      Tainted: P           ---------------    2.6.32-431.17.1.el6_lustre.g357ff2e.x86_64 #1
18:49:28:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
18:49:28:txg_quiesce   D 0000000000000000     0  3824      2 0x00000080
18:49:28: ffff8800794a9d80 0000000000000046 0000000000000000 0000000000000001
18:49:28: ffff8800794a9d30 ffffffff8103f9d8 ffff8800ffffffff 00000929463a4359
18:49:28: ffff88007af09af8 ffff8800794a9fd8 000000000000fbc8 ffff88007af09af8
18:49:28:Call Trace:
18:49:28: [<ffffffff8103f9d8>] ? pvclock_clocksource_read+0x58/0xd0
18:49:28: [<ffffffff8109b14e>] ? prepare_to_wait_exclusive+0x4e/0x80
18:49:28: [<ffffffffa0142edd>] cv_wait_common+0xed/0x100 [spl]
18:49:28: [<ffffffff8109af00>] ? autoremove_wake_function+0x0/0x40
18:49:28: [<ffffffff81290eac>] ? __bitmap_weight+0x8c/0xb0
18:49:28: [<ffffffffa0142f45>] __cv_wait+0x15/0x20 [spl]
18:49:28: [<ffffffffa024297b>] txg_quiesce_thread+0x20b/0x2d0 [zfs]
18:49:28: [<ffffffff810591a9>] ? set_user_nice+0xc9/0x130
18:49:28: [<ffffffffa0242770>] ? txg_quiesce_thread+0x0/0x2d0 [zfs]
18:49:28: [<ffffffffa013e9bf>] thread_generic_wrapper+0x5f/0x70 [spl]
18:49:28: [<ffffffffa013e960>] ? thread_generic_wrapper+0x0/0x70 [spl]
18:49:28: [<ffffffff8109ab56>] kthread+0x96/0xa0
18:49:28: [<ffffffff8100c20a>] child_rip+0xa/0x20
18:49:28: [<ffffffff8109aac0>] ? kthread+0x0/0xa0
18:49:28: [<ffffffff8100c200>] ? child_rip+0x0/0x20
18:49:28:INFO: task ll_ost00_000:3959 blocked for more than 120 seconds.
18:49:28:      Tainted: P           ---------------    2.6.32-431.17.1.el6_lustre.g357ff2e.x86_64 #1
18:49:28:"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
18:49:28:ll_ost00_000  D 0000000000000000     0  3959      2 0x00000080
18:49:28: ffff880073a6b990 0000000000000046 0000000000000028 0000000000000001
18:49:28: ffff880073a6b990 0000000000000086 0000000000000000 ffff880071b31d80
18:49:28: ffff880079111af8 ffff880073a6bfd8 000000000000fbc8 ffff880079111af8
18:49:28:Call Trace:
18:49:28: [<ffffffff8109b14e>] ? prepare_to_wait_exclusive+0x4e/0x80
18:49:28: [<ffffffffa0142edd>] cv_wait_common+0xed/0x100 [spl]
18:49:28: [<ffffffff8109af00>] ? autoremove_wake_function+0x0/0x40
18:49:28: [<ffffffffa0142f45>] __cv_wait+0x15/0x20 [spl]
18:49:28: [<ffffffffa0241e9b>] txg_wait_open+0x7b/0xa0 [zfs]
18:49:28: [<ffffffffa0206a5d>] dmu_tx_wait+0xed/0xf0 [zfs]
18:49:28: [<ffffffffa0206aee>] dmu_tx_assign+0x8e/0x4e0 [zfs]
18:49:28: [<ffffffffa0e7a56c>] osd_trans_start+0x9c/0x410 [osd_zfs]
18:49:28: [<ffffffffa0f705ac>] ofd_trans_start+0x7c/0x100 [ofd]
18:49:28: [<ffffffffa0f712d3>] ofd_object_destroy+0x203/0x680 [ofd]
18:49:28: [<ffffffffa0f6ceed>] ofd_destroy_by_fid+0x35d/0x620 [ofd]
18:49:28: [<ffffffffa0957bc0>] ? ldlm_blocking_ast+0x0/0x180 [ptlrpc]
18:49:28: [<ffffffffa0959230>] ? ldlm_completion_ast+0x0/0x930 [ptlrpc]
18:49:28: [<ffffffffa0f667d2>] ofd_destroy_hdl+0x2e2/0xb80 [ofd]
18:49:28: [<ffffffffa09e52ac>] tgt_request_handle+0x23c/0xac0 [ptlrpc]
18:49:28: [<ffffffffa0994d1a>] ptlrpc_main+0xd1a/0x1980 [ptlrpc]
18:49:28: [<ffffffffa0994000>] ? ptlrpc_main+0x0/0x1980 [ptlrpc]
18:49:28: [<ffffffff8109ab56>] kthread+0x96/0xa0
18:49:28: [<ffffffff8100c20a>] child_rip+0xa/0x20
18:49:28: [<ffffffff8109aac0>] ? kthread+0x0/0xa0
18:49:28: [<ffffffff8100c200>] ? child_rip+0x0/0x20

Generated at Sat Feb 10 01:49:05 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.