[LU-6005] lnet-selftest test_smoke: st_timer in D state
Created: 08/Dec/14  Updated: 28/Nov/16

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0
Fix Version/s: None

Type: Bug
Priority: Major
Reporter: Maloo
Assignee: WC Triage
Resolution: Unresolved
Votes: 0
Labels: None
Environment:

server and client: lustre-master build # 2770 zfs


Issue Links:
Related
is related to LU-1891 Lnet selftest st_timer process in D s... Open
Severity: 3
Rank (Obsolete): 16741

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/5d854ee4-7e53-11e4-afba-5254006e85c2.

The sub-test test_smoke failed with the following error:

test failed to respond and timed out

The st_timer thread is in D state on all server and client nodes.

11:17:57:lst_t_01_00   S 0000000000000001     0 26345      2 0x00000080
11:17:57: ffff88007ccffe40 0000000000000046 0000000000000000 ffff88007ccffe04
11:17:57: ffff880000000000 ffff880063182000 0000000000000000 ffff880063182048
11:17:57: ffff88007b81f058 ffff88007ccfffd8 000000000000fbc8 ffff88007b81f058
11:17:57:Call Trace:
11:17:57: [<ffffffff8109b1ee>] ? prepare_to_wait_exclusive+0x4e/0x80
11:17:57: [<ffffffffa0b8e277>] cfs_wi_scheduler+0x3d7/0x460 [libcfs]
11:17:57: [<ffffffff8109afa0>] ? autoremove_wake_function+0x0/0x40
11:17:57: [<ffffffffa0b8dea0>] ? cfs_wi_scheduler+0x0/0x460 [libcfs]
11:17:57: [<ffffffff8109abf6>] kthread+0x96/0xa0
11:17:57: [<ffffffff8100c20a>] child_rip+0xa/0x20
11:17:57: [<ffffffff8109ab60>] ? kthread+0x0/0xa0
11:17:57: [<ffffffff8100c200>] ? child_rip+0x0/0x20
11:17:57:st_timer      D 0000000000000001     0 26346      2 0x00000080
11:17:57: ffff88007a4a9dd0 0000000000000046 0000000000000000 ffff88007a4a9d94
11:17:57: 0000000000000000 0000000000000286 ffff88007a4a9d70 ffffffff81083e1c
11:17:57: ffff88007bdf1ab8 ffff88007a4a9fd8 000000000000fbc8 ffff88007bdf1ab8
11:17:57:Call Trace:
11:17:57: [<ffffffff81083e1c>] ? lock_timer_base+0x3c/0x70
11:17:57: [<ffffffff81529c72>] schedule_timeout+0x192/0x2e0
11:17:57: [<ffffffff81083f30>] ? process_timeout+0x0/0x10
11:17:57: [<ffffffffa121e9ce>] stt_timer_main+0xde/0x110 [lnet_selftest]
11:17:57: [<ffffffff8109afa0>] ? autoremove_wake_function+0x0/0x40
11:17:57: [<ffffffffa121e8f0>] ? stt_timer_main+0x0/0x110 [lnet_selftest]
11:17:57: [<ffffffff8109abf6>] kthread+0x96/0xa0
11:17:57: [<ffffffff8100c20a>] child_rip+0xa/0x20
11:17:57: [<ffffffff8109ab60>] ? kthread+0x0/0xa0
11:17:57: [<ffffffff8100c200>] ? child_rip+0x0/0x20
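
To check other console logs for the same symptom, the task dump above can be scanned for threads in D state. The following is a minimal sketch, assuming the log uses the same layout as the excerpt above (an optional HH:MM:SS: prefix, a task header line, then "[<addr>] symbol+0x..." frames). The names parse_tasks and blocked_tasks are hypothetical helpers, not part of Lustre or Maloo, and SAMPLE is an abridged copy of the trace from this ticket.

```python
import re

# Task header, e.g. "11:17:57:st_timer      D 0000000000000001     0 26346      2 0x00000080"
TASK_RE = re.compile(
    r"^(?:\d{2}:\d{2}:\d{2}:)?\s*(?P<name>\S+)\s+(?P<state>[A-Z])\s+"
    r"[0-9a-f]{16}\s+\d+\s+(?P<pid>\d+)\s+\d+\s+0x[0-9a-f]+\s*$"
)
# Stack frame, e.g. "[<ffffffff81529c72>] schedule_timeout+0x192/0x2e0".
# A leading "?" marks a frame the kernel unwinder could not confirm.
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(?P<unreliable>\?\s+)?(?P<func>[\w.]+)\+0x"
)

# Abridged from the dump in this ticket (raw stack words omitted).
SAMPLE = """\
11:17:57:lst_t_01_00   S 0000000000000001     0 26345      2 0x00000080
11:17:57:Call Trace:
11:17:57: [<ffffffff8109b1ee>] ? prepare_to_wait_exclusive+0x4e/0x80
11:17:57: [<ffffffffa0b8e277>] cfs_wi_scheduler+0x3d7/0x460 [libcfs]
11:17:57: [<ffffffff8109abf6>] kthread+0x96/0xa0
11:17:57:st_timer      D 0000000000000001     0 26346      2 0x00000080
11:17:57: ffff88007a4a9dd0 0000000000000046 0000000000000000 ffff88007a4a9d94
11:17:57:Call Trace:
11:17:57: [<ffffffff81083e1c>] ? lock_timer_base+0x3c/0x70
11:17:57: [<ffffffff81529c72>] schedule_timeout+0x192/0x2e0
11:17:57: [<ffffffffa121e9ce>] stt_timer_main+0xde/0x110 [lnet_selftest]
11:17:57: [<ffffffff8109abf6>] kthread+0x96/0xa0
11:17:57: [<ffffffff8100c20a>] child_rip+0xa/0x20
"""


def parse_tasks(dump):
    """Return one dict per task: name, state, pid, and confirmed frames."""
    tasks = []
    for line in dump.splitlines():
        header = TASK_RE.match(line)
        if header:
            tasks.append({
                "name": header["name"],
                "state": header["state"],
                "pid": int(header["pid"]),
                "frames": [],
            })
            continue
        frame = FRAME_RE.search(line)
        # Skip "?" frames and any frame seen before the first task header.
        if frame and tasks and not frame["unreliable"]:
            tasks[-1]["frames"].append(frame["func"])
    return tasks


def blocked_tasks(dump):
    """Return only the tasks in uninterruptible sleep (state D)."""
    return [t for t in parse_tasks(dump) if t["state"] == "D"]


if __name__ == "__main__":
    for task in blocked_tasks(SAMPLE):
        print(task["name"], task["pid"], " <- ".join(task["frames"]))
```

Run against a full console log, this lists each D-state thread with its confirmed call chain, which makes it easier to tell whether a new failure shows st_timer sleeping in schedule_timeout from stt_timer_main as it does here. Task names containing spaces are not handled by this sketch.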

Info required for matching: lnet-selftest smoke



 Comments   
Comment by Saurabh Tandan (Inactive) [ 19/Jan/16 ]

Another instance found in interop testing: EL6.7 server / 2.7.1 client
Server: master, build# 3303, RHEL 6.7
Client: 2.7.1, b2_7_fe/34
https://testing.hpdd.intel.com/test_sets/43947c0e-bad8-11e5-87b4-5254006e85c2
