[LU-15255] sanity-hsm: test_10d timeout Created: 19/Nov/21  Updated: 19/Nov/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Sergey Cheremencev <c17829@cray.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/34e4cf81-8e1c-4bf9-adfa-63ca4bf7a12d

client1:

[ 9732.690495] Lustre: 96254:0:(client.c:2290:ptlrpc_expire_one_request()) Skipped 8 previous similar messages
[ 9737.804633] Lustre: 96254:0:(client.c:2290:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1637018296/real 1637018296]  req@0000000030b51d09 x1716532524568640/t0(0) o400->lustre-OST0007-osc-ffff8c09bddee000@10.240.29.236@tcp:28/4 lens 224/224 e 0 to 1 dl 1637018303 ref 1 fl Rpc:XNQr/0/ffffffff rc 0/-1 job:'kworker/u4:1.0'

There is a jbd2 in uninterruptible state at MDS1, however I am not sure how long does it sleep:

https://testing.whamcloud.com/test_logs/53cb436d-b36c-43f3-bace-87dd85e6deb0/show_text
...
[13269.694195] jbd2/vda1-8     D    0   446      2 0x80004000
[13269.695207] Call Trace:
[13269.695684]  __schedule+0x2c4/0x700
[13269.696339]  ? bit_wait_timeout+0x90/0x90
[13269.697120]  schedule+0x38/0xa0
[13269.697724]  io_schedule+0x12/0x40
[13269.698362]  bit_wait_io+0xd/0x50
[13269.698990]  __wait_on_bit+0x6c/0x80
[13269.699659]  out_of_line_wait_on_bit+0x91/0xb0
[13269.700479]  ? init_wait_var_entry+0x50/0x50
[13269.701310]  jbd2_journal_commit_transaction+0x1580/0x19f0 [jbd2]
[13269.702449]  ? finish_task_switch+0x77/0x2a0
[13269.703258]  kjournald2+0xbd/0x270 [jbd2]
[13269.704028]  ? finish_wait+0x80/0x80
[13269.704718]  ? commit_timeout+0x10/0x10 [jbd2]
[13269.705535]  kthread+0x112/0x130
[13269.706145]  ? kthread_flush_work_fn+0x10/0x10
[13269.706981]  ret_from_fork+0x35/0x40

Generated at Sat Feb 10 03:16:46 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.