[LU-10065] ost-pools test_5a: test failed to respond and timed out Created: 03/Oct/17  Updated: 29/Aug/18  Resolved: 29/Aug/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Casper Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None
Environment:

trevis, full DNE
servers: el7.4, zfs, branch master, v2.10.53.1, b3642
clients: el7.4, branch master, v2.10.53.1, b3642


Issue Links:
Duplicate
duplicates LU-10250 replay-single test_74: hang and time... Open
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

https://testing.hpdd.intel.com/test_sets/e9486028-9cf1-11e7-b778-5254006e85c2

Trace is very similar to that in LU-6649 (6649 in OST dmesg - this is in MDS dmesg).

From MDS dmesg:

[16630.526038] mdt_rdpg00_003  D 0000000000000000     0 28590      2 0x00000080
[16630.527676]  ffff8800686b39c0 0000000000000046 ffff88005b6cdee0 ffff8800686b3fd8
[16630.529444]  ffff8800686b3fd8 ffff8800686b3fd8 ffff88005b6cdee0 ffff88005cf3b2f8
[16630.531188]  ffff88005cf3b240 ffff88005cf3b268 ffff88005cf3b300 0000000000000000
[16630.532894] Call Trace:
[16630.534180]  [<ffffffff816a94a9>] schedule+0x29/0x70
[16630.535616]  [<ffffffffc071c4d5>] cv_wait_common+0x125/0x150 [spl]
[16630.537227]  [<ffffffff810b1910>] ? wake_up_atomic_t+0x30/0x30
[16630.538789]  [<ffffffffc071c515>] __cv_wait+0x15/0x20 [spl]
[16630.540339]  [<ffffffffc086b17f>] txg_wait_synced+0xef/0x140 [zfs]
[16630.541943]  [<ffffffffc0820a75>] dmu_tx_wait+0x275/0x3c0 [zfs]
[16630.543448]  [<ffffffffc0820c51>] dmu_tx_assign+0x91/0x490 [zfs]
[16630.545007]  [<ffffffffc10a6efa>] osd_trans_start+0xaa/0x3c0 [osd_zfs]
[16630.546617]  [<ffffffffc1071128>] qmt_trans_start_with_slv+0x248/0x530 [lquota]
[16630.548238]  [<ffffffffc106a196>] qmt_dqacq0+0x1a6/0xf00 [lquota]
[16630.549890]  [<ffffffffc1052a36>] ? lqe_locate+0x36/0x830 [lquota]
[16630.551450]  [<ffffffffc107501e>] ? qmt_pool_lqe_lookup+0x6e/0x1eb [lquota]
[16630.553043]  [<ffffffffc106b4a9>] qmt_dqacq+0x5b9/0x8c0 [lquota]
[16630.554625]  [<ffffffffc121324f>] mdt_quota_dqacq+0x5f/0x150 [mdt]
[16630.556219]  [<ffffffffc0ee3225>] tgt_request_handle+0x925/0x1370 [ptlrpc]
[16630.557873]  [<ffffffffc0e8c0c6>] ptlrpc_server_handle_request+0x236/0xa90 [ptlrpc]
[16630.559534]  [<ffffffff810ba588>] ? __wake_up_common+0x58/0x90
[16630.561063]  [<ffffffffc0e8f862>] ptlrpc_main+0xa92/0x1e40 [ptlrpc]
[16630.562705]  [<ffffffffc0e8edd0>] ? ptlrpc_register_service+0xe80/0xe80 [ptlrpc]
[16630.564375]  [<ffffffff810b098f>] kthread+0xcf/0xe0
[16630.565803]  [<ffffffff810b08c0>] ? insert_kthread_work+0x40/0x40
[16630.567370]  [<ffffffff816b4f18>] ret_from_fork+0x58/0x90
[16630.568877]  [<ffffffff810b08c0>] ? insert_kthread_work+0x40/0x40


 Comments   
Comment by Andreas Dilger [ 29/Aug/18 ]

Close as a duplicate of LU-10250.

Generated at Sat Feb 10 02:31:44 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.