[LU-16339] bug in qmt may cause clients to hang in cl_sync_io_wait Created: 23/Nov/22  Updated: 29/Jan/24  Resolved: 24/Jul/23

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Sergey Cheremencev Assignee: Sergey Cheremencev
Resolution: Fixed Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

A bug in pool quotas may cause a write on a client to hang with the following backtrace:

[<ffffffffc0731da5>] cl_sync_io_wait+0x1c5/0x480 [obdclass]
[<ffffffffc0732228>] cl_io_submit_sync+0x188/0x280 [obdclass]
[<ffffffffc134735e>] vvp_io_commit_sync+0x10e/0x360 [lustre]
[<ffffffffc1349a0a>] vvp_io_write_commit+0x3aa/0x5a0 [lustre]
[<ffffffffc134a28b>] vvp_io_write_start+0x68b/0xd60 [lustre]
[<ffffffffc072fe60>] cl_io_start+0x70/0x150 [obdclass]
[<ffffffffc073251f>] cl_io_loop+0x9f/0x220 [obdclass]
[<ffffffffc12edd96>] ll_file_io_generic+0x346/0xec0 [lustre]
[<ffffffffc12eef66>] ll_file_aio_write+0x656/0xa50 [lustre]
[<ffffffffc12ef471>] ll_file_write+0x111/0x1e0 [lustre]
[<ffffffffb7041320>] vfs_write+0xc0/0x1f0
[<ffffffffb704213f>] SyS_write+0x7f/0xf0
[<ffffffffb7576ddb>] system_call_fastpath+0x22/0x27
[<ffffffffffffffff>] 0xffffffffffffffff


 Comments   
Comment by Gerrit Updater [ 23/Nov/22 ]

"Sergey Cheremencev <sergey.cheremencev@hpe.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49227
Subject: LU-16339 tests: pool quota hung on write
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 84e33ae0015e709c4bc0cc9749f454b94d4a0478

Comment by Gerrit Updater [ 23/Nov/22 ]

"Sergey Cheremencev <sergey.cheremencev@hpe.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49228
Subject: LU-16339 quota: notify OSTs until lge_qunit_nu is set
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 77652e858f60ce7fc29f5f870e126d4162722eaa

Comment by Gerrit Updater [ 11/Apr/23 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49228/
Subject: LU-16339 quota: notify OSTs until lge_qunit_nu is set
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 6c0b4329d046de283eeb254fca561be9386df68a

Comment by Cory Spitz [ 21/Jul/23 ]

scherementsev, I think you can resolve this issue now, agreed?
