[LU-8914] write call hung in wait_on_page_bit Created: 06/Dec/16  Updated: 06/Jul/21  Resolved: 06/Jul/21

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Olaf Faaland Assignee: Zhenyu Xu
Resolution: Cannot Reproduce Votes: 0
Labels: llnl
Environment:

lustre-2.5.5-10chaos.4
rhel 7.3


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

write system call hangs with identical stack to LU-4540.

Comments in that ticket ascribe the error to LU-3321 commits, however our patch stack does not have most of the LU-3321 commits. The only one I found is:

LU-4786 osc: to not pick busy pages for ELC

so it's unclear to me whether the problem we are seeing is due to the same root cause.



 Comments   
Comment by Olaf Faaland [ 06/Dec/16 ]

See repository is named "lustre-release-fe-llnl" hosted on your gerritt server, for our exact code. This lustre was built from the head of branch 2.5.5-10chaos.

Comment by Olaf Faaland [ 06/Dec/16 ]

gerrit link to the LU-3321 related change in our stack is
https://review.whamcloud.com/#/c/10795/

Comment by Peter Jones [ 07/Dec/16 ]

Bobijam

Could you please assist with this one?

Thanks

Peter

Comment by Zhenyu Xu [ 08/Dec/16 ]

Hi Jinshan,

I think the scenario is similar, ll_write_begin() grabbed a page lock and waiting for another page's WriteBack bit to be cleared, am I right?

Generated at Sat Feb 10 02:21:38 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.