[LU-481] sanity test_119d fails (ASSERTION((struct cl_page *)vmpage->private != slice->cpl_page) failed) Created: 04/Jul/11 Updated: 13/Jul/11 Resolved: 13/Jul/11 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.1.0 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Maloo | Assignee: | Niu Yawei (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Severity: | 3 |
| Rank (Obsolete): | 4261 |
| Description |
|
This issue was created by maloo for bobijam <bobijam@whamcloud.com> This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/fe768762-a677-11e0-bd2a-52540025f9af. 11:29:15:Lustre: DEBUG MARKER: == sanity test 119d: The DIO path should try to send a new rpc once one is completed ================= 11:29:13 (1309804153) |
| Comments |
| Comment by Peter Jones [ 05/Jul/11 ] |
|
Niu will look into this one |
| Comment by Niu Yawei (Inactive) [ 05/Jul/11 ] |
|
In cl_page_alloc(), when o->co_ops->coo_page_init() failed, we call cl_page_free() to free the cl_page without calling cl_page_delete0() prior, which could cause this ASSERT in cl_page_free(), since the linkage between cl_page and vmpage hasn't been broken by cl_page_delete0() yet. I think adding cl_page_delete0() before cl_page_free() in cl_page_alloc() should simply fix this ASSERT error, however, when I look into the code to see why coo_page_init() failed, following piece of code in cl_page_find0() confused me: if (page->cp_type == CPT_TRANSIENT && type == CPT_CACHEABLE) { /* XXX: We should make sure that inode sem * keeps being held in the lifetime of * transient pages, so it is impossible to * have conflicting transient pages. */ cfs_spin_unlock(&hdr->coh_page_guard); cl_page_put(env, page); cfs_spin_lock(&hdr->coh_page_guard); page = ERR_PTR(-EBUSY); } I don't see why we should return error here, in my opinion, it should be a legal race for concurrent dio and bufferred read. Xiong, any comment? Thank you. |
| Comment by Alex Zhuravlev [ 05/Jul/11 ] |
|
does master branch hit this as well? |
| Comment by Niu Yawei (Inactive) [ 05/Jul/11 ] |
|
Hi, Alex I'm not sure, from the code, looks master should have this problem. |
| Comment by Niu Yawei (Inactive) [ 05/Jul/11 ] |
|
Yes, master hit this as well, https://maloo.whamcloud.com/test_sets/9918c09c-a384-11e0-a0cf-52540025f9af |
| Comment by Jinshan Xiong (Inactive) [ 07/Jul/11 ] |
|
Hi Alex, Maloo didn't catch any log for this case. |
| Comment by Niu Yawei (Inactive) [ 07/Jul/11 ] |
|
Hi, Xiong The reason of this ASSERT is explained in my previous comment:
The first is easy to fix, the second I think we need the fix in |
| Comment by Niu Yawei (Inactive) [ 08/Jul/11 ] |
|
The patch is at: http://review.whamcloud.com/1072 (the fix of |
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Build Master (Inactive) [ 13/Jul/11 ] |
|
Integrated in Oleg Drokin : 9e213f7975423b69eae06b1e561516e6b26a2c72
|
| Comment by Peter Jones [ 13/Jul/11 ] |
|
Landed for 2.1 |