[LU-5459] (cl_page.c:694:cl_page_assume()) ASSERTION( 0 ) failed Created: 06/Aug/14  Updated: 30/Jan/15  Resolved: 26/Aug/14

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.6.0, Lustre 2.7.0
Fix Version/s: Lustre 2.7.0

Type: Bug
Priority: Critical
Reporter: John Hammond
Assignee: Jinshan Xiong (Inactive)
Resolution: Fixed
Votes: 0
Labels: hsm

Issue Links:
Related
Severity: 3
Rank (Obsolete): 15204

 Description   

To reproduce, repeatedly perform an HSM release and an explicit restore of a file:

t:lustre-release# cd /mnt/lustre
t:lustre# echo 'Uludag Gazoz!' > f0
t:lustre# lfs hsm_archive f0
t:lustre# while true; do
> lfs hsm_release f0
> cat f0^C
t:lustre# while true; do
> lfs hsm_release f0
> cat f0
> done
Uludag Gazoz!
Uludag Gazoz!
Uludag Gazoz!
Uludag Gazoz!
Uludag Gazoz!
Uludag Gazoz!
...
Uludag Gazoz!

Message from syslogd@t at Aug  6 15:21:37 ...
 kernel:[84351.829259] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) ASSERTION( 0 ) failed: 

Message from syslogd@t at Aug  6 15:21:37 ...
 kernel:[84351.830735] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) LBUG
Write failed: Broken pipe
[84351.808432] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) page@ffff8801dccfd878[2 ffff8801df332f68 0 0 1 d84156c5635688c0 ffff8801cdc6a820]
[84351.808436] 
[84351.812902] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) vvp-page@ffff8801dccfd8e0(0:0:0) vm@ffffea000553c768 40000000000801 4:0 ffff8801dccfd878 0 lru
[84351.812906] 
[84351.817618] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) lov-page@ffff8801dccfd930, raid0
[84351.817620] 
[84351.820793] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) osc-page@ffff8801dcd0d788 0: 1< 0x845fed 0 0 - - > 2< 0 0 0 0x0 0x100 | (null) ffff8802157630f0 ffff8801d20d5818 > 3< - (null) 0 0 0 > 4< 0 0 8 10481664 - | - - - - > 5< - - - - | 0 - | 0 - ->
[84351.820798] 
[84351.826036] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) end page@ffff8801dccfd878
[84351.826037] 
[84351.827838] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) pg->cp_owner == NULL
[84351.829259] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) ASSERTION( 0 ) failed: 
[84351.830735] LustreError: 3111:0:(cl_page.c:694:cl_page_assume()) LBUG
[84351.831958] Pid: 3111, comm: cat
[84351.832583] 
[84351.832584] Call Trace:
[84351.833370]  [<ffffffffa0eae8c5>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
[84351.834657]  [<ffffffffa0eaeec7>] lbug_with_loc+0x47/0xb0 [libcfs]
[84351.835855]  [<ffffffffa10077e2>] cl_page_assume+0x202/0x210 [obdclass]
[84351.837103]  [<ffffffffa09a5d4d>] ll_readpage+0xbd/0x280 [lustre]
[84351.838241]  [<ffffffff81133335>] generic_file_aio_read+0x215/0x740
[84351.839416]  [<ffffffffa09d864e>] vvp_io_read_start+0x22e/0x450 [lustre]
[84351.840677]  [<ffffffffa10101ca>] cl_io_start+0x6a/0x140 [obdclass]
[84351.841857]  [<ffffffffa10141f4>] cl_io_loop+0xb4/0x1b0 [obdclass]
[84351.843023]  [<ffffffffa0976872>] ll_file_io_generic+0x412/0x8f0 [lustre]
[84351.844298]  [<ffffffffa0976e80>] ll_file_aio_read+0x130/0x2b0 [lustre]
[84351.845516]  [<ffffffffa0977329>] ll_file_read+0x159/0x290 [lustre]
[84351.846674]  [<ffffffff811a3185>] vfs_read+0xb5/0x1a0
[84351.847626]  [<ffffffff81554298>] ? lockdep_sys_exit_thunk+0x35/0x67
[84351.848791]  [<ffffffff811a32c1>] sys_read+0x51/0x90
[84351.849724]  [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
[84351.850837] 
[84351.851538] Kernel panic - not syncing: LBUG
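
The reproduction steps at the top of this description can be scripted as a bounded loop instead of an interactive `while true`. This is only a minimal sketch: the function name `hsm_release_read_loop`, the `LFS` override variable, and the default iteration count are illustrative assumptions and are not part of the original report. It assumes the file has already been archived with `lfs hsm_archive` and that a copytool is running.

```shell
#!/bin/sh
# Hedged sketch of the reproducer from the description, as a bounded loop.
# LFS, the function name, and the iteration count are illustrative
# assumptions; only "lfs hsm_release" and "cat" come from the report.
LFS=${LFS:-lfs}

# hsm_release_read_loop FILE [COUNT]
# Releases FILE and reads it back COUNT times (default 100).
# Returns non-zero as soon as a release or a read fails.
hsm_release_read_loop() {
    file=$1
    count=${2:-100}
    i=0
    while [ "$i" -lt "$count" ]; do
        # Drop the file data from Lustre, keeping only the archived copy.
        "$LFS" hsm_release "$file" || return 1
        # Reading the released file triggers a restore from the archive.
        cat "$file" > /dev/null || return 1
        i=$((i + 1))
    done
    return 0
}
```

For example, `cd /mnt/lustre && hsm_release_read_loop f0 1000` repeats the release and read cycle on the file from the report. On an affected client the LBUG panics the kernel, so the function is not expected to return in that case.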


 Comments   
Comment by Frank Zago (Inactive) [ 09/Aug/14 ]

Hi John,

I've seen the same thing, but it's much harder to reproduce. With your test, the copytool hangs within seconds.
This bug is also present in Lustre 2.5.

Could you test again with this patch: http://review.whamcloud.com/#/c/11386/

Comment by Jinshan Xiong (Inactive) [ 11/Aug/14 ]

Thanks, Zago. There is a similar patch at http://review.whamcloud.com/11394

Comment by Frank Zago (Inactive) [ 11/Aug/14 ]

Thanks. That looks better than my patch.

Comment by Peter Jones [ 26/Aug/14 ]

Landed for 2.7

Comment by James Nunez (Inactive) [ 30/Jan/15 ]

Patch for b2_5 at http://review.whamcloud.com/#/c/12393/
