[LU-361] Lustre Client crashes due to ASSERTION(!request->rq_replay) failed Created: 26/May/11  Updated: 15/May/12  Resolved: 26/May/11

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.0.0
Fix Version/s: Lustre 2.1.0

Type: Bug Priority: Major
Reporter: Sebastien Buisson (Inactive) Assignee: Johann Lombardi (Inactive)
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-190 random mode opencreate will LBUG lust... Resolved
Severity: 3
Rank (Obsolete): 8536

 Description   

At CEA they face Lustre-Client crashes due to ASSERTION(!request->rq_replay) failed.
As a consequence the application is lost and the node needs to be rebooted.

The crashing task stack always looks like :
===========================================
crash_kexec()
panic()
lbug_with_loc()
__ptlrpc_req_finished()
ptlrpc_req_finished()
ll_intent_release()
ll_create_nd()
vfs_create()
do_filp_open()
do_sys_open()
sys_open()
system_call_fastpath()
===========================================

It is possible to reproduce this issue very simply, by running on a Lustre client a short program that do an 'open' with the O_CREAT flag, but without specifying the third, mode parameter.
I know that the third parameter is mandatory when using the O_CREAT flag. If it is not given, then arbitrary permissions will be set, depending on the content of the stack.
But in any case it should lead to a client crash. When running the same test on ext4 for instance, the permissions set are non-sense but no crash occurs.

Sebastien.



 Comments   
Comment by Johann Lombardi (Inactive) [ 26/May/11 ]

I think this is fixed on master already, see bug LU-190.

Comment by Sebastien Buisson (Inactive) [ 26/May/11 ]

Oh great, sorry for missing that.
I will retrieve the patch and test it immediately, thank you!

Comment by Peter Jones [ 26/May/11 ]

Duplicate of LU190

Generated at Sat Feb 10 01:06:16 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.