[LU-6172] Client LBUG in null_free_reqbuf Created: 28/Jan/15 Updated: 12/Feb/15 Resolved: 12/Feb/15 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.4.3 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Jay Lan (Inactive) | Assignee: | Yang Sheng |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: |
(Git repo at https://github.com/jlan/lustre-nas) |
||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 17268 | ||||||||
| Description |
|
We have several clients panicked yesterday on this problem. There was ptlrpc_expire_one_request message on nbp9 (10.151.26.5): The client hits LBUG as shown in dmesg below: [1422397952.774718] LustreError: 22882:0:(sec_null.c:196:null_free_reqbuf()) ASSERTION( req->rq_reqmsg == req->rq_reqbuf ) failed: req ffff880837378000: reqmsg ffff8808373780e8 is not reqbuf ffff8805b759a400 in null sec |
| Comments |
| Comment by Jay Lan (Inactive) [ 29/Jan/15 ] |
|
The Call Trace part in the decription of the problem was cut-n-paste from a screeen that was contaminated by broadcast messages from root. The correct stack trace should be as below: [1422397952.818716] Kernel panic - not syncing: LBUG |
| Comment by Peter Jones [ 29/Jan/15 ] |
|
Yang Sheng Could you please advise? Peter |
| Comment by Oleg Drokin [ 02/Feb/15 ] |
|
I suspect this might be an instance of |
| Comment by Yang Sheng [ 04/Feb/15 ] |
|
Hi, Oleg. I think you may right. Looks like not other place can cause such issue. But seem we won't landed patch to 2.4. How to handle this ticket? |
| Comment by Peter Jones [ 04/Feb/15 ] |
|
Yang Sheng NASA have their own Lustre distribution and so can pick up the patch. They will advise if they need any assistance in porting the fix back to b2_4. Alternatively, if this is a rare issue, they may wait, knowing that this issue will disappear when they upgrade to 2.5.x clients. Peter |
| Comment by Jay Lan (Inactive) [ 04/Feb/15 ] |
|
Thanks, guys! I cherry-picked the patch to nas-2.4.3 branch with no problem. |
| Comment by Yang Sheng [ 09/Feb/15 ] |
|
So can we mark this ticket as duplicated with lu-3333? |
| Comment by John Fuchs-Chesney (Inactive) [ 12/Feb/15 ] |
|
Duplicate of |