[LU-1336] OSS GPF at ptlrpc_send_reply+0x470 Created: 18/Apr/12 Updated: 30/Apr/12 Resolved: 30/Apr/12 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | Ned Bass | Assignee: | Zhenyu Xu |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Environment: | |||
| Severity: | 3 |
| Rank (Obsolete): | 6411 |
| Description |
|
Back trace looks like this: machine_kexec That RIP resolves to lustre/ptlrpc/niobuf.c:436 which in our tree is here: 434 /* There may be no rq_export during failover */ 435 436 if (unlikely(req->rq_export && req->rq_export->exp_obd && 437 req->rq_export->exp_obd->obd_fail)) { 438 /* Failed obd's only send ENODEV */ 439 req->rq_type = PTL_RPC_MSG_ERR; 440 req->rq_status = -ENODEV; 441 CDEBUG(D_HA, "sending ENODEV from failed obd %d\n", 442 req->rq_export->exp_obd->obd_minor); 443 } Server was handling many client reconnects, under similar conditions as reported in |
| Comments |
| Comment by Peter Jones [ 18/Apr/12 ] |
|
Bobi Could you please comment on this one? Thanks Peter |
| Comment by Mikhail Pershin [ 19/Apr/12 ] |
|
isn't that Also other |
| Comment by Ned Bass [ 19/Apr/12 ] |
|
Hi Mikhail, I've been tracking those separately because the of the different exception sites. That would be nice if they were all symptoms of the same bug. Can we consider landing the |
| Comment by Peter Jones [ 30/Apr/12 ] |
|
Ned We have landed LU1092 for 2.1.2 also. Please reopen if this is not a duplicate after all Peter |