[LU-1093] unable to handle kernel paging request in target_handle_connect() Created: 10/Feb/12 Updated: 30/Apr/12 Resolved: 30/Apr/12 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Ned Bass | Assignee: | Oleg Drokin |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None |
| Environment: | |
| Severity: | 3 |
| Rank (Obsolete): | 6462 |
| Description |
|
We had one occurrence of this bug on a classified Lustre 2.1 OSS. The timeframe coincided with:
LustreError: 14210:0:(genops.c:1270:class_disconnect_stale_exports()) ls5-OST0349: disconnect stale client [UUID]@<unknown>
BUG: unable to handle kernel paging request at 0000000100000017
Pid: 15974, comm: ll_ost_506 |
| Comments |
| Comment by Oleg Drokin [ 10/Feb/12 ] |
|
The disconnect stale client message is about clients that failed to contact the server during the recovery window. |
| Comment by Ned Bass [ 13/Feb/12 ] |
|
We are still trying to understand what happened, but it's hard to identify the clients because the UUIDs are all @<unknown>. It could be that they were BGP nodes that get rebooted between jobs. We suspect RPC traffic was not moving through the system well, but we don't know whether that was due to high server load or to a network or LNET router issue. |
| Comment by Peter Jones [ 30/Apr/12 ] |
|
Believed to be a duplicate of |