[LU-290] Reconnects are not throttled Created: 06/May/11 Updated: 16/Aug/16 Due: 21/May/11 Resolved: 16/Aug/16 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.0.0 |
| Fix Version/s: | Lustre 2.1.0 |
| Type: | Bug | Priority: | Minor |
| Reporter: | Lai Siyao | Assignee: | Lai Siyao |
| Resolution: | Won't Fix | Votes: | 0 |
| Labels: | None |
| Severity: | 3 |
| Bugzilla ID: | 22423 |
| Epic: | connect, ping |
| Rank (Obsolete): | 4933 |
| Description |
|
It seems that clients can flood a server with reconnect requests, e.g. as seen on 1.8.2 on a cluster with ~800 clients: Mar 3 20:26:16 md061i kernel: Lustre: 27033:0:(ldlm_lib.c:835:target_handle_connect()) From code review, this looks like a side effect of bug 18674. |
| Comments |
| Comment by Peter Jones [ 06/May/11 ] |
|
Lai, just to warn you on this one: Oleg was not sure whether this would even be a problem on master, so the first step is to establish whether it is before investing time in trying to port the patch. Regards, Peter |
| Comment by Lai Siyao [ 08/May/11 ] |
|
I see, I'll investigate first. |
| Comment by Lai Siyao [ 14/Jun/11 ] |
|
The comments in bz22423 and the current code show that a patch for 2.x was committed, but it caused a conf-sanity.sh failure and was then reverted. I'll run some tests and find the cause of that failure. |
| Comment by Lai Siyao [ 01/Aug/11 ] |
|
The autotest results look normal; the patch will be submitted for review. |
| Comment by Build Master (Inactive) [ 04/Aug/11 ] |
|
Integrated in Oleg Drokin : 86b2211e55dcc509da85b21ece8830e2a9b70db1
|
| Comment by Alex Zhuravlev [ 09/Aug/11 ] |
|
Please see this: https://maloo.whamcloud.com/test_sets/e098dda6-c262-11e0-8bdf-52540025f9af. There are ~30K "recovery is timed out, evict stale exports" messages in conf-sanity.test_47.console.client-9-ib.log |
| Comment by Li Wei (Inactive) [ 24/Aug/11 ] |
|
Several occurrences on Orion with the crazy "recovery is timed out, evict stale exports" flood: https://maloo.whamcloud.com/test_sets/8d84d62e-ce24-11e0-8d02-52540025f9af (8c8e6dc) |
| Comment by Li Wei (Inactive) [ 15/Jan/12 ] |
|
Commit: 526c43ec2e47ead878f0df552b74c78b4fc79d1f (Jan 13, 2012) Another flood. |
| Comment by James A Simmons [ 16/Aug/16 ] |
|
Old ticket for unsupported version |