[LU-1629] MDS waiting for 0 clients in recovery Created: 13/Jul/12 Updated: 01/Aug/12 Resolved: 01/Aug/12 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.3.0 |
| Fix Version/s: | Lustre 2.3.0 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Cliff White (Inactive) | Assignee: | Li Wei (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Hyperion |
||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 4511 | ||||||||
| Description |
|
Hyperion clustre, 100+ mounted clients. Lustre: lustre-MDT0000: recovery is timed out, evict stale exports |
| Comments |
| Comment by Cliff White (Inactive) [ 13/Jul/12 ] |
|
System log from MGS/MDS for the recovery |
| Comment by Andreas Dilger [ 13/Jul/12 ] |
|
I thought I saw a very similar bug report a week or two ago. Did you do a search for this first? I believe there was already a patch for it. |
| Comment by Mikhail Pershin [ 13/Jul/12 ] |
|
The same as ORI-668, there is discussion about that issue |
| Comment by Mikhail Pershin [ 13/Jul/12 ] |
|
This is not recovery issue but reporting, we are reporting currently just number of clients in recovery, that is why there is 0. There is no patch for this yet. |
| Comment by Jodi Levi (Inactive) [ 16/Jul/12 ] |
|
Mike, |
| Comment by Ian Colle (Inactive) [ 25/Jul/12 ] |
|
Li Wei - Mikhail is swamped with rebase. Can you please work on this? He says fix should be to change message and output slightly different values. |
| Comment by Mikhail Pershin [ 25/Jul/12 ] |
|
Message can be changed like the following:
LCONSOLE_WARN("%s: Denying connection for new client "
"%s (at %s), total clients to recover %d,"
" %d clients in recovery for %d:%.02d\n",
target->obd_name,
libcfs_nid2str(req->rq_peer.nid),
cluuid.uuid,
target->obd_max_recoverable_clients,
cfs_atomic_read(&target-> \
obd_lock_replay_clients),
(int)t / 60, (int)t % 60);
It outputs obd_max_recoverable_clients as total expected number of client to recover and uses obd_lock_replay_clients counter to show how many of them are already participating in recovery |
| Comment by Li Wei (Inactive) [ 27/Jul/12 ] |
| Comment by Li Wei (Inactive) [ 01/Aug/12 ] |
|
The patch has landed to master. |