Details
-
Bug
-
Resolution: Cannot Reproduce
-
Minor
-
None
-
Lustre 1.8.x (1.8.0 - 1.8.5)
-
None
-
Lustre-1.8.8, Infiniband
-
3
-
5809
Description
we have "never recovery finished" conditions at the customer site. Even it took a couple of hours after MDT starts, it was still RECOVERING in recovery_status. We tried umount and remount, but it was still same situation and denied new clients connection Finally, we did "-o abort_recovery" to mount options, to fix this problem. So, why the recovery can't finished in reasonable time?
# cat /proc/fs/lustre/mds/*/recovery_status status: RECOVERING recovery_start: 0 time_remaining: 0 connected_clients: 0/2 delayed_clients: 0/2 completed_clients: 0/2 replayed_requests: 0/?? queued_requests: 0 next_transno: 38672353440