[LU-1470] MDS Crash & reboot Created: 04/Jun/12 Updated: 29/May/17 Resolved: 29/May/17 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.2.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Fabio Verzelloni | Assignee: | Zhenyu Xu |
| Resolution: | Incomplete | Votes: | 0 |
| Labels: | None | ||
| Environment: |
MDS HW MDT LSI 5480 Pikes Peak OSS HW OST LSI 7900 1 MDS + 1 fail over |
||
| Attachments: |
|
| Severity: | 3 |
| Rank (Obsolete): | 4039 |
| Description |
|
MDS hang and the fail over take over, attached the /var/log/messages. Fabio |
| Comments |
| Comment by Peter Jones [ 04/Jun/12 ] |
|
Bobijam will help with this one |
| Comment by Zhenyu Xu [ 04/Jun/12 ] |
|
from the messages, the system received heartbeat shutdown notice from weisshorn02 node, that maked MDS reboot. Before that
|
| Comment by Zhenyu Xu [ 04/Jun/12 ] |
|
Can not find the evidence of MDS crash, would you mind collecting MDS debug logs and tell me when did the MDS crash? Also from the log you've uploaded, there are several messages showing that "ldap_result() failed: Can't contact LDAP server", does the network have problem, is it the same network which heartbeat uses? |