[LU-12283] Lustre client can not mount filesystem Created: 10/May/19 Updated: 02/Mar/21 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Question/Request | Priority: | Major |
| Reporter: | sebg-crd-pm (Inactive) | Assignee: | Peter Jones |
| Resolution: | Unresolved | Votes: | 1 |
| Labels: | None | ||
| Environment: |
lustre 2.10.6 |
||
| Attachments: |
|
| Description |
|
1. Other Lustre clients have mounted the filesystem, and their I/O access works.
2. This client can `lctl ping` the MGS node, but cannot mount the filesystem. The client logs this error:
[Wed May 8 02:44:47 2019] LustreError: 166-1: MGC172.20.0.201@o2ib1: Connection to MGS (at 172.20.0.201@o2ib1) was lost; in progress operations using this service will fail
3. The Lustre client can mount the filesystem after the MGT is remounted. |
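When scanning a client's kernel log for this condition, the 166-1 message can be matched with a regular expression. The following is only a minimal sketch: it assumes the message layout seen in the single line quoted above, which may differ between Lustre versions, and the name `parse_mgs_lost` is hypothetical, not part of any Lustre tool.

```python
import re

# Pattern for the "166-1" message as quoted in this ticket. The layout is
# assumed from that one sample and may vary between Lustre versions.
MGS_LOST = re.compile(
    r"LustreError: 166-1: (?P<mgc>MGC\S+): "
    r"Connection to MGS \(at (?P<mgs>[^)]+)\) was lost"
)

def parse_mgs_lost(line):
    """Return (mgc_device, mgs_nid) if the line reports a lost MGS
    connection, otherwise None. Hypothetical helper for illustration."""
    m = MGS_LOST.search(line)
    if m is None:
        return None
    return m.group("mgc"), m.group("mgs")

# The line below is the one quoted in the description.
sample = (
    "[Wed May 8 02:44:47 2019] LustreError: 166-1: MGC172.20.0.201@o2ib1: "
    "Connection to MGS (at 172.20.0.201@o2ib1) was lost; "
    "in progress operations using this service will fail"
)
print(parse_mgs_lost(sample))
```

Extracting the MGC device name and the MGS NID separately makes it easier to compare what different clients report against the same MGS.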
| Comments |
| Comment by Peter Jones [ 10/May/19 ] |
|
Could you please provide logs as per the comment on |
| Comment by sebg-crd-pm (Inactive) [ 13/May/19 ] |
|
The attached file is the MGS server log. Something appears to have gone wrong from May 7 14:09:09 until I tried to restart the MGT on May 8. |
| Comment by Thomas Roth [ 02/Mar/21 ] |
|
We seem to hit this with 2.12.5. Server lxmds19 has the combined MGS + MDT. Every 10 minutes, the connection to the MGS is lost and restored:
Feb 28 11:24:31 lxmds19 kernel: LustreError: 166-1: MGC10.20.3.0@o2ib5: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail
This is seen by the other servers and clients; as a result, new mounts fail often, but not always.
|
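One way to check whether the disconnects really recur on a fixed period is to measure the gaps between consecutive 166-1 events in the syslog. The sketch below assumes syslog-style lines like the one quoted in the last comment. The function name `loss_intervals` is hypothetical, the year is supplied by the caller because syslog timestamps omit it, and every sample line except the first is synthetic, not taken from the ticket's logs.

```python
import re
from datetime import datetime

# Matches a syslog-style line that carries the 166-1 message, e.g.
# "Feb 28 11:24:31 lxmds19 kernel: LustreError: 166-1: ...".
EVENT = re.compile(
    r"^(?P<stamp>\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) .*LustreError: 166-1:"
)

def loss_intervals(lines, year):
    """Return the gaps, in seconds, between consecutive 166-1 events.

    Syslog timestamps carry no year, so the caller passes one in; that
    value is an assumption about when the log was written.
    """
    times = []
    for line in lines:
        m = EVENT.match(line)
        if m:
            stamp = f"{year} {m.group('stamp')}"
            times.append(datetime.strptime(stamp, "%Y %b %d %H:%M:%S"))
    return [(b - a).total_seconds() for a, b in zip(times, times[1:])]

lines = [
    # Line quoted in the comment above.
    "Feb 28 11:24:31 lxmds19 kernel: LustreError: 166-1: MGC10.20.3.0@o2ib5: "
    "Connection to MGS (at 0@lo) was lost; in progress operations using "
    "this service will fail",
    # Synthetic lines with made-up timestamps, for illustration only.
    "Feb 28 11:34:31 lxmds19 kernel: LustreError: 166-1: (synthetic)",
    "Feb 28 11:44:32 lxmds19 kernel: LustreError: 166-1: (synthetic)",
]
print(loss_intervals(lines, 2021))
```

Gaps clustering near 600 seconds would be consistent with the ten-minute pattern described, though the sketch says nothing about the cause.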