[LU-3005] MDT attempted to access beyond the disk Created: 21/Mar/13 Updated: 22/Mar/13 Resolved: 22/Mar/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.4.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Blocker |
| Reporter: | James A Simmons | Assignee: | Bruno Faccini (Inactive) |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | HB | ||
| Environment: |
Lustre 2.3.62 running the servers as well as the clients. |
||
| Issue Links: |
|
||||||||||||||||
| Severity: | 4 | ||||||||||||||||
| Rank (Obsolete): | 7318 | ||||||||||||||||
| Description |
|
While running mdtest the file system went into read only mode and the file system reported corruption on the MDS. The error is: [56736.791445] attempt to access beyond end of device |
| Comments |
| Comment by Keith Mannthey (Inactive) [ 21/Mar/13 ] |
|
It seems to be WAAAY past the end. "md5: rw=0, want=10484831811994872656, limit=282775552" Are there any other relevant md messages in your logs? How often have you seen this? Can you describe your test configuration a bit more? Can you share your mdtest values and mount info? |
| Comment by James A Simmons [ 21/Mar/13 ] |
|
First time today and no other info showed up in the logs. Rebuilt the file system and now it seems to have gone away. I have DDN 9900 attached to 4 OSS. Each OSS has 7 OSTs. The MGS has a simple sata disk and the MDS has a md device. Attached to clients with DDR Inifiniband. |
| Comment by Bruno Faccini (Inactive) [ 21/Mar/13 ] |
|
Also a debugfs "stat <22016621>" sub-command output run on MDT mounted as ldiskfs could give more infos and help to see if corruption was on-disk or not. |
| Comment by James A Simmons [ 22/Mar/13 ] |
|
This is a duplicate of |
| Comment by Peter Jones [ 22/Mar/13 ] |
|
Let's focus discussion under the original ticket - |