[LU-2985] e2fsck fails and aborted if it generates the database for lfsck Created: 19/Mar/13 Updated: 19/Mar/13 Resolved: 19/Mar/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | Shuichi Ihara (Inactive) | Assignee: | Niu Yawei (Inactive) |
| Resolution: | Won't Fix | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Lustre-2.1.0 CentOS5.5 |
||
| Attachments: |
|
| Severity: | 1 |
| Rank (Obsolete): | 7277 |
| Description |
|
The customer's MDT crashed and they ran e2fsck and a lot of problem couldn't be fixed. They upgraded e2fsck to latest version and fixed corruptions with it and they are now going to lfsck before mount MDT. However, when they ran e2fsck to generage database, we saw the following error messages and can't generate database from MDT. /sbin/e2fsck -n -v --mdsdb /tmp/mdsdb /dev/ExHwRaid10VolGroup/mdt MDS: got 4168 bytes = 521 entries in lov_objids |
| Comments |
| Comment by nasf (Inactive) [ 19/Mar/13 ] |
|
Ihara, are there more logs can be used? |
| Comment by Niu Yawei (Inactive) [ 19/Mar/13 ] |
|
Apparently there is old data in the db which has duplicated key? Does removing the old db and rerun lfsck works? |
| Comment by Shuichi Ihara (Inactive) [ 19/Mar/13 ] |
|
Niu, FanYong |
| Comment by Shuichi Ihara (Inactive) [ 19/Mar/13 ] |
|
it's log file when they ran e2fsck to generate database. |
| Comment by Shuichi Ihara (Inactive) [ 19/Mar/13 ] |
|
just attached e2fsck log. |
| Comment by Niu Yawei (Inactive) [ 19/Mar/13 ] |
|
seems like there are duplicated inode number on the local mds filesystem, did you run 'e2fsck -f' to fix the local filesystem before generating mds db? |
| Comment by Andreas Dilger [ 19/Mar/13 ] |
|
For 2.x lfsck, it is using the FID as the key for the mdsdb. I don't think there could be duplicate inode numbers in the filesystem, but it is possible that lfsck isn't handling hard links correctly? If the customer has run e2fsck on the MDS filesystem, it isn't strictly necessary to run lfsck. Until this problem with lfsck is better understood, they should be able to use the filesystem without lfsck. |
| Comment by Peter Jones [ 19/Mar/13 ] |
|
Please cease all work on this. This is for an unsupported site running and unsupported version of Lustre. |