[LU-8315] Checksum/erasure code of EAs for better recovery of Lustre Created: 21/Jun/16 Updated: 23/Jun/16 Resolved: 23/Jun/16 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | New Feature | Priority: | Minor |
| Reporter: | Li Xi (Inactive) | Assignee: | WC Triage |
| Resolution: | Incomplete | Votes: | 0 |
| Labels: | None | ||
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
In order to save some private informations of the file/object belonging to a Unfortunately, this situation could happen if a server or storage crashes on Because of these reasons, I am wondering whether a checksum/erasure code of the 1) An checksum/erasure code of the Lustre EAs (e.g. trusted.lov + trusted.lma 2) When the OST/MDT objects of a Lustre file is accessed/repaired, the 3) When the Lustre EAs are updated, the checksum/erasure code will be updated. 4) The checksum/erasure code of the MDT EA (i.e. "trusted.mdt_checksum") will 5) A series of ultilities should be provided for better recovering of the We could use similar mechanism from lower file system, for example, the |
| Comments |
| Comment by Andreas Dilger [ 22/Jun/16 ] |
|
There is already a feature in upstream ext4 that adds checksums for all metadata (metacsum) that should be used. For ZFS the checksum is already present for all metadata. I don't think that random EA corruption is likely to be a source of problems here, since there are 128-bit identifiers for the objects on the MDT and OST so it is unlikely that random data will match any existing object. The PFID EA on the OST is essentially already a distributed copy of the LOV EA that can be used by LFSCK to recover the LOV EA if it is corrupted. If LFSCK were run periodically on a filesystem it will ensure that the PFID EA copy is updated and correct on the OSTs. One area that could be improved is the connection between the LOV EA on the MDT and the PFID on the OST. For the PFL project there will be more information in the PFID to contain the stripe size and total stripe count for the file (or component) layout that will help in recovery of the LOV EA. For the File Level Redundancy project there will also be a layout generation stored with each object that matches the MDT EA to the OST EA to ensure they are in sync, but this will not help with recovering the layout if it is corrupted. |
| Comment by Li Xi (Inactive) [ 22/Jun/16 ] |
|
Thank you Andreas for the information! |
| Comment by Andreas Dilger [ 22/Jun/16 ] |
|
Li Xi, I think that all of your feature requests are already being addressed in other ways, can this ticket be closed? |
| Comment by Li Xi (Inactive) [ 23/Jun/16 ] |
|
OK. Please close it. |
| Comment by Andreas Dilger [ 23/Jun/16 ] |
|
According to https://ext4.wiki.kernel.org/index.php/Ext4_Metadata_Checksums the metadata_csum feature started to land in 3.6, and since RHEL 7 is 3.10 based it is possible that this would be usable already with e2fsprogs 1.43? Please reopen if you try any of this out. |