[LU-9538] Size on MDT with guarantee of eventual consistency - Whamcloud Community JIRA

Details

Type: New Feature
Resolution: Fixed
Priority: Minor
Fix Version/s: Lustre 2.12.0
Affects Version/s: None
Labels:
- LSOM
- patch

Description

I believe that size on MDT has been discussed for a long time, and there have even been some implementations of it before. I am creating this ticket to discuss it again, because keeping file sizes on MDTs seems very important for the new policy engine for Lustre (LiPE) that I am currently working on.

LiPE scans MDTs directly and extracts almost all file attributes. It evaluates a series of mathematical expressions over these attribute values, and the results determine which rules each file matches. This works well for almost all file metadata except file size, because the MDT does not store file sizes. That is why we want to add file size on MDT.
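To make the rule-matching idea concrete, here is a minimal sketch of how attribute-based rules might be evaluated, and why a missing size is a problem. It is not LiPE's actual code or rule syntax: `FileAttrs`, `Rule`, `matching_rules`, and the two example rules are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

DAY = 24 * 3600
GIB = 1 << 30


@dataclass
class FileAttrs:
    """Hypothetical subset of the attributes read from an MDT inode."""
    uid: int
    mtime: int                  # seconds since the epoch
    size: Optional[int] = None  # None: the MDT holds no size for this file


@dataclass
class Rule:
    """Hypothetical rule: a named boolean expression over file attributes."""
    name: str
    predicate: Callable[[FileAttrs, int], bool]
    needs_size: bool = False


def matching_rules(attrs: FileAttrs, rules: List[Rule], now: int) -> List[str]:
    """Return the names of the rules that the file matches."""
    matched = []
    for rule in rules:
        if rule.needs_size and attrs.size is None:
            # Without a size on the MDT, size-based rules cannot be evaluated.
            continue
        if rule.predicate(attrs, now):
            matched.append(rule.name)
    return matched


# Example rules (illustrative thresholds only).
RULES = [
    Rule("old", lambda a, now: now - a.mtime > 30 * DAY),
    Rule("big_and_old",
         lambda a, now: a.size > GIB and now - a.mtime > 30 * DAY,
         needs_size=True),
]
```

With these definitions, a file whose `size` is `None` can only ever match `old`; the `big_and_old` rule becomes usable once the scan can read a size from the MDT.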

Given that file size on MDT has been discussed for a long time, I believe many of the problems and difficulties of implementing this feature have already been recognized by people in the Lustre community. And I think it is obvious that implementing a strict size on MDT with strong guarantees is too hard.

For LiPE, I think file sizes with a guarantee of eventual consistency should be enough for most use cases, because 1) smart administrators will leave enough margin in their data management rules. I don't think a smart administrator will define any dangerous rule based on the strict file size without enough margin in timestamps and file size. 2) Most management actions can be undone without any data loss. And 3) data removal is usually double- or triple-checked before being committed. It is reasonable to ask administrators to double-check the sizes of files to be removed on a Lustre client if the file size on MDT is not precise all the time.
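The double check described in point 3 could look roughly like the sketch below: before a size-based removal is committed, the size is read again through a mounted client and compared against the rule threshold padded by a safety margin. The function name `confirm_size_rule` and the default 20% margin are assumptions made for illustration, not part of any existing tool.

```python
import os


def confirm_size_rule(path: str, threshold: int, margin: float = 0.2) -> bool:
    """Re-check a 'file is larger than threshold' rule on the client.

    The size on the MDT may be stale, so this reads the size through the
    client mount (os.stat) and requires it to exceed the threshold by
    `margin` (a fraction) before the caller acts on the match.
    """
    client_size = os.stat(path).st_size
    return client_size >= threshold * (1 + margin)
```

A caller would only proceed with removal when `confirm_size_rule(...)` returns `True`; files that fall inside the margin are skipped and can be picked up by a later scan.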

Still, we have a lot of choices about how to implement file size on MDT, even if we choose to implement a relaxed/lazy version. I believe that a lot of related earlier work could be reused. I guess using a new extended attribute for file size on MDT might be better than using i_size in the inode structure, since data on MDT is coming. And file size on MDT should be synced in a few scenarios that provide enough consistency guarantees while introducing little performance impact, for example 1) when the last close of the file finishes, and 2) when a significant time has passed since the last sync.
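The two sync triggers proposed above can be sketched as a small state machine. This is only a model of the policy, not Lustre code: the class `LazySizeRecord`, its method names, and the `sync_interval` parameter are hypothetical, and the real size would come from the OSTs rather than being passed in by the caller.

```python
from typing import Optional


class LazySizeRecord:
    """Toy model of a lazily synced size stored on the MDT for one file."""

    def __init__(self, sync_interval: float):
        self.sync_interval = sync_interval   # seconds between periodic syncs
        self.open_count = 0                  # number of open handles
        self.mdt_size: Optional[int] = None  # size recorded on the MDT (may be stale)
        self.last_sync = 0.0                 # time of the last sync

    def _sync(self, data_size: int, now: float) -> None:
        self.mdt_size = data_size
        self.last_sync = now

    def on_open(self) -> None:
        self.open_count += 1

    def on_close(self, data_size: int, now: float) -> bool:
        """Trigger 1: sync when the last close of the file finishes."""
        self.open_count -= 1
        if self.open_count == 0:
            self._sync(data_size, now)
            return True
        return False

    def on_timer(self, data_size: int, now: float) -> bool:
        """Trigger 2: sync a still-open file after a long time without a sync."""
        if self.open_count > 0 and now - self.last_sync >= self.sync_interval:
            self._sync(data_size, now)
            return True
        return False
```

Between syncs the recorded size can lag behind the real one, which is the eventual-consistency trade-off: a file that is closed, or left open long enough, always ends up with an up-to-date size, while the hot write path never has to update the MDT.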

I'd like to work on this once it has been fully discussed and a design has been agreed on by everyone involved. Any advice would be appreciated. Thanks!

Attachments

2018 LSOM Test Plan.pdf
1010 kB
12/Dec/18 11:46 PM

Issue Links

is blocked by

LUDOC-402 Add documentation for lazy size on MDT

Resolved

is related to

LU-10370 "truncate" does not update blocks count on client

Resolved

LU-11466 DoM files should not need LSOM sync for valid attributes on the MDS

Resolved

LU-11479 Error replicating xattr for /tmp/target/d8.lustre-rsync-test/d07/d073/b4: 2

Resolved

LU-11696 "lfs getsom" returns "24" (sizeof lustre_som_attr) to userspace

Resolved

LU-12026 verify that MDS stores atime/mtime/ctime during LSOM update

Resolved

LU-11367 integrate LSOM with lfs find

Resolved

LU-10934 integrate statx() API with Lustre

Resolved

LU-11190 LSOM size/age accounting histogram

Resolved

LU-11473 Add ‘lfs getsom’ to the lfs man page

Resolved

Activity

People

Assignee: Li Xi (Inactive)

Reporter: Li Xi (Inactive)

Votes: 0

Watchers: 26

Dates

Created: 19/May/17 4:03 AM

Updated: 05/Feb/20 7:16 PM

Resolved: 09/Aug/18 7:04 PM