[LU-12970] improve mballoc for huge filesystems - Whamcloud Community JIRA

Details

Type: Improvement
Resolution: Unresolved
Priority: Minor
Fix Version/s: None
Affects Version/s: None
Labels:

Rank (Obsolete):
9223372036854775807

Description

there are number of reports demonstrating a poor behaviour of mballoc on huge filesystems. in one report it was 688TB filesystem with 5.3M groups.
mballoc tries to allocate large chunks of space, for small allocations it tries to preallocate and share large chunks. while this is good in terms of fragmentation and streaming IO allocation itself may need to scan many groups to find a good candidate.
mballoc maintains internal in-memory structures (buddy cache) to speed up searching, but that cache is built from regular on-disk bitmaps, meaning IO. and if cache is cold, populating it may take a lot of time.

there are few ideas how to improve that:

skip more groups using less information when possible
stop scanning if too many groups have been scanned (loaded) and use best found
prefetch bitmaps (use lazy init thread? prefetch at scanning)

another option for prefetching would be to skip non-initialized groups, but start an async read for the corresponding bitmap.
also, when mballoc marks the blocks used (allocation has been just made) it could make sense to check/prefetch the subsequent group(s) which is likely a goal for subsequent allocation - while the caller are writting IO to just allocated blocks, the next group(s) will be prefetchted and ready to use.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

ext4-loadbitmaps.patch
7 kB
15/Nov/19 11:06 AM

Issue Links

is related to

LU-8365 Fix mballoc stream allocator to better use free space at start of drive

Open

LU-15319 Weird mballoc behaviour

Resolved

LU-12976 Bigalloc sub cluster allocation for ldiskfs

Open

LU-16155 allow importing inode/block allocation maps to new ldisks filesystem

Open

LU-12988 improve mount time on huge ldiskfs filesystem

Resolved

LU-16691 optimize ldiskfs prealloc (PA) under random read workloads

Resolved

is related to

LU-12103 Improve block allocation for large partitions

Resolved

LU-14438 backport ldiskfs mballoc patches

Resolved

LU-10946 add an interface to load ldiskfs block bitmaps

Closed

(1 is related to, 3 is related to )

Activity

People

Assignee:: WC Triage

Reporter:: Alex Zhuravlev

Votes:: 0 Vote for this issue

Watchers:: 10 Start watching this issue

Dates

Created:: 14/Nov/19 4:56 PM

Updated:: 18/Mar/25 6:46 AM