Details
- Type: Improvement
- Resolution: Unresolved
- Priority: Minor
Description
It would be useful for "lfs find" to scan directories with multiple threads in parallel. It could (potentially) spawn a new thread (or add a work item to a thread pool) for each subdirectory so that subdirectories are scanned concurrently.
The "libcircle" and "libpcircle" libraries can perform workload sharing to speed up directory traversal. The pfind code in IO500 also does efficient parallel directory traversal, including splitting up large directories by hash index so that they can be traversed in parallel.
Integrating one of these algorithms into "lfs find" with pthreads would allow a many-fold improvement in directory scanning performance.
Issue Links
- is related to
  - LU-19052 add 'lfs find --skip-rebalance' parameter to return a fraction of files for OST rebalancing (Open)
  - LU-17699 add 'lfs find' parameter to return only a fraction of files for rebalancing (Resolved)
- is related to
  - LU-5170 lfs usability (Open)
  - LU-14610 Make mpiFileUtils better support statahead (Open)
  - LU-18586 add llapi_find_param_alloc() function (Open)
"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/57293/
Subject: LU-17814 utils: Add work unit management
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: e505e7dbfb8b58a7a2ef9178b56f8b7668a18329