[LU-17240] change test-framework to format and mount targets in parallel Created: 28/Oct/23 Updated: 20/Dec/23 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Improvement | Priority: | Minor |
| Reporter: | Andreas Dilger | Assignee: | WC Triage |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | medium, test_script_improvements | ||
| Issue Links: |
|
||||||||||||
| Severity: | 3 | ||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||
| Description |
|
It would be useful for a number of reasons to change test-framework.sh to format and mount the MDTs and OSTs in parallel (if not both MDTs and OSTs at the same time, then at least in two sets).
|
| Comments |
| Comment by Patrick Farrell [ 28/Oct/23 ] |
|
timday - I'm just living in hope here, but maybe this would be of interest to you? It would certainly make reloading a test node faster, which would be very nice. |
| Comment by Patrick Farrell [ 28/Oct/23 ] |
|
Andreas, Would it make sense to put "unmounting"/Stopping targets in parallel here under this same ticket? That's closely related and also takes a while. |
| Comment by Tim Day [ 29/Oct/23 ] |
|
I've actually written some tests to do a bunch of parallel mounts, but that was client-side. It was to test out OBD device registration. I never got around to cleaning the test up and submitting it. While mounting targets in parallel would make testing faster, I'm not sure if it would meaningfully improve test coverage. I haven't seen/heard of issues with mounting targets in parallel (even with 100s of OSS/OST). It would useful if we could find a way to register a few hundred OSS/MDS in parallel. I think that would surface more bugs faster. I think it would go: 1) Stop all clients, MDS, MDS, OSS 2) Make a bunch of small temp disks in /tmp/ on each node 3) Start a bunch of a services using those disks, hope nothing explodes 4) Cleanup and restart services Andreas, could you link some of the known issues you mentioned (in the description) to this ticket? I'm curious what people have seen go wrong. |
| Comment by Andreas Dilger [ 30/Oct/23 ] |
|
Patrick, Tim, |
| Comment by Alexey Lyashkov [ 30/Oct/23 ] |
|
Andreas, it's Zam patch I think. I have just move all handling in single thread. |
| Comment by Andreas Dilger [ 31/Oct/23 ] |
|
Tim, it looks like the patch I was thinking about is https://review.whamcloud.com/44594 " |
| Comment by Andreas Dilger [ 31/Oct/23 ] |
|
The other ones are patch https://review.whamcloud.com/45259 " |
| Comment by Gerrit Updater [ 20/Dec/23 ] |
|
"Timothy Day <timday@amazon.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/53518 |