The patch from LU-4755 increased the OST_MAXREQSIZE to accommodate very large numbers of niobufs in a single request (up to 1024 with a 4MB RPC). However, per my last comments in LU-4755 it isn't clear whether there is an advantage to having so many small IOs in a single large RPC vs. having multiple separate RPCs in parallel.
t is worthwhile to ask if there is any performance improvement from sending 4096 random pages in one RPC compared to 16 x 256 random pages in separate RPCs? It might even be faster to send parallel RPCs due to checksums running on separate cores and being handled in parallel on the OST. If there is no improvement from many random pages in one RPC, it is better to just limit the number of niobufs that the client sends in one RPC.
It would be useful to test a random write workload with 1MB, 4MB, and other RPC sizes to see if there is an improvement from sending multiple 1MB RPCs in parallel vs. larger single RPCs.
Landed as commit v2_8_58_0-8-g7f2aae8.