The problem you found by run.sh is probably related to the following code:
When we set stripe for root(mount point), set_default is enabled in ll_dir_ioctl()
case LL_IOC_LOV_SETSTRIPE: {
...
int set_default = 0;
...
if (inode->i_sb->s_root == file->f_dentry)
set_default = 1;
rc = ll_dir_setstripe(inode, lumv1, set_default);
Then, in ll_dir_setstripe() if set_default=1, we will call ll_send_mgc_param() to set information asynchronously.
if (set_default && mgc->u.cli.cl_mgc_mgsexp) {
}
Since you run setstripe very frequently and many times in run.sh, the config log queue might be very long (bottleneck), and mgs will take more time to process it.
BTW, can you hit this problem if you don't use run.sh, just run sanity.sh regularly?
Haven't seen this in a long time.