Details
-
Bug
-
Resolution: Fixed
-
Minor
-
None
-
None
-
3
-
9223372036854775807
Description
This issue was created by maloo for Frederick Dilger <fdilger@whamcloud.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/51c8b27a-f8cb-44a1-a6d9-0f1d091f7d0f
test_2 failed with the following error:
test_2 failed with 1
Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/112764 - 4.18.0-553.46.1.el8_10.x86_64
servers: https://build.whamcloud.com/job/lustre-reviews/112764 - 4.18.0-553.46.1.el8_lustre.x86_64
<<Please provide additional information about the failure here>>
Large files creation performance test
set file size to 1073741824
install: cannot change permissions of '/mnt/lustre/mdtest': No such file or directory
stat: cannot read file system information for '/mnt/lustre/mdtest': No such file or directory
CMD: onyx-117vm9 params=\$(/usr/sbin/lctl get_param mdt.*.enable_remote_dir_gid);
[[ -z \"lustre-MDT0000\" ]] && param= ||
param=\$(grep lustre-MDT0000 <<< \"\$params\");
[[ -z \$param ]] && param=\"\$params\";
while read s; do echo mds1 \$s;
done <<< \"\$param\"
CMD: onyx-117vm10 params=\$(/usr/sbin/lctl get_param mdt.*.enable_remote_dir_gid);
[[ -z \"lustre-MDT0001\" ]] && param= ||
param=\$(grep lustre-MDT0001 <<< \"\$params\");
[[ -z \$param ]] && param=\"\$params\";
while read s; do echo mds2 \$s;
done <<< \"\$param\"
CMD: onyx-117vm9 params=\$(/usr/sbin/lctl get_param mdt.*.enable_remote_dir_gid);
[[ -z \"lustre-MDT0002\" ]] && param= ||
param=\$(grep lustre-MDT0002 <<< \"\$params\");
[[ -z \$param ]] && param=\"\$params\";
while read s; do echo mds3 \$s;
done <<< \"\$param\"
CMD: onyx-117vm10 params=\$(/usr/sbin/lctl get_param mdt.*.enable_remote_dir_gid);
[[ -z \"lustre-MDT0003\" ]] && param= ||
param=\$(grep lustre-MDT0003 <<< \"\$params\");
[[ -z \$param ]] && param=\"\$params\";
while read s; do echo mds4 \$s;
done <<< \"\$param\"
CMD: onyx-117vm10,onyx-117vm9 /usr/sbin/lctl set_param mdt.*.enable_remote_dir_gid=-1
mdt.lustre-MDT0000.enable_remote_dir_gid=-1
mdt.lustre-MDT0002.enable_remote_dir_gid=-1
mdt.lustre-MDT0001.enable_remote_dir_gid=-1
mdt.lustre-MDT0003.enable_remote_dir_gid=-1
+ chmod 0777 /mnt/lustre
drwxrwxrwx 3 root root 4096 Apr 24 17:34 /mnt/lustre
+ su mpiuser bash -c "/usr/lib64/openmpi/bin/mpirun --mca btl tcp,self --mca btl_tcp_if_include eth0 -mca boot ssh --oversubscribe -machinefile /tmp/auster.machines -np 1 -npernode 2 /usr/lib64/openmpi/bin/mdtest -w=1073741824 -d=/mnt/lustre/mdtest -n=16 -F -R "
onyx-139vm1.onyx.whamcloud.com:rank0.mdtest: Failed to get eth0 (unit 0) cpu set
onyx-139vm1.onyx.whamcloud.com:rank0: PSM3 can't open nic unit: 0 (err=23)
--------------------------------------------------------------------------
Open MPI failed an OFI Libfabric library call (fi_endpoint). This is highly
unusual; your job may behave unpredictably (and/or abort) after this.
Local host: onyx-139vm1
Location: mtl_ofi_component.c:513
Error: Invalid argument (22)
--------------------------------------------------------------------------
– started at 04/24/2025 17:34:21 –
mdtest-4.0.0 was launched with 1 total task(s) on 1 node(s)
Command line used: /usr/lib64/openmpi/bin/mdtest '-w=1073741824' '-d=/mnt/lustre/mdtest' '-n=16' '-F' '-R'
WARNING: Unable to create test directory path /mnt/lustre/mdtest
Path : /mnt/lustre/mdtest
FS : 58.7 GiB Used FS: 0.0% Inodes: 3.0 Mi Used Inodes: 0.0%
Nodemap: 1
random seed: 1745516061
1 tasks, 16 files
WARNING: Unable to create test directory /mnt/lustre/mdtest/test-dir.0-0
WARNING: unable to create tree directory '/mnt/lustre/mdtest/test-dir.0-0/mdtest_tree.0/'
ERROR: open64("/mnt/lustre/mdtest/test-dir.0-0/mdtest_tree.0/file.mdtest.0.0", 66, 0664) failed. Error: No such file or directory, (aiori-POSIX.c:570)
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
performance-sanity test_2 - test_2 failed with 1
Attachments
Issue Links
- mentioned in
-
Page Loading...