Details
-
Bug
-
Resolution: Fixed
-
Minor
-
Lustre 2.16.0
-
3
-
9223372036854775807
Description
This issue was created by maloo for jianyu <yujian@whamcloud.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/06d7916c-1d43-4976-ae97-a956a99f9115
test_4 failed with the following error:
+ su mpiuser bash -c "/usr/lib64/openmpi/bin/mpirun --mca btl tcp,self --mca btl_tcp_if_include eth0 -mca boot ssh --oversubscribe -machinefile /tmp/auster.machines -np 1 -npernode 2 /usr/lib64/openmpi/bin/mdtest -d=/mnt/lustre/mdtest -I=1488 -n=148893 -r " onyx-41vm1.onyx.whamcloud.com:rank0.mdtest: Failed to get eth0 (unit 0) cpu set onyx-41vm1.onyx.whamcloud.com:rank0: PSM3 can't open nic unit: 0 (err=23) -------------------------------------------------------------------------- Open MPI failed an OFI Libfabric library call (fi_endpoint). This is highly unusual; your job may behave unpredictably (and/or abort) after this. Local host: onyx-41vm1 Location: mtl_ofi_component.c:513 Error: Invalid argument (22)
Test session details:
clients: https://build.whamcloud.com/job/lustre-master/4587 - 5.14.0-427.31.1.el9_4.x86_64
servers: https://build.whamcloud.com/job/lustre-master/4587 - 5.14.0-427.31.1_lustre.el9.x86_64
<<Please provide additional information about the failure here>>
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
performance-sanity test_4 - test_4 failed with 1