Details
-
Bug
-
Resolution: Fixed
-
Blocker
-
Lustre 2.10.0
-
Spirit performance cluster
-
3
-
9223372036854775807
Description
Attempting to run P02 and P03 performance tests, with striping set as:
$LFS setstripe $testdir --pool $ior_ostPool -E 64M -c 1 -E 4G -c 4 -E -1 -c -1I
Immediate MPI failures with IOR
Commencing write performance test: Thu Apr 13 21:04:16 2017 024: ior ERROR: write() failed, errno 61, No data available (aiori-POSIX.c:335) 024: -------------------------------------------------------------------------- 024: MPI_ABORT was invoked on rank 24 in communicator MPI_COMM_WORLD -- .......... 231: ior ERROR: write() failed, errno 61, No data available (aiori-POSIX.c:335) 088: In: PMI_Abort(-1, N/A) 287: ior ERROR: write() failed, errno 61, No data available (aiori-POSIX.c:335) 134: In: PMI_Abort(-1, N/A) 057: -------------------------------------------------------------------------- 057: MPI_ABORT was invoked on rank 57 in communicator MPI_COMM_WORLD -- 057: 057: NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes. 057: You may or may not see output from other processes, depending on 057: exactly when Open MPI kills them. 057: -------------------------------------------
Lustre Errors on all nodes attached.
Attachments
Issue Links
Activity
Resolution | New: Fixed [ 1 ] | |
Status | Original: Reopened [ 4 ] | New: Resolved [ 5 ] |
Comment | [ cliff - are you able to reproduce this issue on spirit? ] |
Resolution | Original: Fixed [ 1 ] | |
Status | Original: Resolved [ 5 ] | New: Reopened [ 4 ] |
Resolution | New: Fixed [ 1 ] | |
Status | Original: Open [ 1 ] | New: Resolved [ 5 ] |
Attachment | New: ior-stripe.txt [ 26582 ] |
Attachment | New: spirit-7.lustre.dump.gz [ 26574 ] | |
Attachment | New: spirit-8.lustre.dump.gz [ 26575 ] | |
Attachment | New: spirit-9.lustre.dump.gz [ 26576 ] | |
Attachment | New: spirit-10.lustre.dump.gz [ 26577 ] | |
Attachment | New: spirit-29.lustre.dump.gz [ 26578 ] | |
Attachment | New: spirit-30.lustre.dump.gz [ 26579 ] |
Remote Link | New: This issue links to "Page (HPDD Community Wiki)" [ 20272 ] |