Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-713

performance-sanity fail to start mpi tests

    XMLWordPrintable

Details

    • 3
    • 6553

    Description

      Report: https://maloo.whamcloud.com/test_sets/6e29d8a4-e58c-11e0-9909-52540025f9af

      ===== mdsrate-create-small.sh ### 1 NODE CREATE ###
      + /usr/lib64/lustre/tests/mdsrate --create --time 600
      --nfiles 447838 --dir /mnt/lustre/mdsrate/single --filefmt 'f%%d'
      + chmod 0777 /mnt/lustre
      drwxrwxrwx 4 root root 4096 Sep 22 17:51 /mnt/lustre
      + su mpiuser sh -c "/usr/lib64/mpi/gcc/openmpi/bin/mpirun -mca boot ssh -mca btl tcp,self -np 1 -machinefile /tmp/mdsrate-create-small.machines /usr/lib64/lustre/tests/mdsrate --create --time 600 --nfiles 447838 --dir /mnt/lustre/mdsrate/single --filefmt 'f%%d' "
      --------------------------------------------------------------------------
      It looks like opal_init failed for some reason; your parallel process is
      likely to abort. There are many reasons that a parallel process can
      fail during opal_init; some of which are due to configuration or
      environment problems. This failure appears to be an internal failure;
      here's some additional information (which may only be relevant to an
      Open MPI developer):

      opal_paffinity_base_select failed
      --> Returned value -13 instead of OPAL_SUCCESS
      --------------------------------------------------------------------------
      [client-22vm1:27504] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file runtime/orte_init.c at line 77
      [client-22vm1:27504] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file orterun.c at line 543
      log: + chmod 0777 /mnt/lustre

      Attachments

        Activity

          People

            chris Chris Gearing (Inactive)
            mdiep Minh Diep
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: