[Scalablecr-discuss] Question about using SCR with SLURM
From: Teranishi, K. <kn...@sa...> - 2014-04-18 20:42:20
Hi,

I have been trying to run an OpenMPI program with SCR on a PC cluster. This machine uses SLURM as its batch scheduler. In an interactive session, I have been trying to run the example program test_api 3 times in the same batch session (I allocated 2 extra nodes). However, the second and third executions try to pick up the same nodes, which may have hanging processes (processes terminated without MPI_Finalize). I'd like to know whether I have to set any environment variables or do something with SLURM so that the spare nodes are picked up. Please let me know.

Here are the error messages for the second and third executions.

SCR ERROR: rank 0 on chama648: Failed to read username or jobname from environment, disabling logging @ scr.c:6451
SCR: rank 0 on chama648: Job exiting: Reason: SCR_FINALIZE_CALLED.
--------------------------------------------------------------------------
mpirun has exited due to process rank 122 with PID 171448 on node chama1046 exiting without calling "finalize". This may have caused other processes in the application to be terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------

SCR ERROR: rank 0 on chama648: Failed to read username or jobname from environment, disabling logging @ scr.c:6451
SCR: rank 0 on chama648: Job exiting: Reason: SCR_FINALIZE_CALLED.
--------------------------------------------------------------------------
mpirun has exited due to process rank 11 with PID 148920 on node chama648 exiting without calling "finalize". This may have caused other processes in the application to be terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------

--
Keita Teranishi
Principal Member of Technical Staff
Scalable Modeling and Analysis Systems
Sandia National Laboratories
Livermore, CA 94551
+1 (925) 294-3738