any one please help me How to install SLURM in computer node Rocks cluster 7.
i am new on rocks i am install roll and install head not but in this doucument i am not find any update how to install compute node SLURM
Please help me on this
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
before installling the slurm-roll, your headnode should be running and
properly configured, and you already
installed a compute node by running insert-ethers. If you have trouble,
please ask the rocks-discuss group.
When all is running well, read at first slurm-roll.pdf, then download
the the iso file and install it on the Headnode
as described in the file README.txt in the download area at sourceforge.
Then you can install your compute nodes
with insert-ethers. That's all.
Best regards
Werner
On 8/30/19 11:52 AM, Sandip Saha wrote:
any one please help me How to install SLURM in computer node Rocks cluster 7.
i am new on rocks i am install roll and install head not but in this doucument i am not find any update how to install compute node SLURM
as per doucument i am successfully install head note slurm roll.
but i am already running my compute node can i reinstall all my compute node with insert-ethers ??
Thanks
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Simply set the bootaction for all the compute-nodes like:
rocks set host boot compute-0-0 action=install and then reboot the
compute-nodes. Best regards Werner
On 8/31/19 7:18 AM, Sandip Saha wrote:
Hello,
Thanks for your time and support.
as per doucument i am successfully install head note slurm roll.
but i am already running my compute node can i reinstall all my compute node with insert-ethers ??
systemctl status slurmctld.service
Unit slurmctld.service could not be found.
rocks sync slurm
compute-0-0: bash: /etc/slurm/slurm-prep.sh: No such file or directory
pdsh@mnode: compute-0-0: ssh exited with exit code 127
compute-0-0: Failed to restart slurmd.service: Unit not found.
Please help me
Thanks for your support.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
systemctl status slurmctld.service
Unit slurmctld.service could not be found.
rocks sync slurm
compute-0-0: bash: /etc/slurm/slurm-prep.sh: No such file or directory
pdsh@mnode: compute-0-0: ssh exited with exit code 127
compute-0-0: Failed to restart slurmd.service: Unit not found.
any one please help me How to install SLURM in computer node Rocks cluster 7.
i am new on rocks i am install roll and install head not but in this doucument i am not find any update how to install compute node SLURM
Please help me on this
Hi,
before installling the slurm-roll, your headnode should be running and
properly configured, and you already
installed a compute node by running insert-ethers. If you have trouble,
please ask the rocks-discuss group.
When all is running well, read at first slurm-roll.pdf, then download
the the iso file and install it on the Headnode
as described in the file README.txt in the download area at sourceforge.
Then you can install your compute nodes
with insert-ethers. That's all.
Best regards
Werner
On 8/30/19 11:52 AM, Sandip Saha wrote:
Hello,
Thanks for your time and support.
as per doucument i am successfully install head note slurm roll.
but i am already running my compute node can i reinstall all my compute node with insert-ethers ??
Thanks
Hi,
you don't need to run inert-ethers again.
Simply set the bootaction for all the compute-nodes like:
rocks set host boot compute-0-0 action=install and then reboot the
compute-nodes. Best regards Werner
On 8/31/19 7:18 AM, Sandip Saha wrote:
Thanks again for your support.
i will check and confirm.
Hello,
i dont know whare i am miss out
rocks set host boot compute-0-0 action=install
ssh compute-0-0 "shutdown -r now"
after reboot compute node same problem.
systemctl status slurmctld.service
Unit slurmctld.service could not be found.
rocks sync slurm
compute-0-0: bash: /etc/slurm/slurm-prep.sh: No such file or directory
pdsh@mnode: compute-0-0: ssh exited with exit code 127
compute-0-0: Failed to restart slurmd.service: Unit not found.
Please help me
Thanks for your support.
Hi,
send me the output of:
rocks list roll
Werner
On 8/31/19 7:49 AM, Sandip Saha wrote:
hello
sorry for dealy response
[root@mnode ~]# rocks list roll
NAME VERSION ARCH ENABLED
base: 7.0 x86_64 yes
CentOS: 7.4.1708 x86_64 yes
core: 7.0 x86_64 yes
ganglia: 7.0 x86_64 yes
hpc: 7.0 x86_64 yes
htcondor: 8.6.8 x86_64 yes
kernel: 7.0 x86_64 yes
python: 7.0 x86_64 yes
sge: 7.0 x86_64 yes
Updates-CentOS-7.4.1708: 2017-12-01 x86_64 yes
cuda: 7.0 x86_64 yes
slurm: 7.0.0.220 x86_64 yes
[root@mnode ~]#
thanks for your time