I have set up a VM host as a controller and built both Slurm and MUNGE. The installation seemed to complete without errors, and the daemons (slurmctld, slurmdbd) started OK. However, when I run sinfo on the controller host itself, it says it cannot communicate:
[root@localhost slurm]# sinfo
slurm_load_partitions: Unable to contact slurm controller (connect failure)
I initially set the hostname in slurm.conf, but that caused a DNS error, so I put the IP address in directly and got the error above.
Why can't sinfo contact the controller?
I am wondering whether this has anything to do with the way the daemon is running. If I check the daemon status periodically, it seems to keep exiting and re-entering the running state over and over. Is that normal? (below):