Menu

Error: Could not reach a master server

2018-06-13
2018-06-13
  • mahmood naderan

    mahmood naderan - 2018-06-13

    Hi Werner,
    Do you know what does the following output mean?

    [root@rocks7 /etc]# rocks sync slurm
    compute-0-6: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    compute-0-5: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    compute-0-4: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    compute-0-2: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    compute-0-0: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    compute-0-1: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    compute-0-3: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
    

    Slurm processes are fine on the frontend (10.1.1.1)

    [root@rocks7 etc]# ps aux | grep slurm
    kouhika+  1733  0.0  0.0   4348   336 pts/23   S+   Jun10   0:00 /usr/libexec/slurm-spank-x11 -t compute-0-6 -i 629.0 -cgw -s ssh -o
    kouhika+  1734  4.1  0.0 186380 14160 pts/23   S+   Jun10 218:47 ssh -Y compute-0-6 /usr/libexec/slurm-spank-x11 -i 629.0 -c -g -w -s "ssh" -o ""
    root      2823  0.0  0.0 340988  2832 ?        Sl   Apr13   0:31 /usr/sbin/slurmdbd
    root      3149  0.0  0.0 146200  2632 ?        S    May19   0:10 /usr/sbin/slurmd
    root     21618  0.0  0.0 1007740 4188 ?        Sl   18:23   0:00 /usr/sbin/slurmctld
    root     22253  0.0  0.0 273228  3200 ?        Sl   07:19   0:21 slurmstepd: [685.batch]
    azimi    22257  0.0  0.0   9696  1536 ?        S    07:19   0:00 /bin/bash /var/spool/slurmd/job00685/slurm_script
    root     22467  0.0  0.0 112712   964 pts/0    S+   18:25   0:00 grep --color=auto slurm
    
     
    • Werner Saar

      Werner Saar - 2018-06-13

      Hi,
      the httpd daemon on the Head-Node is listening on the ports 80, 372 and 443.
      It seems, that the httpd daemon is not running.

      Werner

      On 06/13/2018 03:56 PM, mahmood naderan wrote:

      Hi Werner,
      Do you know what does the following output mean?

      ~~~
      [root@rocks7 /etc]# rocks sync slurm
      compute-0-6: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      compute-0-5: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      compute-0-4: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      compute-0-2: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      compute-0-0: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      compute-0-1: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      compute-0-3: Error: Could not reach a master server. Masters: [http://10.1.1.1:372/411.d/ (-1)]
      ~~~

      Slurm processes are fine on the frontend (10.1.1.1)

      ~~~
      [root@rocks7 etc]# ps aux | grep slurm
      kouhika+ 1733 0.0 0.0 4348 336 pts/23 S+ Jun10 0:00 /usr/libexec/slurm-spank-x11 -t compute-0-6 -i 629.0 -cgw -s ssh -o
      kouhika+ 1734 4.1 0.0 186380 14160 pts/23 S+ Jun10 218:47 ssh -Y compute-0-6 /usr/libexec/slurm-spank-x11 -i 629.0 -c -g -w -s "ssh" -o ""
      root 2823 0.0 0.0 340988 2832 ? Sl Apr13 0:31 /usr/sbin/slurmdbd
      root 3149 0.0 0.0 146200 2632 ? S May19 0:10 /usr/sbin/slurmd
      root 21618 0.0 0.0 1007740 4188 ? Sl 18:23 0:00 /usr/sbin/slurmctld
      root 22253 0.0 0.0 273228 3200 ? Sl 07:19 0:21 slurmstepd: [685.batch]
      azimi 22257 0.0 0.0 9696 1536 ? S 07:19 0:00 /bin/bash /var/spool/slurmd/job00685/slurm_script
      root 22467 0.0 0.0 112712 964 pts/0 S+ 18:25 0:00 grep --color=auto slurm
      ~~~


      Error: Could not reach a master server


      Sent from sourceforge.net because you indicated interest in https://sourceforge.net/p/slurm-roll/discussion/general/

      To unsubscribe from further messages, please visit https://sourceforge.net/auth/subscriptions/

       
      • mahmood naderan

        mahmood naderan - 2018-06-13

        Yes. the httpd was dead!
        Thanks

         

        Last edit: mahmood naderan 2018-06-13

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.