A lot of error messages when reinstalling slurm roll

2018-03-22
  • Christophe Guilbert

    Hi, I get a lot of errors when trying to reinstall slurm (see below). Could you please tell me if this is normal?

    rocks run roll slurm|sh
    Loaded plugins: fastestmirror, langpacks
    Cleaning repos: Rocks-7.0
    Cleaning up everything
    Maybe you want: rm -rf /var/cache/yum, to also free up space taken by orphaned data from disabled or removed repos
    Cleaning up list of fastest mirrors
    Loaded plugins: fastestmirror, langpacks
    Rocks-7.0 | 3.6 kB 00:00:00
    (1/2): Rocks-7.0/primary_db | 5.8 MB 00:00:00
    (2/2): Rocks-7.0/group_gz | 156 kB 00:00:00
    Determining fastest mirrors
    Package hwloc-1.11.2-2.el7.x86_64 already installed and latest version
    Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
    Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
    Package 1:mariadb-server-5.5.56-2.el7.x86_64 already installed and latest version
    Package munge-0.5.13-3.el7.centos.x86_64 already installed and latest version
    Package munge-libs-0.5.13-3.el7.centos.x86_64 already installed and latest version
    Package pdsh-2.26-1.x86_64 already installed and latest version
    Package rocks-command-slurm-7.0.0-17.02.07.08.x86_64 already installed and latest version
    Package slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
    Package slurm-devel-17.11.5-1.el7.centos.x86_64 already installed and latest version
    No package slurm-munge available.
    Package slurm-pam_slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
    Package slurm-perlapi-17.11.5-1.el7.centos.x86_64 already installed and latest version
    No package slurm-plugins available.
    Package slurm-rolldoc-7.0.0-17.11.5.x86_64 already installed and latest version
    Package slurm-slurmctld-17.11.5-1.el7.centos.x86_64 already installed and latest version
    Package slurm-slurmd-17.11.5-1.el7.centos.x86_64 already installed and latest version
    Package slurm-slurmdbd-17.11.5-1.el7.centos.x86_64 already installed and latest version
    No package slurm-sql available.
    No package slurm-sql available.
    Package slurm-torque-17.11.5-1.el7.centos.x86_64 already installed and latest version
    Nothing to do
    /bin/mkdir: cannot create directory '/etc/slurm': File exists
    /bin/mkdir: cannot create directory '/var/spool/slurmd': File exists
    /bin/mkdir: cannot create directory '/var/log/slurm': File exists
    /bin/mkdir: cannot create directory '/var/spool/slurm.checkpoint': File exists
    /bin/mkdir: cannot create directory '/etc/slurm': File exists
    FILES += /etc/slurm/slurm.conf
    FILES += /etc/slurm/head.conf
    FILES += /etc/slurm/node.conf
    FILES += /etc/slurm/parts.conf
    FILES += /etc/slurm/topo.conf
    FILES += /etc/slurm/cgroup.conf
    FILES += /etc/slurm/gres.conf.1
    FILES += /etc/slurm/gres.conf.2
    FILES += /etc/slurm/gres.conf.3
    FILES += /etc/slurm/gres.conf.4
    ERROR 1007 (HY000) at line 1: Can't create database 'slurm_acct_db'; database exists
    mkdir: cannot create directory '/var/spool/slurm.state': File exists
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
    Error with request: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurmdbd: DBD_GET_QOS failure: No error
    sacctmgr: error: We need a qos list to translate
    You gave a bad default qos 'normal'. Use 'list qos' to get complete list.
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
    Error with request: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
    Problem getting clusters from database. Contact your admin.
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
    Error with request: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
    sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
    sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
    Problem getting clusters from database. Contact your admin.

    WARNING: The command:

    sacctmgr -i create cluster jcluster

    failed. Please run this command again

    ERROR 1008 (HY000) at line 1: Can't drop database 'test'; database doesn't exist

    Thanks

    Chris

     
    • Werner Saar

      Werner Saar - 2018-03-22

      The command:

      rocks run roll slurm|sh

      should be run only once, at first install time.
      A second run will fail.
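
One way to avoid an accidental second run is to guard the command. The sketch below is only an illustration and not part of the roll: the function names are hypothetical, and treating an existing /etc/slurm/slurm.conf as the sign of a completed first install is an assumption based on the paths that appear in the log output in this thread.

```shell
#!/bin/sh
# Hypothetical guard around the one-time install command.
# Assumption: a present slurm.conf means the first install already ran.

# Succeeds if slurm.conf exists in the given directory (default: /etc/slurm).
slurm_roll_configured() {
    [ -f "${1:-/etc/slurm}/slurm.conf" ]
}

# Runs the roll's install script only when no existing configuration is found.
run_roll_once() {
    if slurm_roll_configured "$1"; then
        echo "slurm roll already configured; not re-running" >&2
        return 1
    fi
    rocks run roll slurm | sh
}
```

Calling `run_roll_once` on a frontend where the configuration file already exists prints the message and returns a non-zero status instead of re-running the install script.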


