Hi , I have a lots of error trying to reinstall slurm , see bellow , could you please tell me if its normal ?
rocks run roll slurm|sh
Loaded plugins: fastestmirror, langpacks
Cleaning repos: Rocks-7.0
Cleaning up everything
Maybe you want: rm -rf /var/cache/yum, to also free up space taken by orphaned data from disabled or removed repos
Cleaning up list of fastest mirrors
Loaded plugins: fastestmirror, langpacks
Rocks-7.0 | 3.6 kB 00:00:00
(1/2): Rocks-7.0/primary_db | 5.8 MB 00:00:00
(2/2): Rocks-7.0/group_gz | 156 kB 00:00:00
Determining fastest mirrors
Package hwloc-1.11.2-2.el7.x86_64 already installed and latest version
Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
Package 1:mariadb-server-5.5.56-2.el7.x86_64 already installed and latest version
Package munge-0.5.13-3.el7.centos.x86_64 already installed and latest version
Package munge-libs-0.5.13-3.el7.centos.x86_64 already installed and latest version
Package pdsh-2.26-1.x86_64 already installed and latest version
Package rocks-command-slurm-7.0.0-17.02.07.08.x86_64 already installed and latest version
Package slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-devel-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-munge available.
Package slurm-pam_slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-perlapi-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-plugins available.
Package slurm-rolldoc-7.0.0-17.11.5.x86_64 already installed and latest version
Package slurm-slurmctld-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-slurmd-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-slurmdbd-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-sql available.
No package slurm-sql available.
Package slurm-torque-17.11.5-1.el7.centos.x86_64 already installed and latest version
Nothing to do
/bin/mkdir: cannot create directory '/etc/slurm': File exists
/bin/mkdir: cannot create directory '/var/spool/slurmd': File exists
/bin/mkdir: cannot create directory '/var/log/slurm': File exists
/bin/mkdir: cannot create directory '/var/spool/slurm.checkpoint': File exists
/bin/mkdir: cannot create directory '/etc/slurm': File exists
FILES += /etc/slurm/slurm.conf
FILES += /etc/slurm/head.conf
FILES += /etc/slurm/node.conf
FILES += /etc/slurm/parts.conf
FILES += /etc/slurm/topo.conf
FILES += /etc/slurm/cgroup.conf
FILES += /etc/slurm/gres.conf.1
FILES += /etc/slurm/gres.conf.2
FILES += /etc/slurm/gres.conf.3
FILES += /etc/slurm/gres.conf.4
ERROR 1007 (HY000) at line 1: Can't create database 'slurm_acct_db'; database exists
mkdir: cannot create directory '/var/spool/slurm.state': File exists
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_QOS failure: No error
sacctmgr: error: We need a qos list to translate
You gave a bad default qos 'normal'. Use 'list qos' to get complete list.
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
Problem getting clusters from database. Contact your admin.
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
Problem getting clusters from database. Contact your admin.
WARNING: The command:
sacctmgr -i create cluster jcluster
failed. Please run this command again
ERROR 1008 (HY000) at line 1: Can't drop database 'test'; database doesn't exist
Thanks
Chris
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
should only be run once at the first install time.
A second run will fail.
On 03/22/2018 04:59 AM, Christophe Guilbert wrote:
Hi , I have a lots of error trying to reinstall slurm , see bellow , could you please tell me if its normal ?
rocks run roll slurm|sh
Loaded plugins: fastestmirror, langpacks
Cleaning repos: Rocks-7.0
Cleaning up everything
Maybe you want: rm -rf /var/cache/yum, to also free up space taken by orphaned data from disabled or removed repos
Cleaning up list of fastest mirrors
Loaded plugins: fastestmirror, langpacks
Rocks-7.0 | 3.6 kB 00:00:00
(1/2): Rocks-7.0/primary_db | 5.8 MB 00:00:00
(2/2): Rocks-7.0/group_gz | 156 kB 00:00:00
Determining fastest mirrors
Package hwloc-1.11.2-2.el7.x86_64 already installed and latest version
Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
Package 1:mariadb-server-5.5.56-2.el7.x86_64 already installed and latest version
Package munge-0.5.13-3.el7.centos.x86_64 already installed and latest version
Package munge-libs-0.5.13-3.el7.centos.x86_64 already installed and latest version
Package pdsh-2.26-1.x86_64 already installed and latest version
Package rocks-command-slurm-7.0.0-17.02.07.08.x86_64 already installed and latest version
Package slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-devel-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-munge available.
Package slurm-pam_slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-perlapi-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-plugins available.
Package slurm-rolldoc-7.0.0-17.11.5.x86_64 already installed and latest version
Package slurm-slurmctld-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-slurmd-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-slurmdbd-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-sql available.
No package slurm-sql available.
Package slurm-torque-17.11.5-1.el7.centos.x86_64 already installed and latest version
Nothing to do
/bin/mkdir: cannot create directory '/etc/slurm': File exists
/bin/mkdir: cannot create directory '/var/spool/slurmd': File exists
/bin/mkdir: cannot create directory '/var/log/slurm': File exists
/bin/mkdir: cannot create directory '/var/spool/slurm.checkpoint': File exists
/bin/mkdir: cannot create directory '/etc/slurm': File exists
FILES += /etc/slurm/slurm.conf
FILES += /etc/slurm/head.conf
FILES += /etc/slurm/node.conf
FILES += /etc/slurm/parts.conf
FILES += /etc/slurm/topo.conf
FILES += /etc/slurm/cgroup.conf
FILES += /etc/slurm/gres.conf.1
FILES += /etc/slurm/gres.conf.2
FILES += /etc/slurm/gres.conf.3
FILES += /etc/slurm/gres.conf.4
ERROR 1007 (HY000) at line 1: Can't create database 'slurm_acct_db'; database exists
mkdir: cannot create directory '/var/spool/slurm.state': File exists
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_QOS failure: No error
sacctmgr: error: We need a qos list to translate
You gave a bad default qos 'normal'. Use 'list qos' to get complete list.
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
Problem getting clusters from database. Contact your admin.
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
Problem getting clusters from database. Contact your admin.
WARNING: The command:
sacctmgr -i create cluster jcluster
failed. Please run this command again
ERROR 1008 (HY000) at line 1: Can't drop database 'test'; database doesn't exist
Hi , I have a lots of error trying to reinstall slurm , see bellow , could you please tell me if its normal ?
rocks run roll slurm|sh
Loaded plugins: fastestmirror, langpacks
Cleaning repos: Rocks-7.0
Cleaning up everything
Maybe you want: rm -rf /var/cache/yum, to also free up space taken by orphaned data from disabled or removed repos
Cleaning up list of fastest mirrors
Loaded plugins: fastestmirror, langpacks
Rocks-7.0 | 3.6 kB 00:00:00
(1/2): Rocks-7.0/primary_db | 5.8 MB 00:00:00
(2/2): Rocks-7.0/group_gz | 156 kB 00:00:00
Determining fastest mirrors
Package hwloc-1.11.2-2.el7.x86_64 already installed and latest version
Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
Package 1:mariadb-5.5.56-2.el7.x86_64 already installed and latest version
Package 1:mariadb-server-5.5.56-2.el7.x86_64 already installed and latest version
Package munge-0.5.13-3.el7.centos.x86_64 already installed and latest version
Package munge-libs-0.5.13-3.el7.centos.x86_64 already installed and latest version
Package pdsh-2.26-1.x86_64 already installed and latest version
Package rocks-command-slurm-7.0.0-17.02.07.08.x86_64 already installed and latest version
Package slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-devel-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-munge available.
Package slurm-pam_slurm-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-perlapi-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-plugins available.
Package slurm-rolldoc-7.0.0-17.11.5.x86_64 already installed and latest version
Package slurm-slurmctld-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-slurmd-17.11.5-1.el7.centos.x86_64 already installed and latest version
Package slurm-slurmdbd-17.11.5-1.el7.centos.x86_64 already installed and latest version
No package slurm-sql available.
No package slurm-sql available.
Package slurm-torque-17.11.5-1.el7.centos.x86_64 already installed and latest version
Nothing to do
/bin/mkdir: cannot create directory '/etc/slurm': File exists
/bin/mkdir: cannot create directory '/var/spool/slurmd': File exists
/bin/mkdir: cannot create directory '/var/log/slurm': File exists
/bin/mkdir: cannot create directory '/var/spool/slurm.checkpoint': File exists
/bin/mkdir: cannot create directory '/etc/slurm': File exists
FILES += /etc/slurm/slurm.conf
FILES += /etc/slurm/head.conf
FILES += /etc/slurm/node.conf
FILES += /etc/slurm/parts.conf
FILES += /etc/slurm/topo.conf
FILES += /etc/slurm/cgroup.conf
FILES += /etc/slurm/gres.conf.1
FILES += /etc/slurm/gres.conf.2
FILES += /etc/slurm/gres.conf.3
FILES += /etc/slurm/gres.conf.4
ERROR 1007 (HY000) at line 1: Can't create database 'slurm_acct_db'; database exists
mkdir: cannot create directory '/var/spool/slurm.state': File exists
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_QOS failure: No error
sacctmgr: error: We need a qos list to translate
You gave a bad default qos 'normal'. Use 'list qos' to get complete list.
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
Problem getting clusters from database. Contact your admin.
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_MODIFY_QOS failure: No error
Error with request: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to jcluster:6819: Unable to connect to database
sacctmgr: error: slurmdbd: Sending PersistInit msg: No error
sacctmgr: error: slurmdbd: DBD_GET_CLUSTERS failure: No error
Problem getting clusters from database. Contact your admin.
WARNING: The command:
sacctmgr -i create cluster jcluster
failed. Please run this command again
ERROR 1008 (HY000) at line 1: Can't drop database 'test'; database doesn't exist
Thanks
Chris
The command:
rocks run roll slurm|sh
should only be run once at the first install time.
A second run will fail.
On 03/22/2018 04:59 AM, Christophe Guilbert wrote: