Redirecting to /bin/systemctl status slurmdbd.service
● slurmdbd.service - Slurm DBD accounting daemon
Loaded: loaded (/usr/lib/systemd/system/slurmdbd.service; enabled;
vendor preset: disabled)
Active: active (running) since Sun 2020-05-31 15:47:18 CST; 1h 8min ago
Process: 17825 ExecStart=/usr/sbin/slurmdbd $SLURMDBD_OPTIONS
(code=exited, status=0/SUCCESS)
Main PID: 17828 (slurmdbd)
Tasks: 1
Memory: 868.0K
CGroup: /system.slice/slurmdbd.service
└─17828 /usr/sbin/slurmdbd
May 31 15:47:18 cluster-gpu.ysu.edu.cn systemd[1]: Starting Slurm DBD
accounting daemon...
May 31 15:47:18 cluster-gpu.ysu.edu.cn systemd[1]: Can't open PID file
/var/run/slurmdbd.pid (yet?) after start: No such file or directory
May 31 15:47:18 cluster-gpu.ysu.edu.cn systemd[1]: Started Slurm DBD
accounting daemon.
And the output of "netstat -antp | grep slurm" is:
The slurmdbd is not found. So after run "sacct -j 1755". It return:
sacct: error: slurm_persist_conn_open_without_init: failed to open
persistent connection to cluster-gpu:6819: Connection refused
sacct: error: slurmdbd: Sending PersistInit msg: Connection refused
sacct: error: Problem talking to the database: Connection refused
The output of "service slurmdbd status" is:
Redirecting to /bin/systemctl status slurmdbd.service
● slurmdbd.service - Slurm DBD accounting daemon
Loaded: loaded (/usr/lib/systemd/system/slurmdbd.service; enabled;
vendor preset: disabled)
Active: active (running) since Sun 2020-05-31 15:47:18 CST; 1h 8min ago
Process: 17825 ExecStart=/usr/sbin/slurmdbd $SLURMDBD_OPTIONS
(code=exited, status=0/SUCCESS)
Main PID: 17828 (slurmdbd)
Tasks: 1
Memory: 868.0K
CGroup: /system.slice/slurmdbd.service
└─17828 /usr/sbin/slurmdbd
May 31 15:47:18 cluster-gpu.ysu.edu.cn systemd[1]: Starting Slurm DBD
accounting daemon...
May 31 15:47:18 cluster-gpu.ysu.edu.cn systemd[1]: Can't open PID file
/var/run/slurmdbd.pid (yet?) after start: No such file or directory
May 31 15:47:18 cluster-gpu.ysu.edu.cn systemd[1]: Started Slurm DBD
accounting daemon.
And the output of "netstat -antp | grep slurm" is:
tcp 0 0 0.0.0.0:6817 0.0.0.0: LISTEN
17776/slurmctld
tcp 0 0 0.0.0.0:6818 0.0.0.0: LISTEN
17747/slurmd
The slurmdbd is not found. So after run "sacct -j 1755". It return:
sacct: error: slurm_persist_conn_open_without_init: failed to open
persistent connection to cluster-gpu:6819: Connection refused
sacct: error: slurmdbd: Sending PersistInit msg: Connection refused
sacct: error: Problem talking to the database: Connection refused
The log of slurmdbd is empty,.Is something wrong?
It has been sloved.
My MariaDB is not running. No database listen on port 3306.
After restart the MariaDB service, slurmdbd is runing on port 6819