Menu

Tree [4fdf40] master /
 History

HTTPS access


File Date Author Commit
 debian 2020-04-24 Michael Meskes Michael Meskes [8d6489] Pepare for upload new release.
 examples 2013-09-10 Michael Meskes Michael Meskes [a1c866] Change error codes to be compatible with old ne...
 include 2019-07-10 Paul Crawford Paul Crawford [03d67d] Separate error-timer for allocatable memory
 redhat 2017-07-31 Paul Crawford Paul Crawford [d7f483] Fix automated CentOS 7 build
 src 2021-10-14 Paul Crawford Paul Crawford [4fdf40] Minor tidy
 suse 2009-03-13 Michael Meskes Michael Meskes [62aec6] Some resturcturing of distribution files.
 .gitignore 2019-02-27 Paul Crawford Paul Crawford [7e1479] Ignore coverity tool's directory
 AUTHORS 2009-03-12 Michael Meskes Michael Meskes [21cba2] Imported version 5.2.5
 COPYING 2020-04-24 Michael Meskes Michael Meskes [be8aec] Updated copyright notice
 ChangeLog 2011-12-15 Michael Meskes Michael Meskes [6ba2c1] Added reference to git log which replaced our C...
 IAFA-PACKAGE 2020-04-24 Michael Meskes Michael Meskes [8d6489] Pepare for upload new release.
 INSTALL 2009-03-12 Michael Meskes Michael Meskes [1c2686] Imported version 5.3
 Makefile.am 2015-11-30 Paul Crawford Paul Crawford [da8f1e] Apply patch for '#7 Fix out of tree build suppo...
 NEWS 2009-03-12 Michael Meskes Michael Meskes [4c3b39] Imported version 4.2
 README 2009-03-12 Michael Meskes Michael Meskes [1e99dd] Imported version 4.3
 README.watchdog.ipmi 2009-03-15 Michael Meskes Michael Meskes [1e71bd] Added two READMEs provided by Fedora/RedHat pac...
 TODO 2019-03-06 Paul Crawford Paul Crawford [b03517] Update TODO list
 configure.ac 2020-01-03 Michael Meskes Michael Meskes [c3e11c] It's time to release a new version.
 create_tarball.sh 2014-04-26 Michael Meskes Michael Meskes [95713b] Some changes for newer autotools
 watchdog.8 2019-02-27 Paul Crawford Paul Crawford [623896] Update man pages
 watchdog.conf 2019-07-09 Paul Crawford Paul Crawford [76b509] Add separate test of swap use
 watchdog.conf.5 2019-07-09 Paul Crawford Paul Crawford [76b509] Add separate test of swap use
 watchdog.lsm 2020-04-24 Michael Meskes Michael Meskes [8d6489] Pepare for upload new release.
 watchdog.sysconfig 2009-03-12 Michael Meskes Michael Meskes [78e101] Imported version 5.2
 wd_identify.8 2015-12-07 Paul Crawford Paul Crawford [265264] Updated wd_identify man page to correct minor e...
 wd_keepalive.8 2015-06-05 Paul Crawford Paul Crawford [a9c5ec] Added the --loop-exit command line option to wd...

Read Me

Watchdog is a daemon that checks if your system is still working. If
programs in user space are not longer executed it will reboot the system.
However, this will not always work.

From the kernel:

>	Watchdog Timer Interfaces For The Linux Operating System
>
>		Alan Cox <alan@lxorguk.ukuu.org.uk>
>
>	    Custom Linux Driver And Program Development
>
>[...]
>
>All four interfaces provide /dev/watchdog, which when open must be written
>to within a minute or the machine will reboot. Each write delays the reboot
>time another minute. In the case of the software watchdog the ability to 
>reboot will depend on the state of the machines and interrupts. The hardware
>boards physically pull the machine down off their own onboard timers and
>will reboot from almost anything.

This tool proved very useful for me, because I always run the latest kernel
and the latest libc release, but rely on the machine to be up and running
for email.

Note, that you have to enable the software watchdog driver in your kernel
for the program to be able to hard reset the system unless you have a
hardware watchdog of course. Make sure you don't compile watchdog with a
different timer margin than the kernel driver.

As of version 4.0 watchdog is able to make itself a real-time application.
It will lock all its pages in memory and set itīs scheduler to round-robin.
This will take up to 900 pages of memory but guarantees you that watchdog
will get its share of processor time. If you ever experienced a hard reset
just because watchdog wasnīt scheduled for a minute, you will probably have
no problem with the 900 pages which is not so much anyway.

Of course this is not needed for machines running on low load.

Don't be surprised if you see any zombie processes laying around. This is
normal. They will be removed the next time watchdog wakes up. Of course a
new zombie is created then, too. This zombie process is a result of the
process table check.

Michael
meskes@debian.org