I'm a long time Big Brother user and I'm trying to convert to Xymon. I've installed it and configured it to monitor about 180 machines. After running well for anywhere from a few hours to a few days, it has a problem that shows these symptoms on the Xymon server:
This all happens pretty quickly. It will be running along fine for hours. Something happens that causes those symptoms to all happen within 10-15 minutes.
This is happening with Xymon 4.3.17. I've tried it on two different serves (both HP ProLiant DL360 G4) and two different versions of Fedora (20 and 18). In both cases, Fedora was the 64-bit version and was fully updated. It was freshly installed solely for the purpose of running Xymon. The fact that it happens on two entirely different hardware and OS platforms seems to rule out everything except Xymon.
It appears that whatever happens blocks all disk write access. If I already have a login session, I can run commands like top, ps, iostat, iotop. I can't do anything that causes any writes, including starting a new login session.
Does anyone have any suggestions or recommendations? I wanted to go with Xymon because of its similarities to Big Brother and so I could continue to use some of the custom scripts I've written over the years.
FYI - After many different attempts to get this to work, I finally rebuilt it using an HP ProLiant DL360 G5 (instead of G4). I haven't had the problem since.