|
From: Julian S. <js...@ac...> - 2005-08-08 17:10:36
|
> The machine is relatively new (Intel Zeon hyper threading P4 3.2GHz) > and I have had problems keeping it cool, I do get a lot of messages from > syslogd about passing temperature threshold. Usually if you have overheating problems, it'll kill the system at unpredictable times and you wind up with either a complete hang of the system or a spontaneous reboot. Anyway, are you sure? P4's have pretty sophisticated thermal management and fall back to half-speed or less if the temperature gets too high; if it gets worse they shut down completely. > I need to find out if it is the overheating of the CPU or ?something > else? which causes the failure? > > My question is, what could valgrind be doing that might stop the problem > from occurring? It might be worth trying to cut out some of the variables by trying an identical software setup on a different machine to see if that makes any difference. If you are worried about the hardware, also run memtest86 for a couple hours and see if it picks up any memory problems. It's very good at doing so (imo) and easy to use; for one thing recent SuSE install CDs have it. J |