From: Volker S. <vo...@vo...> - 2005-07-15 08:59:35
|
On Fr, 15 Jul 2005, Arno Lehmann wrote: > >I'll upgrade to 1.36.3 and see what happens. Maybe "Fix deadlock in > >multiple simultaneous jobs." (from ReleaseNotes) could be the right one. > >I already setup this site with 1.36.3 FileFormat because I knew it's > >going to be required! > I had the same problem of a locking DIR, which worked ok after a=20 > restart, and I could never find a reason (partly because I never=20 > investigated with gdb, but that's beyond my skills and as long as I=20 > could restart my backups rather easily that was ok). > With 1.36.3 this problem vanished. > Until yesterday. Yes, the same with me. I upgraded to 1.36.3 and the problem occured again, yesterday. Now I setup "trace on" and "setdebug 100" for dir and sd and I'm waiting for the problem to occur again! [...] > Conclusion 1: > SD problems _can_ hang the DIR. This confirms what Volker found.=20 > Probably there is some timeout in the DIR, but in such a case this=20 > timeout is either too long (my opinion) or shouldn't block console=20 > connections (and probably the rest of the DIR working). yes, that for sure! >=20 > Conclusion 2: > mtx is crap. It shouldn't segfault. I'll look at Peters replacement,=20 > although I couldn't get it to work before... I also heard about mover... I didn't have any problems with mtx so far.... >=20 > Conclusion 3: > Don't use decades-old hardware in a backup system ;-) The only device connected to the scsi bus is the Overland Loader. Actually rather new! >=20 > Conclusion 4: > When the SD hangs, check your system logs for hardware errors.=20 > Carefully. Take them serious (Had I done this yesterday morning would=20 > have been much more fun). I haven't seen hardware errors in the syslog so far, but I'll take attention as soon as the problem occurs again! --=20 Volker Sauer * Alexanderstrasse 39/217 * 64283 Darmstadt Telefon: 06151-154260 * Mobil: 0179-6901475 * ICQ#98164307 mailto:vo...@vo... * http://www.volker-sauer.de PGPKey-Fingerprint: DB2611C7B12E0B2739992E4F7E354E4D5DD5D0E0 |