From: Brad P. <b_p...@ya...> - 2007-01-31 19:05:43
Aaah, I've finally figured it out. The very common Linksys WRT54G v5 router IS dropping inactive sockets after exactly 10 minutes.

I verified this through a process of elimination. Any time the Linksys router was used, I'd get a socket drop at 10 minutes (whether over wireless or an Ethernet cable). But when I bypassed the Linksys router and kept everything else the same, it all worked.

I think that makes any Bacula job longer than 10 minutes impossible using this Linksys router. Looks like I'm out of luck. I have updated to the newest firmware, and the Linksys config doesn't have any way to modify the timeout value. I suppose I could buy a new router, or set up a new offsite backup storage daemon. Unless anyone else has any brilliant ideas :)

Kern, if you are reading this, what are the chances that a heartbeat could be implemented between the director and the storage daemon?

Brad Peterson
b...@ya...

----- Original Message ----
From: Brad Peterson <b_p...@ya...>
To: bacula-us...@li...
Sent: Tuesday, January 30, 2007 3:00:03 PM
Subject: [Bacula-users] Director losing socket with SD

First, my request: Is there anything in Bacula I can do to keep the socket between the director and the storage daemon alive?

Now, my explanation of why I need this. As I'm trying to narrow down why my lengthy backups to an offsite storage daemon don't work, I sat and watched the debug output for the director, the storage daemon, and the file daemon. Almost exactly 10 minutes in, the director's debug output said this:

msgchan.c:333 === End msg_thread. use=2

After a lot of research, it appears what's going on is this:

1) The director starts the job.
2) A socket is opened between the director and the storage daemon.
3) A socket is opened between the file daemon and the storage daemon.
4) The file data transfers just fine over the file daemon/storage daemon socket.
5) At almost exactly 10 minutes in, I get the above debug message, which means the socket between the director and storage daemon has been closed. netstat confirmed this.
6) The file data continues to transfer just fine to the storage daemon.
7) When the file daemon is done, it tells the director that it is finished.
8) The storage daemon tries to tell the director it received the data perfectly, but cannot, because it cannot communicate with the director anymore (which makes sense, because the socket died).
9) *I think* the director waits a bit for the storage daemon, or it just knows it can't receive info from the storage daemon. In any event, the director quickly marks the job as having an error because it never heard from the storage daemon as to its final result.

So, what can I do in Bacula to keep the socket between the director and the storage daemon alive?

I've already set a heartbeat of 30 seconds, but according to the manual, the heartbeats help the file daemon talk to the director, the file daemon talk to the storage daemon, and the storage daemon talk to the file daemon. But in my situation, I'm losing a socket between the director and the storage daemon, and the heartbeat doesn't help out with that.

I'm also starting to think a Linksys router may be the reason why it loses the inactive socket after almost exactly 10 minutes, as other Linksys users have found this happens to them. I'll be able to test this out tonight by bypassing the router completely.

Anyways, in the meantime, anybody know how I can keep that socket alive?

Brad Peterson
b...@ya...
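
For readers hitting the same idle-socket drop, here is a minimal sketch of one general technique: asking the operating system to send TCP keepalive probes on an otherwise idle socket, with the idle time set well below the router's 10-minute limit. This is not Bacula's heartbeat mechanism and not something the original post tried. The function name enable_keepalive, its parameters, and the ROUTER_IDLE_TIMEOUT constant are hypothetical names chosen for illustration, and whether a given router counts keepalive probes as activity is an assumption that would need testing.

```python
import socket

# Assumption for illustration: the 10-minute idle drop reported in the post.
ROUTER_IDLE_TIMEOUT = 600  # seconds


def enable_keepalive(sock, idle=300, interval=30, probes=5):
    """Turn on OS-level TCP keepalive probes for an idle connection.

    idle     -- seconds of silence before the first probe is sent
    interval -- seconds between unanswered probes
    probes   -- unanswered probes before the OS gives up on the connection
    """
    if idle >= ROUTER_IDLE_TIMEOUT:
        raise ValueError("idle must be shorter than the router's idle timeout")

    # SO_KEEPALIVE is the portable switch that enables keepalive probing.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)

    # The per-socket tuning options use Linux names and are not available on
    # every platform; where one is missing, the system-wide default applies.
    for name, value in (("TCP_KEEPIDLE", idle),
                        ("TCP_KEEPINTVL", interval),
                        ("TCP_KEEPCNT", probes)):
        option = getattr(socket, name, None)
        if option is not None:
            sock.setsockopt(socket.IPPROTO_TCP, option, value)
    return sock
```

A caller would create a TCP socket, pass it to enable_keepalive() before or right after connecting, and then use it normally. The idle value matters because the Linux system-wide default keepalive time is 7200 seconds, far longer than a 10-minute router timeout, so enabling keepalive without lowering the idle time would not help in the situation described here.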