From: Kern S. <ke...@si...> - 2010-01-27 16:36:55
|
Hello Georg, Nice to hear from you, sorry it is because of Bacula problems. I've copied the bacula-devel list because they are generally much better at answering these kinds of problems than I am. On Tuesday 26 January 2010 21:39:48 Georg C. F. Greve wrote: > Dear Kern, > > Apologies for droppping this on you directly. I hope you'll forgive me and > give me a hint on a problem with Bacula that seriously confuses me. > > I have a hosted machine (nine) that is connected to my server (fusebox) at > home through an OpenVPN tunnel and is backed up every couple of hours. > > This used to fine until recently I updated one of the components. What did you update? > > Now the bacula director correctly triggers the update. What does that mean? The Director doesn't trigger any updates that I am aware of -- it does start Jobs. > > The client correctly connects to the storage daemon, and the storage daemon > correctly receives the data. In fact both the client & the storage daemon > are convinced the backup succeeded. > > Unfortunately the director believes it failed, and the next time it > triggers a backup of too high a level - resulting in a constantly failed > state for the backup and substantially duplicated data traffic on a server > that has a maximum of data per month... :-/ > > So what to do when Bacula becomes schizophrenic? ;) I notice that you are running a 2.4.4 FD with a 3.0.2 Director/SD. Perhaps there is some compatibility issue there. The first thing I would look at is upgrading the FD. The second thing that I see is that the Dir complains that the comm connection with the FD was dropped so the Dir never received a good status. This is the reason the Director is marking the job as failed. The dropped comm connection could be either from a mismatched Dir/FD or perhaps a comm line or VPN problem. If you start by upgrading the FD, and the problem persists, I would look at possible problems with the VPN -- in particular, perhaps it is timing out, and setting a HeartBeat of 5 mins in both the Dir and FD conf files could fix the problem if it is a network timeout. Hope that helps, Kern > > Best regards, > Georg > > > > > 26-Jan 18:45 fusebox-dir JobId 5500: Start Backup JobId 5500, > Job=Nine.2010-01-26_18.45.52_04 > 26-Jan 18:46 fusebox-dir JobId 5500: Created new Volume "2010-1-26.5500" in > catalog. > 26-Jan 18:46 fusebox-dir JobId 5500: Using Device "FileStorage" > 26-Jan 18:46 fusebox-sd JobId 5500: Warning: dev.c:534 dev.c:532 Could not > open: /media/backup/bacula/2010-1-26.5500, ERR=No such file or directory > 26-Jan 18:46 fusebox-sd JobId 5500: Warning: dev.c:534 dev.c:532 Could not > open: /media/backup/bacula/2010-1-26.5500, ERR=No such file or directory > 26-Jan 18:46 fusebox-sd JobId 5500: Labeled new Volume "2010-1-26.5500" on > device "FileStorage" (/media/backup/bacula). > 26-Jan 18:46 fusebox-sd JobId 5500: Wrote label to prelabeled Volume > "2010-1-26.5500" on device "FileStorage" (/media/backup/bacula) > 26-Jan 18:46 fusebox-dir JobId 5500: Max Volume jobs exceeded. Marking > Volume "2010-1-26.5500" as Used. > 26-Jan 20:12 fusebox-sd JobId 5500: Job write elapsed time = 01:26:32, > Transfer rate = 657.5 K bytes/second > 26-Jan 20:46 fusebox-dir JobId 5500: Fatal error: Network error with FD > during Backup: ERR=Connection reset by peer > 26-Jan 20:46 fusebox-dir JobId 5500: Fatal error: No Job status returned > from FD. > 26-Jan 20:46 fusebox-dir JobId 5500: Error: Bacula fusebox-dir 3.0.2 > (18Jul09): 26-Jan-2010 20:46:01 > Build OS: i486-pc-linux-gnu debian squeeze/sid > JobId: 5500 > Job: Nine.2010-01-26_18.45.52_04 > Backup Level: Full > Client: "nine-fd" 2.4.4 (28Dec08) i486-pc-linux- > gnu,debian,5.0 > FileSet: "FullSet" 2008-02-03 12:14:03 > Pool: "File" (From Job resource) > Catalog: "MainCatalog" (From Client resource) > Storage: "File" (From Job resource) > Scheduled time: 26-Jan-2010 18:45:41 > Start time: 26-Jan-2010 18:46:00 > End time: 26-Jan-2010 20:46:01 > Elapsed time: 2 hours 1 sec > Priority: 15 > FD Files Written: 0 > SD Files Written: 150,723 > FD Bytes Written: 0 (0 B) > SD Bytes Written: 3,414,186,221 (3.414 GB) > Rate: 0.0 KB/s > Software Compression: None > VSS: no > Encryption: no > Accurate: no > Volume name(s): 2010-1-26.5500 > Volume Session Id: 1 > Volume Session Time: 1264527721 > Last Volume Bytes: 3,422,080,423 (3.422 GB) > Non-fatal FD errors: 0 > SD Errors: 0 > FD termination status: Error > SD termination status: OK > Termination: *** Backup Error *** > > > Storage Director: > > Connecting to Storage daemon File at fusebox.lair:9103 > > fusebox-sd Version: 3.0.2 (18 July 2009) i486-pc-linux-gnu debian > squeeze/sid Daemon started 26-Jan-10 18:42, 1 Job run since started. > Heap: heap=348,160 smbytes=88,955 max_bytes=156,526 bufs=93 max_bufs=110 > Sizes: boffset_t=8 size_t=4 int32_t=4 int64_t=8 > > Running Jobs: > No Jobs running. > ==== > > Jobs waiting to reserve a drive: > ==== > > JobId Level Files Bytes Status Finished Name > =================================================================== > 5490 Incr 0 0 Cancel 26-Jan-10 03:40 Katana > 5491 Incr 0 0 Cancel 26-Jan-10 03:43 misanthrope > 5493 Incr 0 0 Cancel 26-Jan-10 03:43 Reason > 5494 Incr 99 1.054 M OK 26-Jan-10 03:44 Tmfkah > 5495 Incr 184 627.4 M OK 26-Jan-10 03:54 Fusebox > 5496 Full 1 287.1 M OK 26-Jan-10 05:14 BackupCatalog > 5497 Incr 34,567 741.5 M OK 26-Jan-10 09:18 Nine > 5498 Incr 34,708 742.1 M OK 26-Jan-10 12:18 Nine > 5499 Incr 34,830 742.5 M OK 26-Jan-10 15:18 Nine > 5500 Full 150,723 3.414 G OK 26-Jan-10 20:12 Nine > > > File Director: > > nine-fd Version: 2.4.4 (28 December 2008) i486-pc-linux-gnu debian 5.0 > Daemon started 26-Jan-10 18:43, 1 Job run since started. > Heap: heap=675,840 smbytes=84,853 max_bytes=584,317 bufs=81 max_bufs=1,499 > Sizeof: boffset_t=8 size_t=4 debug=0 trace=0 > > Running Jobs: > Director connected at: 26-Jan-10 21:31 > No Jobs running. > ==== > > Terminated Jobs: > JobId Level Files Bytes Status Finished Name > ====================================================================== > 5484 Incr 33,623 730.5 M OK 25-Jan-10 09:18 Nine > 5485 Incr 33,713 731.1 M OK 25-Jan-10 12:18 Nine > 5486 Incr 33,739 732.5 M OK 25-Jan-10 15:18 Nine > 5487 Incr 33,833 733.2 M OK 25-Jan-10 18:18 Nine > 5488 Incr 34,252 734.8 M OK 25-Jan-10 21:18 Nine > 5489 Incr 34,389 735.6 M OK 26-Jan-10 01:58 Nine > 5497 Incr 34,567 736.3 M OK 26-Jan-10 09:18 Nine > 5498 Incr 34,708 737.0 M OK 26-Jan-10 12:18 Nine > 5499 Incr 34,830 737.4 M OK 26-Jan-10 15:18 Nine > 5500 Full 150,723 3.392 G OK 26-Jan-10 20:12 Nine > ==== |