From: <bac...@li...> - 2005-10-25 13:43:14
|
The following bug has been CLOSED ====================================================================== http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000450 ====================================================================== Reported By: Lomat Assigned To: ====================================================================== Project: bacula Bug ID: 450 Category: Director Reproducibility: always Severity: crash Priority: normal Status: closed ====================================================================== Date Submitted: 10-19-2005 01:58 PDT Last Modified: 10-25-2005 06:42 PDT ====================================================================== Summary: Fatal error: bnet.c:209 Packet size too big from Director daemon Description: Director crashes when bconsole tries to connect to it. I installed bacula via the FreeBSD ports system (server and client ports) and selected the MySQL database option. ====================================================================== ---------------------------------------------------------------------- Lomat - 10-20-2005 00:23 PDT ---------------------------------------------------------------------- I though that I would leave the director running and see if it still went ahead to do a backup. It did try :) But i'm also getting a crash when the director is trying to authenticate with the storage daemon... 20-Oct 01:05 lynott-dir: Created new FileSet record "lynott" 2005-10-20 01:05:02 20-Oct 01:05 lynott-dir: No prior Full backup Job record found. 20-Oct 01:05 lynott-dir: No prior or suitable Full backup found. Doing FULL backup. 20-Oct 01:05 lynott-dir: Start Backup JobId 1, Job=lynott.2005-10-20_01.05.00 20-Oct 01:05 lynott-dir: lynott.2005-10-20_01.05.00 Fatal error: authenticate.c:92 Unable to authenticate wit h Storage daemon. Possible causes: Passwords or names not the same or Maximum Concurrent Jobs exceeded on the SD or SD networking messed up (restart daemon). Please see http://www.bacula.org/html-manual/faq.html#AuthorizationErrors for help. 20-Oct 01:05 lynott-dir: lynott.2005-10-20_01.05.00 Fatal error: bnet.c:209 Packet size too big from "Storage daemon:lynott.as6911.net:9103. Terminating connection. 20-Oct 01:05 lynott-dir: lynott.2005-10-20_01.05.00 Error: Bacula 1.36.3 (22Apr05): 20-Oct-2005 01:05:07 JobId: 1 Job: lynott.2005-10-20_01.05.00 Backup Level: Full (upgraded from Incremental) Client: lynott-fd FileSet: "lynott" 2005-10-20 01:05:02 Pool: "lynott.full" Storage: "lynott-sd" Start time: 20-Oct-2005 01:05:02 End time: 20-Oct-2005 01:05:07 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 SD Bytes Written: 0 Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 0 Volume Session Time: 0 Last Volume Bytes: 0 Non-fatal FD errors: 0 SD Errors: 0 FD termination status: SD termination status: Termination: *** Backup Error *** ---------------------------------------------------------------------- Lomat - 10-20-2005 05:58 PDT ---------------------------------------------------------------------- Updated system to FreeBSD 5.4-STABLE. No change. ---------------------------------------------------------------------- Lomat - 10-20-2005 07:15 PDT ---------------------------------------------------------------------- Just downloaded the release and configured by hand using more or less the same configure options as the freeBSD port... Same issue, keeps crashing with packet size too big message. Removed the --with-tcp-wrappers configure directive and everything now seems to work without issue. My current configure command is... ./configure \ --enable-smartalloc \ --with-working-dir=/var/db/bacula \ --with-scriptdir=/usr/local/share/bacula \ --with-fd-user=root \ --with-fd-group=wheel \ --with-dir-user=bacula \ --with-dir-group=bacula \ --with-sd-user=bacula \ --with-sd-group=operator \ --with-readline=/usr/include/readline \ --disable-conio \ --with-mysql=yes \ --prefix=/usr/local Below is the contents on the config.out file... Configuration on Thu Oct 20 15:02:58 BST 2005: Host: i386-unknown-freebsd5.4 -- freebsd 5.4-STABLE Bacula version: 1.36.3 (22 April 2005) Source code location: . Install binaries: /usr/local/sbin Install config files: /usr/local/etc Scripts directory: /usr/local/share/bacula Working directory: /var/db/bacula PID directory: /var/run Subsys directory: /var/run/subsys C Compiler: gcc 3.4.2 C++ Compiler: g++ 3.4.2 Compiler flags: -g -O2 -Wall Linker flags: -O Libraries: -lpthread -lxpg4 Statically Linked Tools: no Statically Linked FD: no Statically Linked SD: no Statically Linked DIR: no Statically Linked CONS: no Database type: MySQL Database lib: -L/usr/local/lib/mysql -lmysqlclient_r -lz Job Output Email: root@localhost Traceback Email: root@localhost SMTP Host Address: localhost Director Port: 9101 File daemon Port: 9102 Storage daemon Port: 9103 Director User: bacula Director Group: bacula Storage Daemon User: bacula Storage DaemonGroup: operator File Daemon User: root File Daemon Group: wheel SQL binaries Directory /usr/local/bin Large file support: yes Bacula conio support: no -lreadline -ltermcap readline support: yes TCP Wrappers support: no ZLIB support: yes enable-smartalloc: yes enable-gnome: no enable-wx-console: no enable-tray-monitor: client-only: no ACL support: no ---------------------------------------------------------------------- kern - 10-25-2005 00:28 PDT ---------------------------------------------------------------------- Most likely you have some other program such as an HP Printer that is using port 9101, and this is causing a problem. You might be able to get more information by doing a netstat before and after starting bacula. Also, you will need to run run the the director and bconsole with -d100. If the Director is dying, you will need to run it under GDB to get a traceback -- see the Kaboom chapter of the manual. ---------------------------------------------------------------------- Lomat - 10-25-2005 01:46 PDT ---------------------------------------------------------------------- Recompiled with the --with-tcp-wrappers option to get back crashing version and started only the director. Hope the following helps... $ sockstat -l USER COMMAND PID FD PROTO LOCAL ADDRESS FOREIGN ADDRESS mysql mysqld 481 3 stream /tmp/mysql.sock root sendmail 407 4 tcp4 127.0.0.1:25 *:* root sshd 401 3 tcp6 *:22 *:* root sshd 401 4 tcp4 *:22 *:* root syslogd 274 3 dgram /var/run/log root syslogd 274 4 dgram /var/run/logpriv root devd 256 4 stream /var/run/devd.pipe $ ./bacula-dir -u root -g wheel -v -c /usr/local/etc/bacula-dir.conf -d 100 bacula-dir: dird.c:131 Debug level = 100 bacula-dir: mysql.c:81 db_open first time bacula-dir: mysql.c:141 mysql_init done bacula-dir: mysql.c:161 mysql_real_connect done bacula-dir: mysql.c:163 db_user=bacula db_name=bacula_as6911 db_password=***** lynott# lynott-dir: jcr.c:82 read_last_jobs seek to 188 lynott-dir: jcr.c:89 Read num_items=1 lynott-dir: bnet_server.c:88 Addresses host[ipv4:0.0.0.0:9101] $ sockstat -l USER COMMAND PID FD PROTO LOCAL ADDRESS FOREIGN ADDRESS root bacula-dir 10415 4 tcp4 *:9101 *:* mysql mysqld 481 3 stream /tmp/mysql.sock root sendmail 407 4 tcp4 127.0.0.1:25 *:* root sshd 401 3 tcp6 *:22 *:* root sshd 401 4 tcp4 *:22 *:* root syslogd 274 3 dgram /var/run/log root syslogd 274 4 dgram /var/run/logpriv root devd 256 4 stream /var/run/devd.pipe $ gdb -quiet -batch -x /usr/local/share/bacula/btraceback.gdb ./bacula-dir 10415 > /mnt/data1/dir.debug.log $ ./bconsole -c /usr/local/etc/bconsole.conf -d 100 Connecting to Director lynott.as6911.net:9101 bconsole: bnet.c:698 Current host[ipv4:192.168.1.198:9101] All host[ipv4:192.168.1.198:9101] bconsole: bnet.c:952 who=Director daemon host=lynott.as6911.net port=9101 bconsole: btimers.c:169 Start bsock timer 0x80751d8 tid=0x8072000 for 300 secs at 1130229459 25-Oct 09:37 bconsole: Fatal error: bnet.c:209 Packet size too big from "Director daemon:lynott.as6911.net:9101. Terminating connection. bconsole: btimers.c:183 Stop bsock timer 0x80751d8 tid=0x8072000 at 1130229464. Director authorization problem. Most likely the passwords do not agree. Please see http://www.bacula.org/html-manual/faq.html#AuthorizationErrors for help. ERR= $ sockstat -l USER COMMAND PID FD PROTO LOCAL ADDRESS FOREIGN ADDRESS mysql mysqld 481 3 stream /tmp/mysql.sock root sendmail 407 4 tcp4 127.0.0.1:25 *:* root sshd 401 3 tcp6 *:22 *:* root sshd 401 4 tcp4 *:22 *:* root syslogd 274 3 dgram /var/run/log root syslogd 274 4 dgram /var/run/logpriv root devd 256 4 stream /var/run/devd.pipe $ cat /mnt/data1/dir.debug.log [Switching to LWP 100133] 0x28154353 in pthread_testcancel () from /usr/lib/libpthread.so.1 $1 = "lynott-dir", '\0' <repeats 19 times> $2 = 0x80c1018 "bacula-dir" $3 = 0x80b6618 "/root/bacula-1.36.3/src/dird" $4 = "MySQL" $5 = 0x80a3fbc "1.36.3 (22 April 2005)" $6 = 0x809ccff "i386-unknown-freebsd5.4" $7 = 0x809ccf7 "freebsd" $8 = 0x809ccec "5.4-STABLE" http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x28154353 in pthread_testcancel () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000001 0x2814cc05 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000002 0x28150be0 in pthread_setconcurrency () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000003 0x2814c195 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000004 0x2815227d in _pthread_cond_timedwait () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000005 0x281524b9 in pthread_cond_timedwait () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000006 0x0808f0eb in watchdog_thread (arg=0x0) at watchdog.c:289 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000007 0x28140b09 in pthread_create () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000008 0x282f384b in _ctx_start () from /lib/libc.so.5 Thread 5 (LWP 100133): http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x28154353 in pthread_testcancel () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000001 0x2814cc05 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000002 0x28150be0 in pthread_setconcurrency () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000003 0x2814c195 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000004 0x2815227d in _pthread_cond_timedwait () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000005 0x281524b9 in pthread_cond_timedwait () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000006 0x0808f0eb in watchdog_thread (arg=0x0) at watchdog.c:289 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000007 0x28140b09 in pthread_create () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000008 0x282f384b in _ctx_start () from /lib/libc.so.5 Thread 4 (Thread 4 (sleeping)): http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x2814c233 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000001 0x28147405 in _nanosleep () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000002 0x281474fe in nanosleep () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000003 0x08079534 in bmicrosleep (sec=60, usec=0) at bsys.c:59 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000004 0x0805d9e8 in wait_for_next_job (one_shot_job_to_run=0x0) at scheduler.c:101 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000005 0x0804c12e in main (argc=0, argv=0x809189d) at dird.c:244 Thread 3 (Thread 3 (LWP 100134)): http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x28154353 in pthread_testcancel () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000001 0x2814c92e in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000002 0x00000000 in ?? () Thread 2 (Thread 2 (runnable)): http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x282f3848 in _ctx_start () from /lib/libc.so.5 Thread 1 (Thread 1 (sleeping)): http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x2814c233 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000001 0x2814c195 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000002 0x2815227d in _pthread_cond_timedwait () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000003 0x281524b9 in pthread_cond_timedwait () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000004 0x0808f0eb in watchdog_thread (arg=0x0) at watchdog.c:289 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000005 0x28140b09 in pthread_create () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000006 0x282f384b in _ctx_start () from /lib/libc.so.5 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x28154353 in pthread_testcancel () from /usr/lib/libpthread.so.1 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000000 0x28154353 in pthread_testcancel () from /usr/lib/libpthread.so.1 No symbol table info available. http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000001 0x2814cc05 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 No symbol table info available. http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000002 0x28150be0 in pthread_setconcurrency () from /usr/lib/libpthread.so.1 No symbol table info available. http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000003 0x2814c195 in pthread_mutexattr_init () from /usr/lib/libpthread.so.1 No symbol table info available. http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000004 0x2815227d in _pthread_cond_timedwait () from /usr/lib/libpthread.so.1 No symbol table info available. http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000005 0x281524b9 in pthread_cond_timedwait () from /usr/lib/libpthread.so.1 No symbol table info available. http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000006 0x0808f0eb in watchdog_thread (arg=0x0) at watchdog.c:289 289 pthread_cond_timedwait(&timer, &timer_mutex, &timeout); Current language: auto; currently c++ errstat = 383 p = (watchdog_t *) 0x0 timeout = {tv_sec = 1130229381, tv_nsec = 343963000} tv = {tv_sec = 1130229351, tv_usec = 343963} tz = {tz_minuteswest = 0, tz_dsttime = 0} next_time = 1130229381 http://bugs.bacula.org/bug_view_advanced_page.php?bug_id=0000007 0x28140b09 in pthread_create () from /usr/lib/libpthread.so.1 No symbol table info available. ---------------------------------------------------------------------- kern - 10-25-2005 06:42 PDT ---------------------------------------------------------------------- Unfortunately, the information you sent me is not of much use. The debug listing with -d100 doesn't show the console trying to connect, and the gdb traceback doesn't show the program crashing. I think something very fundamental is wrong with your build or your conf files. I have personally built and run a number of versions of Bacula (mostly 1.37) on FreeBSD-5.4 and never seen any such problem. Another possibility is that you have a 64 bit processor and it is not being correctly detected. I suggest the best procedure is for you to ask the bacula-users list for help. It will be much more interactive. Bug History Date Modified Username Field Change ====================================================================== 10-19-05 01:58 Lomat New Bug 10-19-05 02:11 Lomat Bug Monitored: Lomat 10-20-05 00:23 Lomat Bugnote Added: 0001238 10-20-05 05:58 Lomat Bugnote Added: 0001239 10-20-05 07:15 Lomat Bugnote Added: 0001240 10-25-05 00:28 kern Bugnote Added: 0001241 10-25-05 00:28 kern Status new => feedback 10-25-05 01:46 Lomat Bugnote Added: 0001242 10-25-05 06:42 kern Bugnote Added: 0001247 10-25-05 06:42 kern Resolution open => unable to duplicate 10-25-05 06:42 kern Status feedback => closed ====================================================================== |