Re: [mpls-linux-general] RES: Zebra LDP Crash (Discovered source of problem)
Status: Beta
Brought to you by:
jleu
|
From: James R. L. <jl...@mi...> - 2002-11-01 15:15:23
|
After you patched zebra to created mplsd did you do a 'make distclean' fr= om the top level? Are you sure you're running the zebra binary created at t= he sametime as the mplsd binary? I will look into this failure, but it looks like it is in some code that = I don't mess with. Thank for the backtrace, this is very helpful. On Fri, Nov 01, 2002 at 12:08:14PM -0300, Pl=EDnio de Paula wrote: > Zebra stopped generating core dump (maybe because I upgraded the OS), b= ut problem persists... > I compiled zebra with debug info and here is the backtrace: > -------------------------------------- > Program received signal SIGSEGV, Segmentation fault. > 0x0805c8fb in rib_install_lower (rn=3D0x8087600, rib=3D0x8087638) at ri= b.c:661 > 661 if (rib->rib_table->rib_install_kernel) > (gdb) bt > #0 0x0805c8fb in rib_install_lower (rn=3D0x8087600, rib=3D0x8087638) a= t rib.c:661 > #1 0x0805cae0 in rib_process (rn=3D0x8087600, del=3D0x0) at rib.c:785 > #2 0x0804d6d6 in static_ipv4_add (p=3D0xbffff8c0, gate=3D0x0, > ifname=3D0x8087a20 "eth4", distance=3D1 '\001', table=3D0) at zrib.= c:297 > #3 0x0804da5a in static_ipv4_func (vty=3D0xbffff8b8, add_cmd=3D1, > dest_str=3D0x80870b8 "10.10.1.10/32", > mask_str=3D0x1 <Address 0x1 out of bounds>, gate_str=3D0x8087a20 "e= th4", > distance_str=3D0x0) at zrib.c:440 > #4 0x0804daa4 in ip_route (self=3D0x806b4e0, vty=3D0x8087240, argc=3D2= , argv=3D0x2) > at zrib.c:457 > #5 0x08054952 in cmd_execute_command_strict (vline=3D0x8087550, vty=3D= 0x8087240, > cmd=3D0x0) at command.c:1963 > #6 0x08054a7b in config_from_file (vty=3D0x8087240, fp=3D0x80870d0) > at command.c:2001 > #7 0x08051755 in vty_read_file (confp=3D0x80870d0) at vty.c:2079 > #8 0x08051a23 in vty_read_config (config_file=3D0x0, > config_current_dir=3D0x806b260 "zebra.conf", > config_default_dir=3D0x806b26b "/usr/local/etc/zebra.conf") at vty.= c:2266 > #9 0x0804b682 in main (argc=3D0, argv=3D0xbffffb54) at main.c:287 > #10 0x420158d4 in __libc_start_main () from /lib/i686/libc.so.6 > -------------------------------------- >=20 > This is my zebra.conf: > -------------------------------------- > hostname routerA >=20 > interface eth5 > description Fiber1000 Interface -> routerB > ip address 10.10.2.1/24 > shutdown >=20 > interface eth4 > description Fiber1000 Interface -> clientA > ip address 10.10.1.1/24 > no shutdown *** >=20 > interface eth3 > description Fiber100 Interface -> Optical Network > no shutdown >=20 > interface eth2 > description Fiber100 Interface -> Optical Network > no shutdown >=20 > interface eth1 > description Fiber100 Interface -> Optical Network > no shutdown >=20 > ip route 10.10.1.10/32 eth4 <- This causes seg fault if eth4 (***) is u= p > | Instantaneous seg fault if initially down then brought up within= zebra vty > | Same problem applies to Giga and Fast NICs (All opticals) > ---------------------------------------- >=20 >=20 > -----Mensagem original----- > De: James R. Leu [mailto:jl...@mi...] > Enviada em: quinta-feira, 31 de outubro de 2002 20:45 > Para: Pl=EDnio de Paula > Cc: mpl...@li... > Assunto: Re: RES: [mpls-linux-general] Zebra LDP Crash (Discovered > source of problem) >=20 >=20 > No one else is using static routes (that I know of). I know I've never > tried it. Do you get a core file? Give me the backtrace from it and I= 'll > try to fix it. >=20 > On Thu, Oct 31, 2002 at 06:48:07PM -0300, Pl=EDnio de Paula wrote: > > My configuration of zebra included static routes! With LDP patch they= cause zebra segmentation fault! > >=20 > > Without static routes, LDP-patched-zebra runs OK... > >=20 > > Is this happening with everybody? > >=20 > > See you! > >=20 > > Pl=EDnio de Paula > > UNICAMP/Brazil > >=20 > > -----Mensagem original----- > > De: James R. Leu [mailto:jl...@mi...] > > Enviada em: quinta-feira, 31 de outubro de 2002 15:11 > > Para: Pl=EDnio de Paula > > Cc: Gianfranco Delli Carri; mpl...@li... > > Assunto: Re: [mpls-linux-general] Zebra LDP Crash > >=20 > >=20 > > Do you have acore file? Can you get me the backtrace from the core d= ump? > >=20 > > On Thu, Oct 31, 2002 at 02:04:17PM -0300, Pl=EDnio de Paula wrote: > > > Hello Gianfranco, > > >=20 > > > I=B4m trying to compile zebra with LDP patch in the same configurat= ion as yours. The compilation goes OK, but > > > when I call zebra, it generates core dump. Have you crossed similar= problems? What did you do about them? > > >=20 > > > Pl=EDnio de Paula > > > UNICAMP > > >=20 > > > -----Mensagem original----- > > > De: Gianfranco Delli Carri [mailto:gf....@nc...] > > > Enviada em: quarta-feira, 30 de outubro de 2002 22:13 > > > Para: 'mpl...@li...' > > > Assunto: [mpls-linux-general] Zebra LDP session > > >=20 > > >=20 > > > Hi to all, > > >=20 > > > I have a linux box (2.4.19) patched with mpls-linux-1.170 and zebra= -0.93b > > > patched with ldp-portable-0.250. > > >=20 > > > When I a start mlpsd after zebra and ospfs, in my CISCO router MPLS= /LDP > > > enabled, I can see the LDP connection setting UP, but after few sec= ond (hold > > > timer) it come down. > > >=20 > > > Debugging MPLSD I can see: > > >=20 > > > /usr/local/sbin/mplsd > > > ldp_if_new: > > > 2002/10/31 02:00:24 MPLS: MPLSd (0.93b) starts > > > 2002/10/31 02:00:24 MPLS: interface add lo index 1 flags 73 metric = 1 mtu > > > 16436 > > > 2002/10/31 02:00:24 MPLS: address add 127.0.0.1 to interface lo > > > 2002/10/31 02:00:24 MPLS: interface add eth0 index 2 flags 4419 met= ric 1 mtu > > > 1500 > > > 2002/10/31 02:00:24 MPLS: address add 10.254.0.250 to interface eth= 0 > > > 2002/10/31 02:00:24 MPLS: router-id change 10.254.0.250 > > > 2002/10/31 02:00:24 MPLS: router-id update 10.254.0.250 > > > 2002/10/31 02:00:24 MPLS: router add 0.0.0.0/0 > > > 2002/10/31 02:00:24 MPLS: nexthop 10.254.0.1 > > > 2002/10/31 02:00:24 MPLS: ifindex 2=20 > > > session delete > > >=20 > > > Debugging CISCO LDP: > > >=20 > > > Oct 31 02:00:24.584 CET: ldp: Opening ldp conn; adj 0x67827E30, 10.= 254.2.6 > > > <-> 10.254.0.250 > > > Oct 31 02:00:24.584 CET: ldp: ldp conn is up; adj 0x67827E30, > > > 10.254.2.6:11439 <-> 10.254.0.250:646 > > > Oct 31 02:00:24.584 CET: ldp: Sent init msg to 10.254.0.250 (pp 0x0= ) > > > Oct 31 02:00:24.604 CET: ldp: ldp conn closed by peer; adj 0x67827E= 30 > > > 10.254.2.6:11439 <-> 10.254.0.250:646, FastEthernet0/0 > > > Oct 31 02:00:24.604 CET: ldp: Closing ldp conn 10.254.2.6:11439 <-> > > > 10.254.0.250:646, adj 0x67827E30 > > > Oct 31 02:00:29.588 CET: ldp: Opening ldp conn; adj 0x67827E30, 10.= 254.2.6 > > > <-> 10.254.0.250 > > > Oct 31 02:00:29.588 CET: ldp: ldp conn is up; adj 0x67827E30, > > > 10.254.2.6:11440 <-> 10.254.0.250:646 > > > Oct 31 02:00:29.588 CET: ldp: Sent init msg to 10.254.0.250 (pp 0x0= ) > > > Oct 31 02:00:29.600 CET: ldp: Rcvd init msg from 10.254.0.250 (pp 0= x0) > > > Oct 31 02:00:29.600 CET: ldp: Sent keepalive msg to 10.254.0.250:0 = (pp 0x0) > > > Oct 31 02:00:29.604 CET: ldp: Rcvd keepalive msg from 10.254.0.250:= 0 (pp > > > 0x0) > > > Oct 31 02:00:29.608 CET: ldp: Sent address msg to 10.254.0.250:0 (p= p > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:29.608 CET: ldp: Sent label mapping msg to 10.254.0.25= 0:0 (pp > > > 0x6225D768) > > > etc... > > > Oct 31 02:00:44.605 CET: ldp: Discovery hold timer expired for adj > > > 0x67827E30, 10.254.0.250:0, will close conn > > > Oct 31 02:00:44.605 CET: ldp: Sent notif msg to 10.254.0.250:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:44.605 CET: ldp: Sent notif msg to 10.254.0.250:0 (pp > > > 0x6225D768) > > > Oct 31 02:00:44.605 CET: ldp: Close LDP transport conn for adj 0x67= 827E30 > > > Oct 31 02:00:44.605 CET: ldp: Closing ldp conn 10.254.2.6:11440 <-> > > > 10.254.0.250:646, adj 0x67827E30 > > >=20 > > > Ah... my MPLSD process come to use all the CPU time: > > >=20 > > > ps aux > > > USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMM= AND > > > root 769 98.4 0.7 2076 904 pts/0 R 02:00 5:10 > > > /usr/local/sbin/mplsd > > >=20 > > > and I'm always unable to telnet on it, the session freeze. > > >=20 > > > telnet 10.254.0.250 2610 > > > Trying 10.254.0.250... > > > Connected to 10.254.0.250. > > > Escape character is '^]'. > > >=20 > > >=20 > > >=20 > > > Have you any kind of idea ? > > >=20 > > > Thanks in advance. > > >=20 > > > Regards, > > >=20 > > > Gianfranco > > >=20 > > >=20 > > > ------------------------------------------------------- > > > This sf.net email is sponsored by: Influence the future=20 > > > of Java(TM) technology. Join the Java Community=20 > > > Process(SM) (JCP(SM)) program now.=20 > > > http://ads.sourceforge.net/cgi-bin/redirect.pl?sunm0004en > > > _______________________________________________ > > > mpls-linux-general mailing list > > > mpl...@li... > > > https://lists.sourceforge.net/lists/listinfo/mpls-linux-general > > >=20 > > >=20 > > > ------------------------------------------------------- > > > This sf.net email is sponsored by: Influence the future=20 > > > of Java(TM) technology. Join the Java Community=20 > > > Process(SM) (JCP(SM)) program now.=20 > > > http://ads.sourceforge.net/cgi-bin/redirect.pl?sunm0004en > > > _______________________________________________ > > > mpls-linux-general mailing list > > > mpl...@li... > > > https://lists.sourceforge.net/lists/listinfo/mpls-linux-general > >=20 > > --=20 > > James R. Leu >=20 > --=20 > James R. Leu >=20 >=20 > ------------------------------------------------------- > This sf.net email is sponsored by: See the NEW Palm=20 > Tungsten T handheld. Power & Color in a compact size! > http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0001en > _______________________________________________ > mpls-linux-general mailing list > mpl...@li... > https://lists.sourceforge.net/lists/listinfo/mpls-linux-general --=20 James R. Leu |