Re: [Htmlparser-user] <a> tags?
From: Wojciech G. <woj...@gm...> - 2008-03-19 21:19:54
Hi Martin,

You're absolutely correct -- it's because of the incorrect closing (at least that's what it seems to be), and it occurs multiple times.

Thanks for letting me know -- I ended up writing a quick regex script to extract this stuff, so it works in its own special way, now. :)

Thanks again,
Wojciech

On Mon, Mar 17, 2008 at 5:41 AM, Martin Sturm <mst...@gm...> wrote:
> 2008/3/11, Wojciech Gryc <woj...@gm...>:
> > Specifically, the link appears in the page, but surrounds an image, like so:
> >
> > <p><a href="http://feeds.feedburner.com/mydd"><img
> > src="..."></a>
> >
> > When I use a basic tag name filter, the actual tag above doesn't get
> > returned (while other <a> tag links do)... I've been playing around with the
> > code and don't know where to go from here. Is it because it surrounds an
> > image? Is there anything I can do to fix this?
>
> If it is only a problem with this particular piece of HTML, then it
> could be useful to post it verbatim. Looking at the part you have in
> your e-mail, it seems that the img tag is not properly closed (it
> should be <img src="..." /> if it is an XHTML document, that is).
> However, I'm not sure if HTMLParser just ignores this... from previous
> experiences, I noted that HTMLParser is pretty strict when it comes
> to standards compliance.
>
> --
> Martin Sturm

--
Five Minutes to Midnight: Youth on human rights and current affairs
http://www.fiveminutestomidnight.org/
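
For anyone landing on this thread with the same problem, a regex workaround of the kind mentioned above might look roughly like the sketch below. The original script was never posted, so this is only a guess at the approach: it uses plain java.util.regex rather than the HTMLParser API, and the names LinkExtractor and extractHrefs are made up for illustration. It assumes the href value is quoted and makes no attempt to handle unquoted attributes, comments, or script content.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of HTMLParser: pulls href values out of
// <a> start tags with a regular expression, ignoring how the tags nest
// or whether neighbouring tags such as <img> are properly closed.
final class LinkExtractor {

    // Matches an <a ...> start tag and captures the quoted href value.
    // Group 1 is the quote character, group 2 is the URL.
    private static final Pattern A_HREF = Pattern.compile(
            "<a\\b[^>]*?\\bhref\\s*=\\s*([\"'])(.*?)\\1",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns every quoted href found in <a> start tags, in document order.
    static List<String> extractHrefs(String html) {
        List<String> hrefs = new ArrayList<>();
        Matcher m = A_HREF.matcher(html);
        while (m.find()) {
            hrefs.add(m.group(2));
        }
        return hrefs;
    }
}
```

Calling LinkExtractor.extractHrefs on the snippet quoted in this thread should pick up the feedburner link even though the img tag is left unclosed, because the pattern only looks at the <a> start tag itself. That tolerance cuts both ways: a regex cannot tell a real link from one inside a comment, so a parser-based filter is still the better tool when the markup is well formed.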