Hi Martin,
You're absolutely correct -- it's because of the incorrect closing (at least
that's what it seems to be), and it occurs multiple times.
Thanks for letting me know -- I ended up writing a quick regex script to
extract this stuff, so it works in its own special way, now. :)
Thanks again,
Wojciech
On Mon, Mar 17, 2008 at 5:41 AM, Martin Sturm <mst...@gm...> wrote:
> 2008/3/11, Wojciech Gryc <woj...@gm...>:
>
> > Specifically, the link appears in the page, but surrounds an image, like
> so:
> >
> > <p><a href="http://feeds.feedburner.com/mydd"><img
> > src="..."></a>
> >
> > When I use a basic tag name filter, the actual tag above doesn't get
> > returned (while other <a> tag links do)... I've been playing around with
> the
> > code and don't know where to go from here. Is it because it surrounds an
> > image? Is there anything I can do to fix this?
>
> If it is only a problem with this particular piece of HTML, then it
> could be useful to post it verbatim. Looking to the part you have in
> your e-mail, it seems that the img-tag is not properly closed (it
> should be <img src="..." /> if it is a XHTML document, that is).
> However, I'm not sure if HTMLParser just ignores this... from previous
> expieriences, I noted that HTMLParser is pretty strict when it comes
> to standard compliance.
>
> --
> Martin Sturm
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.
> http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
> _______________________________________________
> Htmlparser-user mailing list
> Htm...@li...
> https://lists.sourceforge.net/lists/listinfo/htmlparser-user
>
--
Five Minutes to Midnight:
Youth on human rights and current affairs
http://www.fiveminutestomidnight.org/
|