I am trying to parse some particularly annoying HTML, and
unfortunately Neko is not handling it very well. In the HTML document,
I have links that look like this: "<a class=foo 1' href='/link/text'>1</a>"
Unfortunately, the "href" attribute gets dropped completely, and the
anchor ends up with a "class" attribute, and another attribute named "1"
whose value is an empty string.
Any suggestions?
This is pretty important for me to be able to parse, so I'm even willing
to patch the parser if necessary.
Thanks!!! :-)
-Donnie Pinkston
|