I'm parsing an HTML page with the following image defintion:
<img src="dot_blue.gif" width="3" height="3" alt="" style="margin-top: 6px; margin-left: 3px;">
When accessing the tag's attributes with tag.getAttributes() the result is
{$<TAGNAME>$=IMG, ALT=, WIDTH=3, SRC=dot_blue.gif, HEIGHT=3}
The STYLE attribute is missing. Anyone an idea ahy?
Thanks in advance for any kind of hint!
cheers
wolfgang
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
...sorry, the reason it's not being attended to is that this problem is obviated by the new lexer code added recently.
This will take another few weeks to release though, unless you want to grab the latest snapshot from CVS.
I'm parsing an HTML page with the following image defintion:
<img src="dot_blue.gif" width="3" height="3" alt="" style="margin-top: 6px; margin-left: 3px;">
When accessing the tag's attributes with tag.getAttributes() the result is
{$<TAGNAME>$=IMG, ALT=, WIDTH=3, SRC=dot_blue.gif, HEIGHT=3}
The STYLE attribute is missing. Anyone an idea ahy?
Thanks in advance for any kind of hint!
cheers
wolfgang
This is logged as bug #805598 attribute src in tag img sometimes not correctly parsed
http://sourceforge.net/tracker/index.php?func=detail&aid=805598&group_id=24399&atid=381399
which is a duplicate of bug #778781 SRC-attribute suppression in IMG-tags
http://sourceforge.net/tracker/index.php?func=detail&aid=778781&group_id=24399&atid=381399
...sorry, the reason it's not being attended to is that this problem is obviated by the new lexer code added recently.
This will take another few weeks to release though, unless you want to grab the latest snapshot from CVS.
See also recent bug #832530 empty attribute causes parser to fail, that has a suggested fix...
https://sourceforge.net/tracker/?func=detail&atid=381399&aid=832530&group_id=24399
Derrick