Re: [Htmlparser-developer] RE: [Htmlparser-user] Integration Release 1.3-20030323 is out

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Hi Marc,
    I will look into it soon - I am in a conference right now, but should be
on it this weekend. Meanwhile you are free to analyze it, and tell me
anything that you find.

Regards,
Somik
----- Original Message -----
From: "Marc Novakowski" <ma...@ke...>
To: <htm...@li...>
Sent: Monday, March 24, 2003 5:42 PM
Subject: [Htmlparser-developer] RE: [Htmlparser-user] Integration Release
1.3-20030323 is out

By the way, I've entered the OOM exception as a bug (#709152), along with a
simple program that reproduces it.

Marc

-----Original Message-----
From: Marc Novakowski
Sent: Monday, March 24, 2003 3:23 PM
To: htm...@li...
Subject: RE: [Htmlparser-user] Integration Release 1.3-20030323 is out

Somik,

Thanks for fixing 702614!  Unfortunately I can't seem to get the latest
build to work.  It's throwing an OOM exception in my own code when using the
NodeIterator returned by parser.elements().  I'm looking into this to make
sure I'm not doing something stupid in my code.  However, the library seems
to be acting differently than previous releases even out-of-the-box.  For
example, the following used to return a list of the links on Yahoo (in the
0302 release):

java -jar ./htmlparser.jar http://www.yahoo.com -l

In the 0323 release, however, it returns nothing.

Marc

-----Original Message-----
From: Somik Raha [mailto:so...@ya...]
Sent: Sunday, March 23, 2003 5:24 PM
To: HTMLParser Announcement List; HTMLParser User List; HTMLParser
Developer List
Subject: [Htmlparser-user] Integration Release 1.3-20030323 is out

Hi Folks,
    This week's integration release has two important fixes :

Integration build 1.3 - 20030323
--------------------------------
[1] Fixed bug 702547 - single quotes parsed more robustly now
[2] Fixed bug 702614 - empty tags handled correctly now. Tag now has a
method isEmptyXmlTag().

#2 refers to tags like <tag/>.

Thanks to Joe Robbins for a fine bug report that helped in putting in the
fix for #1 faster. Thanks also to Marc Novakowski for the other report.

Thanks are also due to Huang-Chun Yu for uncovering a serious bug with the
script scanning mechanism. The parser can currently handle script tags like
:

<script>
<!--
    code here
-->
</script>

But when the tags are like:
<script>
    code here
</script>

the parser is unable to identify the code and treats it like regular tags.
Such pages are quite widespread and ought to be supported. I was curious if
anyone has ideas on solving this - given the existing design - fresh ideas
often lead to a better perspective. If you have some ideas, feel free to
join the developer list
(http://lists.sourceforge.net/lists/listinfo/htmlparser-developer) and post.

Regards,
Somik

-------------------------------------------------------
This SF.net email is sponsored by:Crypto Challenge is now open!
Get cracking and register here for some mind boggling fun and
the chance of winning an Apple iPod:
http://ads.sourceforge.net/cgi-bin/redirect.pl?thaw0031en
_______________________________________________
Htmlparser-user mailing list
Htm...@li...
https://lists.sourceforge.net/lists/listinfo/htmlparser-user

-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Htmlparser-user mailing list
Htm...@li...
https://lists.sourceforge.net/lists/listinfo/htmlparser-user

-------------------------------------------------------
This SF.net email is sponsored by:
The Definitive IT and Networking Event. Be There!
NetWorld+Interop Las Vegas 2003 -- Register today!
http://ads.sourceforge.net/cgi-bin/redirect.pl?keyn0001en
_______________________________________________
Htmlparser-developer mailing list
Htm...@li...
https://lists.sourceforge.net/lists/listinfo/htmlparser-developer