htmlparser-announce Mailing List for HTML Parser (Page 4)
Brought to you by:
derrickoswald
From: Somik R. <so...@ya...> - 2002-07-08 01:06:01
Hi Folks,
The latest integration release (2002-07-07) is out, and has major improvements:
[1] 50% speed improvement over v1.1. The previous 1.2 versions had a slowdown bug that made them 20% slower than v1.1.
[2] Fixed a bug in HTMLScriptScanner, which would break on incorrect HTML inside the script code.
[3] Removed HTMLFormScanner from the standard registered scanners, as it has a bug: it cannot parse non-ended forms (it goes into an infinite loop).
Regards,
Somik
**********************************
Somik Raha
System Architect
Kizna Corporation
Hiroo ON Bldg. 2F, 5-19-9 Hiroo, Shibuya-ku, Tokyo, 150-0012, JAPAN
Phone : +81-3-5475-2646
Fax : +81-3-3445-9089
Web : http://www.kizna.com
Mail : so...@ki...
**********************************
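The infinite loop on non-ended forms mentioned in item [3] is a common hazard for any scanner that reads until it sees an end tag. The following is a minimal sketch of the usual guard, not the actual HTMLFormScanner code: the class FormScanSketch and the method collectForm are hypothetical names, and the page is assumed to be available as a list of lines. The loop is bounded by the length of the input, so a missing </form> ends the scan at the last line instead of looping forever.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; this is not the HTMLFormScanner code from the library.
class FormScanSketch {

    /**
     * Collects the lines of a form that opens at index start.
     * Stops at the first line containing "</form>" (case-insensitive)
     * or at the end of the input, whichever comes first.
     */
    static List<String> collectForm(List<String> lines, int start) {
        List<String> body = new ArrayList<>();
        for (int i = start; i < lines.size(); i++) {
            String line = lines.get(i);
            body.add(line);
            if (line.toLowerCase().contains("</form>")) {
                break; // normal termination: end tag found
            }
        }
        return body; // also reached when the input runs out first
    }

    public static void main(String[] args) {
        // A form that is never closed.
        List<String> page = Arrays.asList(
                "<form action=\"go\">",
                "<input name=\"q\">",
                "<p>no end tag</p>");
        System.out.println(collectForm(page, 0));
    }
}
```

A real scanner would also have to decide what to report for the unterminated form; this sketch simply returns everything it consumed.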
From: Somik R. <so...@ya...> - 2002-06-30 12:15:36
Hi Folks,
Just to drop a note: this week's integration release is out and can be downloaded from http://htmlparser.sourceforge.net. It contains a couple of bug fixes and small API changes.
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-06-22 06:59:18
Hi Folks,
Integration Release 2002-06-23 is out. You can get it from http://htmlparser.sourceforge.net. This release has bug fixes, two new scanners (HTMLFrameSetScanner, HTMLFormScanner), and an API change (toRawString() changed to toHTML()).
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-06-16 09:15:40
Hi Folks,
A new integration build is out. Major changes:
[1] HTMLStringNode now gives string blocks, all in one string node object, instead of several string node objects for continuous lines. This is based on a bug report by Gordon Deudney. This will improve the scalability of the parser.
[2] HTMLScriptScanner's scan method has been refactored. For folks writing new scanners, take a look at this method to see how simple it is to make your own scanners. There's a substantial reduction in the code size and complexity.
To do:
[1] Integrate Raghavendra Srimantula's scanners (Form and Frame) as soon as the test cases are available.
[2] Write a guide for writing your own scanners.
Regards,
Somik
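The change in item [1], one string node for a run of continuous text lines instead of one node per line, can be pictured with a small sketch. It does not use the library's HTMLStringNode class: StringNodeSketch and coalesce are hypothetical names, nodes are represented as plain strings, and a line is treated as a tag simply when it starts with '<', which is an assumption made only to keep the example short.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; nodes are plain strings, not the library's node classes.
class StringNodeSketch {

    /**
     * Merges each run of consecutive text lines into a single entry
     * (joined with '\n'). Lines that start with '<' are treated as tags
     * and kept as separate entries.
     */
    static List<String> coalesce(List<String> lines) {
        List<String> nodes = new ArrayList<>();
        List<String> pending = new ArrayList<>(); // text lines not yet emitted
        for (String line : lines) {
            if (line.trim().startsWith("<")) {
                flush(pending, nodes);
                nodes.add(line);
            } else {
                pending.add(line);
            }
        }
        flush(pending, nodes);
        return nodes;
    }

    // Emits the pending text lines as one block and clears the buffer.
    private static void flush(List<String> pending, List<String> nodes) {
        if (!pending.isEmpty()) {
            nodes.add(String.join("\n", pending));
            pending.clear();
        }
    }

    public static void main(String[] args) {
        List<String> page = Arrays.asList("<p>", "first line", "second line", "</p>");
        // Three entries: the opening tag, one merged text block, the closing tag.
        System.out.println(coalesce(page).size());
    }
}
```

Fewer, larger text nodes mean fewer objects per page, which is consistent with the scalability benefit the announcement mentions.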
From: Somik R. <so...@ya...> - 2002-06-07 05:22:25
Hi Folks,
An integration build is out, incorporating 2 bug fixes in the script scanner, contributed by Wolfgang Germund.
Wolfgang --> Thanks a lot for the nice test cases that you submitted, and of course, the fixes. Incorporated and released in the latest package.
Release 1.2 is still some way off... need test cases for the form and frame scanners...
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-04-17 03:39:36
Hi Folks,
HTMLParser 1.1 has just been released. This is a production release: HTMLParser finally moves out of the beta stage.
A whole lot of bug fixes, architecture modifications, and intense testing have been done.
You can get it from http://htmlparser.sourceforge.net
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-03-22 16:39:55
Hi Folks,
Release 1.04 is out. It has the following bug fixes:
[1] Parsing JSP tags that had tags within inverted commas was causing problems.
[2] A link with no link URL would cause the parser to crash with a null pointer exception.
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-03-04 14:26:50
HTMLParser 1.03 has been released. It contains a bug fix in HTMLRemarkNode, which was causing the parser to crash on pages with remarks going over one line. A test case for the bug has been added in HTMLRemarkNodeTest.
The release also contains the design documentation in the zip.
Regards,
Somik
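A remark (HTML comment) that runs over more than one line is easy to mishandle when a parser works line by line. The following is a minimal sketch of one way to cope with it, assuming the page is held as an array of lines; it is not the HTMLRemarkNode implementation, and RemarkSketch and firstRemark are hypothetical names. The lines are joined first, so the search for the closing "-->" is not limited to the line on which "<!--" appears.

```java
// Hypothetical sketch; this is not the library's HTMLRemarkNode code.
class RemarkSketch {

    /**
     * Returns the text between the first "<!--" and the next "-->",
     * even when they are on different lines. Returns null when there is
     * no complete remark, rather than failing.
     */
    static String firstRemark(String[] lines) {
        String page = String.join("\n", lines);
        int open = page.indexOf("<!--");
        if (open < 0) {
            return null; // no remark at all
        }
        int close = page.indexOf("-->", open + 4);
        if (close < 0) {
            return null; // remark is never closed
        }
        return page.substring(open + 4, close);
    }

    public static void main(String[] args) {
        String[] page = { "<!-- a remark that", "continues here -->", "<p>text</p>" };
        System.out.println(firstRemark(page));
    }
}
```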
From: Somik R. <so...@ya...> - 2002-01-16 14:08:08
Hi Folks,
Check http://htmlparser.sourceforge.net for a totally new look. Design documentation with sample programs has been added.
Feedback is welcome.
Regards,
Somik
From: Emilio N. <ena...@at...> - 2002-01-10 18:34:33
Hello,
I tried to use the MailRipper application and obtained two different results depending on how I typed in the URL. By the way, I added a System.out.println() in the MailRipper to see what other tags I would obtain.
For example, I used the following site: http://www.translationlinks.com and then http://www.translationlinks.com/. As you can see, the only difference is the "/" at the end of the URL.
1. When I input http://www.translationlinks.com I obtained the following:
Ripping Site http://www.translationlinks.com
LINK TAG: http://www.translationmail.com
LINK TAG: http://www.translationlinks.com
2. When I input http://www.translationlinks.com/ I obtained the following:
Ripping Site http://www.translationlinks.com/
LINK TAG: in...@tr...
Why would I be receiving two different results for the same page?
Thank you in advance.
Emilio
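As background to the question above, the two inputs are not the same string to a URL-handling library, even though browsers usually show the same page for both. The sketch below uses only java.net.URI from the JDK; the class TrailingSlashSketch and the method pathOf are hypothetical names. It shows that the path component is empty in one case and "/" in the other. Whether this difference is what causes the MailRipper results is not established anywhere in this thread.

```java
import java.net.URI;

// Hypothetical sketch; it inspects the two URLs and does not call MailRipper.
class TrailingSlashSketch {

    /** Returns the path component of the given URL as java.net.URI parses it. */
    static String pathOf(String url) {
        return URI.create(url).getPath();
    }

    public static void main(String[] args) {
        String withoutSlash = "http://www.translationlinks.com";
        String withSlash = "http://www.translationlinks.com/";

        // Brackets make an empty path visible in the output.
        System.out.println("[" + pathOf(withoutSlash) + "]");
        System.out.println("[" + pathOf(withSlash) + "]");

        // The two URI objects do not compare equal, since their paths differ.
        System.out.println(URI.create(withoutSlash).equals(URI.create(withSlash)));
    }
}
```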
From: Somik R. <so...@ya...> - 2002-01-09 16:35:55
Hi Folks,
Another bug was detected in HTMLStyleScanner, and has been immediately fixed. v1.02 has been released with this fix and one other, which allows scanning of Finnish pages to proceed properly.
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-01-08 17:33:22
Hi Folks,
An important bug has been fixed: the parser was crashing on style tags.
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-01-05 17:12:03
Hi Folks,
Sorry about that, the zip file that was uploaded seemed to be corrupted. It's fixed, and you should be able to download it now.
Regards,
Somik
From: Somik R. <so...@ya...> - 2002-01-04 06:15:49
Hi Folks,
A new year present: HTMLParser 1.0 is released. We've finally made the transition from alpha to a beta stage. Modifications from now on will only be of a maintenance nature, and the API should remain constant.
There are huge changes in the architecture, and lots of bug fixes. Thanks a lot to Kaarle Kaaila for some great support and ideas. Thanks also to Rodney Foley for some nice ideas for improvement. And thanks to everyone else who's been supporting this project.
Looking forward to your continuing support, and wishing you a very happy new year.
Cheers,
Somik