htmlparser-announce Mailing List for HTML Parser (Page 4)
Brought to you by:
derrickoswald
|
From: Somik R. <so...@ya...> - 2002-07-08 01:06:01
|
Hi Folks,
The latest integration release (2002-07-07) is out, and has major improvements:
[1] 50% speed improvement over v1.1. The previous 1.2 versions had a slowdown bug that made them 20% slower than v1.1.
[2] Fixed a bug in HTMLScriptScanner, which would break on incorrect HTML inside the script code.
[3] Removed HTMLFormScanner from the standard registered scanners, as it has a bug: it cannot parse non-ended forms (it goes into an infinite loop).
Regards,
Somik
**********************************
Somik Raha
System Architect
Kizna Corporation
Hiroo ON Bldg. 2F, 5-19-9 Hiroo,
Shibuya-ku, Tokyo,
150-0012, JAPAN
Phone : +81-3-5475-2646
Fax : +81-3-3445-9089
Web : http://www.kizna.com
Mail : so...@ki...
**********************************
|
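The infinite loop mentioned in item [3] is the kind of failure a scanner can hit when it waits for a closing tag that never arrives. Below is a minimal sketch of a loop that stops at end of input instead. It assumes a hypothetical line-based input; `FormScanSketch` and `scanForm` are illustrative names and are not part of the HTMLParser API or the actual HTMLFormScanner.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.Locale;

public class FormScanSketch {
    /**
     * Collects lines up to and including the one containing "</form>".
     * If the input ends first (a non-ended form), returns what was gathered
     * so far instead of waiting for a closing tag that never comes.
     */
    public static String scanForm(Iterator<String> lines) {
        StringBuilder body = new StringBuilder();
        while (lines.hasNext()) { // end-of-input guard: the loop always terminates
            String line = lines.next();
            body.append(line).append('\n');
            if (line.toLowerCase(Locale.ROOT).contains("</form>")) {
                break; // normal case: closing tag found
            }
        }
        return body.toString();
    }

    public static void main(String[] args) {
        // A form that is never closed; the scan still returns.
        Iterator<String> input =
                Arrays.asList("<form action=\"x\">", "<input name=\"q\">").iterator();
        System.out.print(scanForm(input));
    }
}
```

The design choice here is to treat end of input as an implicit end tag, so a malformed page costs at most one pass over the remaining lines.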
|
From: Somik R. <so...@ya...> - 2002-06-30 12:15:36
|
Hi Folks,
Just to drop a note - this week's integration release is out and can be downloaded from http://htmlparser.sourceforge.net.
It contains a couple of bug fixes and small API changes.
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-06-22 06:59:18
|
Hi Folks,
Integration Release 2002-06-23 is out. You can get it from http://htmlparser.sourceforge.net. This release has bug fixes, two new scanners (HTMLFrameSetScanner, HTMLFormScanner), and an API change (toRawString() changed to toHTML()).
Regards
Somik
**********************************
Somik Raha
System Architect
Kizna Corporation
Hiroo ON Bldg. 2F, 5-19-9 Hiroo,
Shibuya-ku, Tokyo,
150-0012, JAPAN
Phone : +81-3-5475-2646
Fax : +81-3-3445-9089
Web : http://www.kizna.com
Mail : so...@ki...
**********************************
|
|
From: Somik R. <so...@ya...> - 2002-06-16 09:15:40
|
Hi Folks,
A new integration build is out. Major changes:
[1] HTMLStringNode now gives string blocks, all in one string node object, instead of several string node objects for continuous lines. This is based on a bug report by Gordon Deudney. This will improve the scalability of the parser.
[2] HTMLScriptScanner's scan method has been refactored. For folks writing new scanners, take a look at this method to see how simple it is to make your own scanners. There's a substantial reduction in the code size and complexity.
To do:
[1] Integrate Raghavendra Srimantula's scanners (Form and Frame) as soon as the test cases are available.
[2] Write a guide for writing your own scanners.
Regards,
Somik
|
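The change described in item [1], one string node per block of continuous text lines rather than one per line, can be illustrated roughly as follows. This is only a sketch under stated assumptions: tokens are plain strings, anything starting with "<" is treated as a tag, and `TextCoalescer` and `coalesce` are hypothetical names, not the library's HTMLStringNode implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TextCoalescer {
    /**
     * Merges each run of consecutive text lines into a single entry,
     * joined by newlines. Tag tokens (starting with '<') pass through
     * unchanged and end the current run of text.
     */
    public static List<String> coalesce(List<String> tokens) {
        List<String> out = new ArrayList<>();
        StringBuilder text = null; // current run of text, or null if none is open
        for (String token : tokens) {
            if (token.startsWith("<")) {
                if (text != null) {          // a tag closes the open text run
                    out.add(text.toString());
                    text = null;
                }
                out.add(token);
            } else if (text == null) {
                text = new StringBuilder(token);   // start a new run
            } else {
                text.append('\n').append(token);   // extend the current run
            }
        }
        if (text != null) {
            out.add(text.toString()); // flush trailing text
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("<p>", "first line", "second line", "</p>");
        // Four tokens go in; three come out: the tag, one merged text block, the end tag.
        System.out.println(coalesce(tokens).size());
    }
}
```

Fewer node objects per page is one plausible reading of the scalability benefit the announcement mentions.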
|
From: Somik R. <so...@ya...> - 2002-06-07 05:22:25
|
Hi Folks,
An integration build is out, incorporating 2 bug fixes in the script scanner, contributed by Wolfgang Germund.
Wolfgang --> Thanks a lot for the nice test cases that you submitted, and of course, the fixes. Incorporated and released in the latest package.
Release 1.2 is still some way off... need test cases for the form and frame scanners...
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-04-17 03:39:36
|
Hi Folks,
HTMLParser 1.1 has just been released. This is a production release - HTMLParser finally moves out of the beta stage.
A whole lot of bug fixes, architecture modifications, and intense testing have gone into it.
You can get it from http://htmlparser.sourceforge.net
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-03-22 16:39:55
|
Hi Folks,
Release 1.04 is out. It has the following bug fixes:
[1] Parsing JSP tags that had tags within inverted commas was causing problems.
[2] A link with no link URL would cause the parser to crash with a null pointer exception.
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-03-04 14:26:50
|
HTMLParser 1.03 has been released. It contains a bug fix in HTMLRemarkNode which was causing the parser to crash on pages with remarks going over one line. A test case for the bug has been added in HTMLRemarkNodeTest.
The release also contains the design documentation in the zip.
Regards
Somik
|
|
From: Somik R. <so...@ya...> - 2002-01-16 14:08:08
|
Hi Folks,
Check http://htmlparser.sourceforge.net for a totally new look. Design documentation with sample programs has been added.
Feedback is welcome.
Regards,
Somik
|
|
From: Emilio N. <ena...@at...> - 2002-01-10 18:34:33
|
Hello,
I tried to use the MailRipper application and obtained two different results depending on how I typed in the URL. By the way, I added a System.out.println() in the MailRipper to see what other tags I would obtain.
For example, I used the following site: http://www.translationlinks.com and then http://www.translationlinks.com/. As you can see, the only difference is the "/" at the end of the URL.
1. When I input http://www.translationlinks.com I obtained the following:
Ripping Site http://www.translationlinks.com
LINK TAG: http://www.translationmail.com
LINK TAG: http://www.translationlinks.com
2. When I input http://www.translationlinks.com/ I obtained the following:
Ripping Site http://www.translationlinks.com/
LINK TAG: in...@tr...
Why would I be receiving two different results for the same page?
Thank you in advance.
Emilio
|
|
From: Somik R. <so...@ya...> - 2002-01-09 16:35:55
|
Hi Folks,
Another bug was detected in HTMLStyleScanner, and has been immediately fixed. v1.02 has been released with this fix, and another one - which allows scanning of Finnish pages to proceed properly.
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-01-08 17:33:22
|
Hi Folks,
An important bug fix has been made. The parser was crashing on style tags - this has been fixed.
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-01-05 17:12:03
|
Hi Folks,
Sorry about that, the zip file that was uploaded seemed to be corrupted. It's fixed, and you should be able to download it now.
Regards,
Somik
|
|
From: Somik R. <so...@ya...> - 2002-01-04 06:15:49
|
Hi Folks,
A new year present - HTMLParser 1.0 is released. We've finally made the transition from alpha to beta stage. Modifications from here on will only be of a maintenance nature, and the API should remain constant.
There are huge changes in the architecture, and lots of bug fixes. Thanks a lot to Kaarle Kaaila for some great support and ideas. Thanks also to Rodney Foley, for some nice ideas for improvement. And thanks to everyone else who's been supporting this project.
Looking forward to your continuing support, and wishing you a very happy new year.
Cheers,
Somik
|