I have a project on source forget that uses HTMLStreamTokenizer. (PVRBOT). I used to link to your site at http://www.do.org/products/parser/ to get it but this site appears to be dead.
I have looked here but this site looks pretty dead too. A few things in CVS but no downloads.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi there,
I have a project on source forget that uses HTMLStreamTokenizer. (PVRBOT). I used to link to your site at http://www.do.org/products/parser/ to get it but this site appears to be dead.
I have looked here but this site looks pretty dead too. A few things in CVS but no downloads.
If you know of a faster, more lightweight Java HTML parser, I would be very interested. Until then, HtmlStreamTokenizer lives on in my app.