Re: Fw: [Htmlparser-user] Bad formed web page
Brought to you by:
derrickoswald
From: R. <ced...@fr...> - 2002-06-27 10:16:47
|
Oh yes excuse me, it's another test :) I attach the new document. I've trimed another line of the page :) I don't run software with options, I use class with my own program, but it was just for testing :) I could make a fix when I find some time too :) I'll register sourceforge asap and I'll try to understand how CVS works :) For the moment I can send my fixes to the developer mailing list. Regards, Cedric. At 18:43 27/06/2002 +0900, you wrote: >Hi Cedric, >Thanks for this fix. But when I download the CVS version of HTMLParser and >try to parse the page again I get this error: >"java.lang.OutOfMemoryError > <<no stack trace available>> >Exception in thread "main" " > >Is-it normal ? Should I catch this error and write my own code around ? >Its highly abnormal... It should not happen - are you trying it with the >same piece of html ? Send me the data you are trying on. If its the same >page, it works perfectly on my end. I am running HTMLParser (main) with no >params except the file name. > >Other question, I can't run the software with two options. Is-it normal ? >Why don't you set the options before the name of the file to parse ? > >Yes, this is normal (a feature, not a bug). This is bcos the options are >intended only as a demo, and I didnt think it'd really be of use to >people. Are you actually using it this way ? Also, another thing is I am >not full time on this, so I'd be grateful if you can join up as a >developer and make this fix. > >All code recieved from developers is acknowledged both in the code, and >the Contributors page that goes out with each release. You can send me >your sourceforge id and I can add you as a developer. > >It can be used like this: >public HTMLStringNode(String text,int textBegin,int textEnd) >{ > NormalizeHtmlCode normalizer = new NormalizeHtmlCode(); > this.text = normalizer.html2text(text); > this.textBegin = textBegin; > this.textEnd = textEnd; >} >You can implement it with the meta-tags, ... > >This is cool. I think it will be useful in the toPlainString() method, >where we can get the actual meaningful text out. I'd be glad to include >this as soon as I find some time. Or Tariq can also join as a developer >and I can give him CVS access to do it. > >Thanks a lot for your participation. > >Cheers, >Somik |