[Htmlparser-developer] Re: encoding issue
From: Amit R. <ami...@ya...> - 2002-11-22 03:30:32
> > outHTML.append(((HTMLStringNode)node).getText()
> > +"\n");

Instead, use the following:

    String string = new String(((HTMLStringNode)node).getText().getBytes("8859_4"), "newEncoding");

Here "newEncoding" stands for the charset the page is actually written in.

For reading Japanese pages I use
    newEncoding = Shift_JIS
For Chinese it can be (I have never tested this, though)
    newEncoding = BIG5 or Big5_HKSCS or GB2312

Somik,
That's why I suggested some time back that we have a function/method that tells the programmer which encoding htmlparser is using internally.

> We've got an expert on internationalization on this
> list..
> Amit -- do you have time to check this ?

I hope the first line was not meant for me :-)
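
The re-decoding trick above can be sketched as a small standalone program. This is only an illustration under stated assumptions: the class name Redecode and the helper redecode are hypothetical and not part of htmlparser, and the sketch assumes the text was first decoded as ISO-8859-1 (which maps every byte to exactly one character, so the original bytes can be recovered), rather than the "8859_4" used in the post. It also assumes the JDK in use ships the Shift_JIS charset.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class Redecode {
    // Hypothetical helper: turn the wrongly decoded string back into its
    // raw bytes using the charset that was assumed, then decode those
    // bytes again with the charset the page was really written in.
    static String redecode(String garbled, Charset assumed, Charset actual) {
        byte[] raw = garbled.getBytes(assumed);
        return new String(raw, actual);
    }

    public static void main(String[] args) {
        Charset sjis = Charset.forName("Shift_JIS");
        String original = "\u65E5\u672C\u8A9E"; // the word "Nihongo" in kanji

        // Simulate a parser that read Shift_JIS bytes as ISO-8859-1.
        String garbled = new String(original.getBytes(sjis), StandardCharsets.ISO_8859_1);

        String fixed = redecode(garbled, StandardCharsets.ISO_8859_1, sjis);
        System.out.println(fixed.equals(original));
    }
}
```

The round trip only works when the assumed charset can represent every byte value without loss, which is why knowing the encoding htmlparser uses internally matters here.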