[Htmlparser-developer] Re: encoding issue
Brought to you by:
derrickoswald
|
From: Amit R. <ami...@ya...> - 2002-11-22 03:30:32
|
> > outHTML.append(((HTMLStringNode)node).getText()
> > +"\n");
instead use following
String string = new
String(((HTMLStringNode)node).getText().getBytes("8859_4"),"newEncoding");
for reading japanese pages i use
newEncoding = Shift_JIS
for chinese it can be (i have never tested though)
newEncoding = BIG5 or Big5_HKSCS or GB2312
Somik,
Thats why i suggested sometime back to have a
function/method to tell programmer what encoding
htmlparser is using internally.
> We've got an expert on internationalization on this
> list..
> Amit -- do you have time to check this ?
i hope first line was not for me :-)
__________________________________________________
Do you Yahoo!?
Yahoo! Mail Plus Powerful. Affordable. Sign up now.
http://mailplus.yahoo.com
|