how to deal with the EncodingChangeException ?
Status: Beta
Brought to you by:
chousz
I am parsing some html web pages.
most of html pages could be parsed successfully.
but throw a EncodingChangeException from the method
'visitAllNodesWith(ObjectFindingVisitor visitor) when I
pase a EUC-CN encoded html web pages. Exception
details follows:
character mismatch (new: ? != old: 福) for encoding
change from EUC-CN to ISO-8859-1 at character offset
115610
how can I sovle the problem to get the page info by
htmlParser?
thanks.
Logged In: YES
user_id=349759
I guess you're using another parser.