#515 XML_ERROR_UNCLOSED_TOKEN when parse some Chinese characters

Test Required
closed-invalid
nobody
5
2017-09-11
2013-08-28
paccautj
No

Hi,

I am using expat-1.95.5 OR expat-2.1.0 on Windows XP SP3.

I have a "XML_ERROR_UNCLOSED_TOKEN" error when I parse an XML file that contains particular Chinese characters :
级 (0xe7 0xB4 0x9A)
通 (0xe9 0x80 0x9A)
Without these characters, all others Chinese characters are well parsed !
I used encoding "UTF-16"

Do you have any clue ?

Thx

Discussion

  • paccautj

    paccautj - 2013-08-29

    Another infos :
    These Chinese characters in UTF-16 are encode like this :
    级 (0x1A 0x7D)
    通 (0x1A 0x90)
    The problem seems to be the "0x1A" character which is the EOF character and is an illegal character according to the XML specification

     
  • Sebastian Pipping

    • status: open --> closed-invalid
     

Log in to post a comment.