Supplementary unicode characters are not handled properly

Status: Beta

Brought to you by: crystall

#64 Supplementary unicode characters are not handled properly

Status: open

Owner: Gabriele Svelto

Labels: virtual machine (43)

Priority: 1

Updated: 2011-03-29

Created: 2011-03-29

Creator: Gabriele Svelto

Private: No

Java class files can contain modified UTF-8 strings that represent characters with code points above U+FFFF. Each such character is encoded as a pair of 3-byte sequences, one for each UTF-16 surrogate. The JVM does not parse these characters properly: it currently skips over them, treating each pair as two distinct characters.
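
For illustration, here is a minimal sketch of how such a pair could be recombined into a single code point. It is written in Python for brevity and is not the VM's actual code; the function name decode_supplementary and its return convention are hypothetical. The bit layout follows the modified UTF-8 description for CONSTANT_Utf8_info entries in the JVM specification.

```python
def decode_supplementary(data, pos):
    """Decode one supplementary character stored as a surrogate pair.

    Hypothetical helper, for illustration only. Returns (code_point, next_pos),
    or None if the six bytes at pos are not a high/low surrogate pair.
    """
    if pos + 6 > len(data):
        return None
    u, v, w, x, y, z = data[pos:pos + 6]
    # High surrogate (U+D800..U+DBFF): 0xED, 1010vvvv, 10wwwwww
    # Low surrogate  (U+DC00..U+DFFF): 0xED, 1011yyyy, 10zzzzzz
    if (u != 0xED or (v & 0xF0) != 0xA0 or (w & 0xC0) != 0x80 or
            x != 0xED or (y & 0xF0) != 0xB0 or (z & 0xC0) != 0x80):
        return None
    # Recombine the payload bits of both halves into one code point.
    code_point = (0x10000 + ((v & 0x0F) << 16) + ((w & 0x3F) << 10)
                  + ((y & 0x0F) << 6) + (z & 0x3F))
    return code_point, pos + 6
```

For example, the six bytes ED A0 BD ED B8 80 should yield the single code point U+1F600 rather than two separate characters. A parser that wants to handle these strings correctly would presumably attempt this check whenever it meets a 0xED lead byte, and fall back to ordinary 3-byte decoding when it fails.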
