Menu

#64 Supplementary unicode characters are not handled properly

open
1
2011-03-29
2011-03-29
No

Java class files can contain modified UTF-8 strings which represent characters with code points above U+FFFF, these characters are represented as a couple of 3-bytes encoded characters. The JVM does not properly parse those characters, it currently skips over treating them as two distinct characters.

Discussion


Log in to post a comment.

MongoDB Logo MongoDB