Supplementary unicode characters are not handled properly
Status: Beta
Brought to you by:
crystall
Java class files can contain modified UTF-8 strings which represent characters with code points above U+FFFF, these characters are represented as a couple of 3-bytes encoded characters. The JVM does not properly parse those characters, it currently skips over treating them as two distinct characters.