From: Derek L. <DL...@qn...> - 2010-09-27 12:17:15
|
Thanks Markus! ________________________________ From: Markus Scherer [mailto:mar...@gm...] Sent: Sunday, September 26, 2010 11:46 AM To: ICU support mailing list Subject: Re: [icu-support] Having trouble EUC-JP On Fri, Sep 24, 2010 at 10:38 AM, Derek Leach <DL...@qn...> wrote: What about the 02b example: uni: \u8500 \u00A1 \u4F01 \u30F1 \u30E6 ch: CJK UN INVERT CJK UN KATAKA KATAKA targ: 0 1 2 3 4 5 6 7 8 uni: \xBC \xC3 \x1A \xB4 \xEB \xA5 \xF1 \xA5 \xE6 ch: utarg: 0 1 2 3 4 uni: \u8500 \u001A Where the u00A1 converts back into \u001A? Is that expected? U+00A1 does not have a real mapping in the EUC-JP table, which convrtrs.txt says is ibm-33722_P12A_P12A-2004_U2.ucm. The entry for it there is <U00A1> \x1A |2 which is a mapping to the alternate single-byte subsitution character that IBM likes to use in variable-width mapping tables. markus |