From: Fridrich S. <fri...@bl...> - 2011-04-19 14:10:27
|
On Tue, 2011-04-19 at 09:22 -0400, Edward Mendelson wrote: > Thanks for that quick reply. I don't have such a document, but I'll > ask the person who asked me for help, in the hope that he might be > able to create one. Yup, would be good to see how that document is. Whether the characters are stored as charset 12 or not. Because in WP for mac we do two things, first try to map the Mac character that is in the first byte of the 0xCO function, and if we don't succeed, we use the second two bytes to use the mapping of the WP5 character set/character. WP5 character set has some charset for japanese, but it is normally used only for documents that were converterted from windows/dos WP file-formats. Now, there is a function 0xC8 which is defined like this: Double-Byte Script Character <0xC8> (length = 4) [character]double-byte Macintosh script character <0xC8> I strongly suspect that those characters are encoded like that and we don't support this at all. Now, if we could have the table of those or at least parts of such table, it would be ~trivial to extend our coverage by mapping this to unicode. I did not find the tables anywhere and since I did not know how much is this used, I did not dig more. If you manage to get some more information, we can extend our support to this function in a ~trivial way. Cheers F. |