From: Scott J. C. <ced...@cs...> - 2004-03-16 17:50:48
Shuji,

Thanks for the pointers to Unicode sites.

On Mon, Mar 15, 2004 at 02:03:48AM -0800, Shuji Yamaguchi wrote:
> I have however a 2nd thought that it may be quicker and more straightforward
> to write a program which converts a Japanese character to an alphabet (e.g.
> by mapping an internal encoding in hexadecimal to 'a' to 'p' character,
> instead of the regular 0-f characters, and vice versa). InfoMap then will be
> able to handle a 'Japanese' words as another sequence of alphabets, though
> it would double the length of word representation within InfoMap. Obviously
> it has a drawback that you can not read a Japanese word in the direct
> outputs from InfoMap, which have to be converted back to be shown as a
> meaningful character.
> If you can think of any other pitfalls in this sort of method, please let me
> know.
>

So you're saying that any (alphabetic) Japanese character would be
represented by a unique string of alphabetic ASCII characters? That's an
interesting approach. I'd like to think it over a little more before
offering other comments.

I'm sorry for the delays on making CVS available. I'll post to the
infomap-nlp-devel list when it's ready.

Scott
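For reference, the mapping proposed in the quoted message could look roughly like the sketch below. It is only an illustration, not code from InfoMap or from either poster: the function names are hypothetical, and UTF-8 is an assumed stand-in for the "internal encoding" (the thread does not say which one is meant). Each byte of the encoded word is split into two 4-bit values, and each value 0-15 is written as a letter 'a'-'p' instead of a hex digit 0-f.

```python
# Sketch only: helper names are hypothetical, and UTF-8 is an assumption
# (the thread just says "an internal encoding").
NIBBLE_ALPHABET = "abcdefghijklmnop"  # value 0 -> 'a', ..., value 15 -> 'p'


def encode_word(word: str, encoding: str = "utf-8") -> str:
    """Turn a word into a purely alphabetic ASCII token, two letters per byte."""
    letters = []
    for byte in word.encode(encoding):
        letters.append(NIBBLE_ALPHABET[byte >> 4])    # high 4 bits
        letters.append(NIBBLE_ALPHABET[byte & 0x0F])  # low 4 bits
    return "".join(letters)


def decode_word(token: str, encoding: str = "utf-8") -> str:
    """Invert encode_word so that output tokens can be shown as readable text."""
    if len(token) % 2 != 0:
        raise ValueError("token must contain an even number of letters")
    data = bytearray()
    for i in range(0, len(token), 2):
        # str.index raises ValueError for any letter outside 'a'-'p'
        high = NIBBLE_ALPHABET.index(token[i])
        low = NIBBLE_ALPHABET.index(token[i + 1])
        data.append((high << 4) | low)
    return data.decode(encoding)


if __name__ == "__main__":
    token = encode_word("日本語")
    print(token)
    print(decode_word(token))
```

The token is always twice as long as the encoded byte string, which matches the doubling mentioned in the quoted message. Note that plain ASCII words passed through the same function are rewritten as well, so a mixed-language corpus would need either uniform encoding or a way to tell encoded tokens from ordinary ones.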