From: Scott J. C. <ced...@cs...> - 2004-03-16 17:50:48
Shuji,
Thanks for the pointers to Unicode sites.
On Mon, Mar 15, 2004 at 02:03:48AM -0800, Shuji Yamaguchi wrote:
> I have, however, had a second thought: it may be quicker and more
> straightforward to write a program which converts a Japanese character to
> alphabetic letters (e.g. by mapping its internal encoding in hexadecimal to
> the characters 'a' to 'p' instead of the regular 0-f digits, and vice
> versa). InfoMap will then be able to handle a 'Japanese' word as just another
> sequence of letters, though it would double the length of the word
> representation within InfoMap. Obviously it has the drawback that you cannot
> read a Japanese word in the direct outputs from InfoMap, which have to be
> converted back to be shown as meaningful characters.
> If you can think of any other pitfalls in this sort of method, please let me
> know.
>
So you're saying that any (alphabetic) Japanese character would be
represented by a unique string of alphabetic ASCII characters?
That's an interesting approach. I'd like to think it over a
little more before offering other comments.
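For concreteness, here is a minimal sketch of the mapping as I read it.
It is only an illustration, not anything InfoMap does today. The
function names encode_word and decode_word are hypothetical, and UTF-8
is an assumption standing in for the "internal encoding" (the quoted
message does not say which encoding is meant). Each byte becomes two
letters in the range 'a' to 'p'.

```python
# Sketch only: UTF-8 is an assumed encoding, and these helper names are
# hypothetical; neither comes from InfoMap itself.
HEX_DIGITS = "0123456789abcdef"
LETTERS = "abcdefghijklmnop"

# '0' -> 'a', '1' -> 'b', ..., '9' -> 'j', 'a' -> 'k', ..., 'f' -> 'p'
_TO_LETTERS = str.maketrans(HEX_DIGITS, LETTERS)
_TO_HEX = str.maketrans(LETTERS, HEX_DIGITS)


def encode_word(word: str) -> str:
    """Turn a word into a purely alphabetic ASCII string (a-p only)."""
    return word.encode("utf-8").hex().translate(_TO_LETTERS)


def decode_word(encoded: str) -> str:
    """Invert encode_word so output can be shown as readable text again."""
    return bytes.fromhex(encoded.translate(_TO_HEX)).decode("utf-8")
```

Because each hex digit maps to exactly one letter, the encoding is
reversible and distinct words stay distinct. One thing to watch: plain
ASCII words that pass through the same step are rewritten too, so the
conversion has to be applied consistently to the whole corpus and to
every query.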
I'm sorry for the delay in making CVS available. I'll post to
the infomap-nlp-devel list when it's ready.
Scott