From: Shuji Y. <yam...@ya...> - 2004-03-11 14:17:39
Hi Scott, Beate,

As Beate wrote about my_isalpha(), I note that it rejects non-ASCII characters from the outset. Are there any other parts of InfoMap that I should look at more closely and, if necessary, change to make it capable of handling Japanese and other multibyte characters? I expect to work this out by trial and error, but any guidance you could give would streamline the process.

I plan to use UTF-8 as the encoding. I hope that my changes will be transparent to ASCII, so that they can be merged back into the main release if we want to.

I would appreciate access to CVS when it is ready.

Regards,
Shuji

-----Original Message-----
From: Scott James Cederberg [mailto:ced...@cs...]
Sent: Wednesday, March 10, 2004 3:08 PM
To: Beate Dorow
Cc: inf...@li...; yam...@ya...
Subject: Re: [infomap-nlp-devel] Re: [infomap-nlp-users] Infomap. Can I choose and feed "content-bearing words" to "count_wordvec"? (fwd)

Beate,

Thanks for your help! What you describe sounds like a reasonable approach. Unfortunately, I need to do some housekeeping with our CVS repository before it can be changed by multiple people without making a mess. I am planning to do that by the end of the week, and I'll get back to you.

Scott
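
To make the my_isalpha() question above concrete, here is a minimal sketch of one possible ASCII-transparent change. It is not InfoMap's actual code: the name my_isalpha_utf8 and its int-in, int-out signature are hypothetical, and it assumes the tokenizer tests the input one byte at a time. The idea is that every byte of a UTF-8 multibyte sequence has its high bit set, so accepting all such bytes lets multibyte characters through while leaving the ASCII behaviour unchanged.

```c
#include <ctype.h>

/* Hypothetical replacement for an ASCII-only alphabetic test.
 * Assumption: the caller passes one byte of UTF-8 input at a time. */
int my_isalpha_utf8(int c)
{
    /* Normalise to 0..255 so that bytes read through a signed char
     * (which arrive as negative values) are handled correctly. */
    unsigned char b = (unsigned char)c;

    /* Lead and continuation bytes of UTF-8 multibyte sequences all
     * have the high bit set; treat them as word characters. */
    if (b >= 0x80) {
        return 1;
    }

    /* Plain ASCII: defer to the standard library test. */
    return isalpha(b) != 0;
}
```

Note the limits of this sketch: it also accepts non-ASCII punctuation, and because Japanese text is not delimited by whitespace, a byte-level test like this does not by itself split Japanese text into words.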