[infomap-nlp-devel] my_isalpha(). What else should I change to make InfoMap capable of handling mult

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Hi Scott, Beate,

As Beate wrote on my_isalpha(), I note it does not accept non-ASCII
characters from its outset.

Are there any other parts of InfoMap I should give a closer look and if
necessary change for making it capable of handling Japanese and other
multibyte characters?  I think I have to do so by trials and errors, but if
you could give me guidance it would streamline my process.

I plan to use UTF8 as encoding. I hope that my changes would be transparent
to ASCII and could be brought back to the main release if we want to. I
would be appreciate if I could have access to CVS when it is ready.

Regards, Shuji

-----Original Message-----
From: Scott James Cederberg [mailto:ced...@cs...] 
Sent: Wednesday, March 10, 2004 3:08 PM
To: Beate Dorow
Cc: inf...@li...; yam...@ya...
Subject: Re: [infomap-nlp-devel] Re: [infomap-nlp-users] Infomap. Can I
choose and feed "content-bearing words" to "count_wordvec"? (fwd)

Beate,

   Thanks for your help!  What you describe sounds like a reasonable
   approach.

   Unfortunately, I need to do some housekeeping with our CVS
   repository before it can be changed by multiple people without
   making a mess.  I am planning to do that by the end of the week,
   and I'll get back to you.

                                                    Scott

[infomap-nlp-devel] my_isalpha(). What else should I change to make InfoMap capable of handling mult

[infomap-nlp-devel] my_isalpha(). What else should I change to make InfoMap capable of handling multibyte characters?