From: Dominic W. <wi...@go...> - 2011-02-21 15:38:04
|
Hi Shiva, Please accept my apologies as well: my Sourceforge account has been dormant for too long. Scott is right, Semantic Vectors (http://code.google.com/p/semanticvectors/) is an actively supported descendent of Infomap. Its tokenization is based upon Apache Lucene and as far as I know, nobody has had any problems with UTF8 or right-to-left languages. If you were to try it out, I'd appreciate any feedback you have. Best wishes, Dominic On Mon, Feb 21, 2011 at 12:54 AM, Scott Cederberg <ced...@gm...> wrote: > Hi Shiva, > I'm sorry for the very belated reply. > The Infomap project has not been actively developed or maintained for a few > years now, and sad to say it's not likely to be in the future. > You may want to consider the Semantic Vectors project as an alternative. It > is based on the same research and is under active development. I've added > Dominic Widdows, the leader of the Semantic Vectors project (and former > leader of the Infomap project), to this thread. He may be able to tell you > about Semantic Vectors' support for UTF-8 and RTL languages. > Best, > Scott > > On Tue, Jan 11, 2011 at 1:59 PM, shiva taslimi <sh....@go...> > wrote: >> >> Dear infomap users, >> >> I would like to know if INFOMAP supports UTF-8, >> and if it can be used for right-to-left languages. >> >> Any help would be appreciated. >> >> ------------------------------------------------------------------------------ >> Protect Your Site and Customers from Malware Attacks >> Learn about various malware tactics and how to avoid them. Understand >> malware threats, the impact they can have on your business, and how you >> can protect your company and customers by using code signing. >> http://p.sf.net/sfu/oracle-sfdevnl >> _______________________________________________ >> infomap-nlp-users mailing list >> inf...@li... >> https://lists.sourceforge.net/lists/listinfo/infomap-nlp-users >> > > |