A study environment for ancient languages (Coptic, Greek, Latin)
Subtitle translator from one natural language to another.
We describe a simple XML format to share text documents and annotation
Functional Arabic Morphology
A Porter stemmer algorithm coded in ooRexx
Statistical phrase-based machine translation system
Java API for the Romanian WordNet
A program to de-inflect modern Hebrew words
Weka wrapper for the SGM toolkit for text classification and modeling.
Web corpus creation software (moved to GitHub)
Simple BNF parser that produces XML markup of matches
Dialogue Similarity
WNLT is a suite of open source natural language modules for the Welsh
Lexicon and data model for Elvish languages
A simple vocabulary builder with Unicode support.
Nigerian component of the International Corpus of English