Statistical machine intelligence and learning engine
Unicode XML TEI text analysis platform
Data and Text Mining Software for Everyone
DSTK - DataScience ToolKit for All of Us
an application to automatically extract text from comic books.
Statistical phrase-based machine translation system
Log-linear analysis (data modelling) for high-dimensional data