Aligns tokens in two versions of a text with differing tokenization.
Count frequency of single, 2-word and 3-word clusters in a text
A toolkit for managing and manipulating text annotations
Safe Harbor Deidentification for medical documents
Text categorization, arabic language processing, language modeling
the intelligent predictive text entry platform
Collecter and manager of semiotica annalisis data
Turku Event Extraction System
We describe a simple XML format to share text documents and annotation
simple BNF parser makes xml markup of matches
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
Arabic Text Vocalization system
automatic alignment pipeline for parallel treebanks
Complete tool for constructing/manipulating languages in digital form