Aligns tokens in two versions of a text with differing tokenization.
Phrase-Based & Neural Unsupervised Machine Translation
Text categorization, Arabic language processing, language modeling
Natural language corpora search engine
We describe a simple XML format to share text documents and annotations
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
Python, NLTK-based package for shallow parsing of Brazilian Portuguese
A synonym extractor based on web-corpora and a multilingual translator