Indexing and query tools for very large text corpora
The Linguistic Analyzer is a tool for corpus analysis and comparison
Phrase-Based & Neural Unsupervised Machine Translation
@Note2 - A workbench for Biomedical Text Mining
Text categorization, arabic language processing, language modeling
An open source system for Arabic corpora processing
We describe a simple XML format to share text documents and annotation
Dialogue Similarity
A parallel corpora (bitext) aligning tool. Create TMX databases
cross-languages resources
Python, NLTK-based package for shallow parsing of Brazilian Portuguese
A POS, disfluency and multi-word unit annotator for spoken language
An Arabic Corpora Processing Tool
A tool for exporting Wikipedia data
A repository of software, documentation and data for NLP