Indexing and query tools for very large text corpora
External plugins for modnlp/teccli
The Linguistic Analyzer is a tool for corpus analysis and comparison
Corpus Linguistics Software
Parsing Korean words by morpheme and part-of-speech
A 50 million tokens corpus of Classical Arabic.
This program is for text lemmatization
Text categorization, arabic language processing, language modeling
Powerful search library, best suited for computer-aided translation
A text management tool for linguistic purposes...
A multilingual Parallel Arabic DIalectal Corpus
Web corpus creation software (moved to GitHub)
Nigerian component of the International Corpus of English
cross-languages resources
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
Drug name extraction
Validation of terms in corpus
An ongoing project to collate and provide access to language data
CRFSharp is a .NET(C#) implementation of Conditional Random Field