Indexing and query tools for very large text corpora
Unicode-XML-TEI text/corpus analysis platform
External plugins for modnlp/teccli
The Linguistic Analyzer is a tool for corpus analysis and comparison
A 50-million-token corpus of Classical Arabic.
Corpus Linguistics Software
Parsing Korean words by morpheme and part-of-speech
This program is for text lemmatization
Text categorization, Arabic language processing, language modeling
Powerful search library, best suited for computer-aided translation
A text management tool for linguistic purposes...
R interface to the Corpus Query Protocol
A multilingual Parallel Arabic DIalectal Corpus
A corpus containing more than 1 million distinct Arabic words.
Arabic business and management corpus
Cross-language resources
Web corpus creation software (moved to GitHub)
Nigerian component of the International Corpus of English
Open-source tool for Arabic text readability
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
Drug name extraction
A POS, disfluency and multi-word unit annotator for spoken language