A Model Context Protocol server for searching and analyzing arXiv
Indexing and query tools for very large text corpora
Private & local AI personal knowledge management app
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
DeepSeek Coder: Let the Code Write Itself
Modular Suite of NLP Tools
a collection of indexing and search tools for corpus linguists
Tools to download and cleanup Common Crawl data
American fuzzy lop - a security-oriented fuzzer
The most comprehensive database of Chinese poetry
Software tools to re-tell stories in a better way and expand them
GloVe model for distributed word representation
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
A POS, disfluency and multi-word unit annotator for spoken language
Utility classes from maps to search engine to random samplers