Stanford NLP Python library for many human languages
Stanford CoreNLP, a Java suite of core NLP tools
Document (PDF, Word, PPTX ...) extraction and parsing API
Syntax tree editor for rapid annotation of existing text
Modest natural-language processing
The Classical Language Toolkit
Qwen3-TTS is an open-source series of TTS models
A simple tool for reading in poorly redacted documents
Python tool for converting files and office documents to Markdown
PostgreSQL extension for BM25 relevance-ranked full-text search
Discourse Network Analyzer (DNA)
Han Language Processing
NLTK Source
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
LLM-based Reinforcement Learning audio editing model
The most accurate natural language detection library for Rust
Industrial-level controllable zero-shot text-to-speech system
The most accurate natural language detection library for Python
Text mining using tidy tools
Open source annotation tool for machine learning practitioners
Open-Source Python 3 tool for recognizing layouts, tables, and math
A Pioneering Open-Source Alternative to GPT-4o
Metaprogramming library to analyze and transform Java source code
Open source semantic search and text analytics for large document sets
Chinese XLNet pre-trained model