Stanford NLP Python library for many human languages
Stanford CoreNLP, a Java suite of core NLP tools
Syntax tree editor for rapid annotation of existing text
Document (PDF, Word, PPTX, ...) extraction and parsing API
Modest natural-language processing
The Classical Language Toolkit
Qwen3-TTS is an open-source series of TTS models
A simple tool for reading in poorly redacted documents
PostgreSQL extension for BM25 relevance-ranked full-text search
Python tool for converting files and office documents to Markdown
Discourse Network Analyzer (DNA)
Han Language Processing
NLTK Source
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
LLM-based reinforcement learning audio editing model
Text mining using tidy tools
The most accurate natural language detection library for Rust
Industrial-level controllable zero-shot text-to-speech system
The most accurate natural language detection library for Python
Open source annotation tool for machine learning practitioners
A Pioneering Open-Source Alternative to GPT-4o
Open-Source Python3 tool for recognizing layouts, tables, and math
Metaprogramming library to analyze and transform Java source code
Chinese XLNet pre-trained model
Open source semantic search and text analytics for large document sets