Stanford NLP Python library for many human languages
Stanford CoreNLP, a Java suite of core NLP tools
Syntax tree editor for rapid annotation of existing text
Document (PDF, Word, PPTX ...) extraction and parse API
Modest natural-language processing
The Classical Language Toolkit
Qwen3-TTS is an open-source series of TTS models
A simple tool for reading in poorly redacted documents
Python tool for converting files and office documents to Markdown
PostgreSQL extension for BM25 relevance-ranked full-text search
Discourse Network Analyzer (DNA)
Han Language Processing
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
NLTK Source
LLM-based Reinforcement Learning audio edit model
The most accurate natural language detection library for Rust
Industrial-level controllable zero-shot text-to-speech system
Text mining using tidy tools
The most accurate natural language detection library for Python
Open source annotation tool for machine learning practitioners
Metaprogramming library to analyze and transform Java source code
Open-Source Python3 tool for recognizing layouts, tables, and math
Open source semantic search and text analytics for large document sets
A Pioneering Open-Source Alternative to GPT-4o
General natural language facilities for node