Stanford NLP Python library for many human languages
Stanford CoreNLP, a Java suite of core NLP tools
Document (PDF, Word, PPTX, ...) extraction and parsing API
Syntax tree editor for rapid annotation of existing text
Modest natural-language processing
The Classical Language Toolkit
Qwen3-TTS is an open-source series of TTS models
A simple tool for reading in poorly redacted documents
Python tool for converting files and office documents to Markdown
PostgreSQL extension for BM25 relevance-ranked full-text search
Discourse Network Analyzer (DNA)
Han Language Processing
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
NLTK Source
LLM-based reinforcement learning audio editing model
The most accurate natural language detection library for Rust
Industrial-level controllable zero-shot text-to-speech system
Text mining using tidy tools
The most accurate natural language detection library for Python
Open source annotation tool for machine learning practitioners
Metaprogramming library to analyze and transform Java source code
Open-Source Python3 tool for recognizing layouts, tables, and math
A Pioneering Open-Source Alternative to GPT-4o
Open source semantic search and text analytics for large document sets
Chinese XLNet pre-trained model