Recognition and resolution of numbers, units, date/time, etc.
Large Language Model Text Generation Inference
Document (PDF, Word, PPTX ...) extraction and parse API
Module for automatic summarization of text documents and HTML pages
High-performance inference server and API layer for text embedding models
Stanford CoreNLP, a Java suite of core NLP tools
New way to create a web server and NoSQL data model
A simple interface for working with TeX documents
Modest natural-language processing
AI tool that removes hardcoded subtitles and text from videos locally
Lightweight and flexible command-line JSON processor
Persian NLP Toolkit
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Speech Note Linux app. Note taking, reading and translating
Robust Speech Recognition via Large-Scale Weak Supervision
A full spaCy pipeline and models for scientific/biomedical documents
Han Language Processing
Open source text shaping engine
Parser generator to read, process, or translate structured text
General natural language facilities for Node.js
Underthesea - Vietnamese NLP Toolkit
OCR model for complex documents with layout-aware structured outputs
Text mining using tidy tools
The pluggable natural language linter for text and Markdown
Comprehensive Gradio WebUI for audio processing