Recognition and resolution of numbers, units, date/time, etc.
Large Language Model Text Generation Inference
Document (PDF, Word, PPTX ...) extraction and parsing API
Module for automatic summarization of text documents and HTML pages
High-performance inference server and API layer for text embedding models
Stanford CoreNLP, a Java suite of core NLP tools
A new way to create web servers and NoSQL data models
A simple interface for working with TeX documents
Modest natural-language processing
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Persian NLP Toolkit
Han Language Processing
Text mining using tidy tools
Robust Speech Recognition via Large-Scale Weak Supervision
A full spaCy pipeline and models for scientific/biomedical documents
General natural language facilities for Node.js
Open source healthcare AI
A persistent, network-resilient, full-text search library
Underthesea - Vietnamese NLP Toolkit
Parser generator to read, process, or translate structured text
OCR model for complex documents with layout-aware structured outputs
Comprehensive Gradio WebUI for audio processing
Open source text shaping engine
The pluggable natural language linter for text and Markdown
OCRmyPDF adds an OCR text layer to scanned PDF files