Recognition and resolution of numbers, units, date/time, etc.
Large Language Model Text Generation Inference
Document (PDF, Word, PPTX ...) extraction and parse API
Module for automatic summarization of text documents and HTML pages
High-performance inference server for text embeddings models API layer
Stanford CoreNLP, a Java suite of core NLP tools
Modest natural-language processing
AI tool that removes hardcoded subtitles and text from videos locally
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Persian NLP Toolkit
Speech Note Linux app. Note taking, reading and translating
Han Language Processing
A full spaCy pipeline and models for scientific/biomedical documents
Robust Speech Recognition via Large-Scale Weak Supervision
Text mining using tidy tools
General natural language facilities for node
Underthesea - Vietnamese NLP Toolkit
A persistent, network resilient, full text search library
Open source healthcare AI
OCR model for complex documents with layout-aware structured outputs
The pluggable natural language linter for text and markdown
OCRmyPDF adds an OCR text layer to scanned PDF files
Contexts Optical Compression
Comprehensive Gradio WebUI for audio processing
The most accurate natural language detection library for Python