Recognition and resolution of numbers, units, date/time, etc.
Large Language Model Text Generation Inference
Module for automatic summarization of text documents and HTML pages
Modest natural-language processing
Stanford CoreNLP, a Java suite of core NLP tools
A simple interface for working with TeX documents
New way to create web server and NoSQL data model
Underthesea - Vietnamese NLP Toolkit
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
A full spaCy pipeline and models for scientific/biomedical documents
Lightweight and flexible command-line JSON processor
Persian NLP Toolkit
Text mining using tidy tools
A persistent, network resilient, full text search library
General natural language facilities for node
Open source text shaping engine
Robust Speech Recognition via Large-Scale Weak Supervision
Han Language Processing
Easy-to-use and powerful NLP library with Awesome model zoo
Parser generator to read, process, or translate structured text
The most accurate natural language detection library for Python
OCRmyPDF adds an OCR text layer to scanned PDF files
The pluggable natural language linter for text and markdown
Contexts Optical Compression
A Repo For Document AI