OCRmyPDF adds an OCR text layer to scanned PDF files
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Enhances Tesseract OCR output using LLMs (local or API)
Toolkit for conversational AI
Readest is a modern, feature-rich ebook reader
A full spaCy pipeline and models for scientific/biomedical documents
Accurate × Fast × Comprehensive
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Han Language Processing
The behavior guidance framework for customer-facing LLM agents
NLP Cloud serves high performance pre-trained or custom models for NER
Go efficient multilingual NLP and text segmentation
NLP Cloud serves high performance pre-trained or custom models
Framework for building real-time voice and multimodal AI agents
Open source clipboard management tools for Windows, Macos and Linux
The media player for language learning, with dual subtitles
Persian NLP Toolkit
OCR expert VLM powered by Hunyuan's native multimodal architecture
OCR offline image text recognition command line windows program
Speech recognition for your site
Crowdsourcing platform for full text transcription and tagging
StreamSpeech is a seamless model for offline speech recognition
A simple tool for reading in poorly redacted documents
Fast multimodal LLM for real-time voice interaction and AI apps
Speech to Text to Speech, sends text as OSC messages