OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Contexts Optical Compression
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR software, free and offline
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Accurate × Fast × Comprehensive
OCR expert VLM powered by Hunyuan's native multimodal architecture
Open Source Document Management System for Digital Archives
Visual Causal Flow
Multilingual Document Layout Parsing in a Single Vision-Language Model
Implementation of Nougat Neural Optical Understanding
A Unified Toolkit for Deep Learning Based Document Image Analysis
Ozyr is a simple and easy to use OCR snipping tool
A supercharged version of paperless, scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux
The tool supports template-based parsing, allowing structured output i