A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR software, free and offline
Library for OCR-related tasks powered by Deep Learning
Ready-to-use OCR with 80+ supported languages
Contexts Optical Compression
Accurate × Fast × Comprehensive
Open Source Document Management System for Digital Archives
OCR expert VLM powered by Hunyuan's native multimodal architecture
Visual Causal Flow
Multilingual Document Layout Parsing in a Single Vision-Language Model
Implementation of Nougat Neural Optical Understanding
Ozyr is a simple, easy-to-use OCR snipping tool
A supercharged version of paperless: scan, index, and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux