Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Awesome multilingual OCR toolkits based on PaddlePaddle
Library for OCR-related tasks powered by Deep Learning
Ready-to-use OCR with 80+ supported languages
Open Source Document Management System for Digital Archives
OCR expert VLM powered by Hunyuan's native multimodal architecture
Implementation of Nougat Neural Optical Understanding
A Unified Toolkit for Deep Learning Based Document Image Analysis
Ozyr is a simple, easy-to-use OCR snipping tool
A supercharged version of paperless: scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux