Awesome multilingual OCR toolkits based on PaddlePaddle
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR software, free and offline
Contexts Optical Compression
Library for OCR-related tasks powered by Deep Learning
OCRmyPDF adds an OCR text layer to scanned PDF files
Accurate × Fast × Comprehensive
OCR expert VLM powered by Hunyuan's native multimodal architecture
Ready-to-use OCR with 80+ supported languages
Visual Causal Flow
Multilingual Document Layout Parsing in a Single Vision-Language Model