OCR software, free and offline
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
OCR expert VLM powered by Hunyuan's native multimodal architecture
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Library for OCR-related tasks powered by Deep Learning
Convert AI papers to GUI
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
A Repo For Document AI
A community-supported supercharged version of paperless
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A Python application to add watermarks (text or image) to PDF files
FaceOnLive Open KYC: Streamlining Identity Verification with AI
Implementation of Nougat Neural Optical Understanding
Img2Txt - Extract Text From Images using AI
An OCR translator tool built with Tesseract and python-opencv
The ultimate tool to automate custom Telegram message forwarding
CCTV Footage Timestamp Search Tool
A Unified Toolkit for Deep Learning Based Document Image Analysis
e-Dokyumento is a web-based Document Management System (DMS)