Open Source OCR Engine
Contexts Optical Compression
PDF to Markdown with vision models
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
Free OCR Software: No internet required, easy to use.
OCR expert VLM powered by Hunyuan's native multimodal architecture
A pure Javascript Multilingual OCR
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A cross-platform software for text translation and recognition
Convert AI papers to GUI
Math OCR model that outputs LaTeX and markdown
A high-quality tool for convert PDF to Markdown and JSON
Readest is a modern, feature-rich ebook reader
Web application that allows you to perform operations on PDF files
PDF scientific paper translation with preserved formats
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
A Repo For Document AI
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Qwen3-VL, the multimodal large language model series by Alibaba Cloud