OCR software, free and offline
Contexts Optical Compression
Accurate × Fast × Comprehensive
PDF to Markdown with vision models
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
Enhances Tesseract OCR output using LLMs (local or API)
Awesome multilingual OCR toolkits based on PaddlePaddle
Library for OCR-related tasks powered by Deep Learning
A high-quality tool for converting PDF to Markdown and JSON
Ready-to-use OCR with 80+ supported languages
OCR expert VLM powered by Hunyuan's native multimodal architecture
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Multilingual Document Layout Parsing in a Single Vision-Language Model
Convert AI papers to GUI
PDF scientific paper translation with preserved formats
Windrecorder is a memory search app that records everything
Math OCR model that outputs LaTeX and Markdown
A simple tool for reading in poorly redacted documents
Open Source Document Management System for Digital Archives
Get your documents ready for gen AI
A Repo For Document AI
A framework to enable multimodal models to operate a computer
Document content and metadata extraction microservice