OCR software, free and offline
Contexts Optical Compression
Accurate × Fast × Comprehensive
Welcome the Era of One-shot Long-horizon Parsing
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Enhances Tesseract OCR output using LLMs (local or API)
A high-quality tool for convert PDF to Markdown and JSON
An Open-Source Toolkit for General-OCR Research and Applications
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
OCR expert VLM powered by Hunyuan's native multimodal architecture
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Convert AI papers to GUI
Multilingual Document Layout Parsing in a Single Vision-Language Model
A framework to enable multimodal models to operate a computer
Open Source Document Management System for Digital Archives
Get your documents ready for gen AI
A Repo For Document AI
OCR model for complex documents with layout-aware structured outputs
OpenRecall is a fully open-source, privacy-first alternative
Document content and metadata extraction microservice
Structured data extraction and instruction calling with ML, LLM
A community-supported supercharged version of paperless