OCR software, free and offline
OCRmyPDF adds an OCR text layer to scanned PDF files
A high-quality tool for convert PDF to Markdown and JSON
Visual Causal Flow
Get your documents ready for gen AI
Open Source Document Management System for Digital Archives
Document content and metadata extraction microservice
A community-supported supercharged version of paperless
A Repo For Document AI
A Python application to add watermarks (text or image) to PDF files
A supercharged version of paperless, scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux
Virtual Appliance of RadicalSpam
The tool supports template-based parsing, allowing structured output i