Open-source Python 3 tool for recognizing layouts, tables, and math
CLI tool to extract (meta)data from PDF and manipulate PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
A simple tool for reading in poorly redacted documents
A Python application to add watermarks (text or image) to PDF files
Budou is an automatic organizer tool for beautiful line breaking in CJK (Chinese, Japanese, and Korean) text