Open-Source Python3 tool for recognizing layouts, tables, and math
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
CLI tool to extract (meta)data from PDF and manipulate PDF files
JSON Hero is an open-source, beautiful JSON explorer for the web
A simple tool for reading in poorly redacted documents
Download, save and convert multiple subtitles from YouTube videos
C++ library for creating XLSX files for MS Excel 2007 and above.
A JavaScript HTML screenshot renderer
Budou is an auto organizer tool for beautiful line breaking in CJK
Source code to formatted text converter
A scientific document recognition system