Open-source Python 3 tool for recognizing layouts, tables, and math
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
JSON Hero is an open-source, beautiful JSON explorer for the web
CLI tool to extract (meta)data from PDF and manipulate PDF files
A simple tool for reading in poorly redacted documents
Download, save, and convert multiple subtitles from YouTube videos
C++ library for creating XLSX files for MS Excel 2007 and above
Improved JPEG encoder
A JavaScript HTML screenshot renderer
Budou is an automatic organizer tool for beautiful line breaking in CJK text
Source code to formatted text converter
A CSV/ASCII data log file converter
A scientific document recognition system