Node.js module for rendering pdf pages to images, svgs and HTML files
World's most comprehensive, powerful, process-based PDF editor
World's most comprehensive, powerful, process-based PDF editor
Open Source OCR Engine
OCRmyPDF adds an OCR text layer to scanned PDF files
A fast image processing library with low memory needs
Parse files for optimal RAG
Build cross-modal and multimodal applications on the cloud
ITTT is a Free tool designed to Scan and extract Text from Images.
Create, Edit, Delete, Organize , Convert, Export, Secure & Sign.
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Document Management System and Content Management System
Download books from the hathitrust website in a fast and easy manner
Android Manga reader with Japanese OCR and dictionary capabilities
Award-winning modern data processing in C++17/20
ADAMS is a workflow engine for building complex knowledge workflows.
Common Resource Grep
Easy Tools of PDF, Image, File, Network, Data, and Medias
A supercharged version of paperless, scan, index and archive docs
A graphical frontend to tesseract-ocr
Analysis Nuclei DAB (AND) Tool
Easy-OCR solution and Tesseract trainer for GNU/Linux
It is a Windows library that merges standard PDFs into a final PDF
Provides OCR solutions for Nepali, based on Tesseract 4.0.