Document content and metadata extraction microservice
Structured data extraction and instruction calling with ML, LLM
OpenRecall is a fully open-source, privacy-first alternative
OCR model for complex documents with layout-aware structured outputs
AI tool for automating desktop tasks via natural language input
Vision utilities for web interaction agents
An on-premises, OCR-free unstructured data extraction
Handwritten Text Recognition (HTR) system implemented with TensorFlow
In-depth tutorials on LLMs, RAGs and real-world AI agent applications
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Qwen3-omni is a natively end-to-end, omni-modal LLM
A Python application to add watermarks (text or image) to PDF files
FaceOnLive Open KYC: Streamlining Identity Verification with AI
Implementation of Nougat Neural Optical Understanding
An OCR translator tool made by utilizing tesseract & python-opencv
Img2Txt - Extract Text From Images using AI
Framework with web data entry, OCR & designer
The ultimate tool to automate custom telegram message forwarding
CCTV Footage Timestamp Search Tool
A Unified Toolkit for Deep Learning Based Document Image Analysis
Ozyr is a simple and easy to use OCR snipping tool
e-Dokyumento is web-based Document Management System (DMS)
Typeface from Ming Dynasty woodblock printed books
A supercharged version of paperless, scan, index and archive docs