OCR software, free and offline
Accurate × Fast × Comprehensive
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR expert VLM powered by Hunyuan's native multimodal architecture
Document content and metadata extraction microservice
A simple tool for reading in poorly redacted documents
Structured data extraction and instruction calling with ML, LLM
A Repo For Document AI
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Python binding to the Apache Tika™ REST services
Build AI-powered semantic search applications
Aseryla code repositories
The tool supports template-based parsing, allowing structured output i