Qwen3-VL, the multimodal large language model series by Alibaba Cloud
A Repo For Document AI
OCR model for complex documents with layout-aware structured outputs
Declarative way to run AI models in React Native on device
Document content and metadata extraction microservice
Fast and efficient unstructured data extraction
Structured data extraction and instruction calling with ML, LLM
Extract and convert data from any document, images, pdfs, word doc
A community-supported supercharged version of paperless
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Assist in organizing your piles of documents
In-depth tutorials on LLMs, RAGs and real-world AI agent applications
AI tool for automating desktop tasks via natural language input
An on-premises, OCR-free unstructured data extraction
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Qwen3-omni is a natively end-to-end, omni-modal LLM
screen recognition and search
LightWeight OCR
Chess application whichs allows working with chess PDF books and PGNs.
Download books from the hathitrust website in a fast and easy manner
Visual Automation IDE — automate anything you see on screen
Document Management System and Content Management System