Awesome multilingual OCR toolkits based on PaddlePaddle
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A framework to enable multimodal models to operate a computer
Contexts Optical Compression
OCR software, free and offline
AI Agent Application Development Framework
Crowdsourcing platform for full text transcription and tagging
OCRmyPDF adds an OCR text layer to scanned PDF files
Open source AI VTuber platform with voice chat and Live2D avatars
Accurate × Fast × Comprehensive
Enhances Tesseract OCR output using LLMs (local or API)
Powerful Android AI agent with tools, automation, and Linux shell
Visual Causal Flow
OCR expert VLM powered by Hunyuan's native multimodal architecture
A simple tool for reading in poorly redacted documents
Towards Studio-Grade Character Animation via In-Context Learning of 3D
AI assistant based on large models that can actively think and plan
Framework for building AI-powered interactive digital humans and agent
An on-premises, OCR-free unstructured data extraction
A ranked list of awesome machine learning Python libraries
A Python application to add watermarks (text or image) to PDF files
Run GGUF models easily with a UI or API. One File. Zero Install.
PyCAPGE - Python Classic Adventure Point and Click Game Engine
AI-powered PC monitoring that explains. Not shows numbers/spikes.
A powerful, free and open-source tool for TextureAtlases/Spritesheets