Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCR software, free and offline
Awesome multilingual OCR toolkits based on PaddlePaddle
State-of-the-art 2D and 3D Face Analysis Project
Contexts Optical Compression
Accurate × Fast × Comprehensive
OCR expert VLM powered by Hunyuan's native multimodal architecture
Enhances Tesseract OCR output using LLMs (local or API)
Visual Causal Flow
A framework to enable multimodal models to operate a computer
OCRmyPDF adds an OCR text layer to scanned PDF files
Library for OCR-related tasks powered by Deep Learning
AI Agent Application Development Framework
Replace OpenAI GPT with another LLM in your app
Omnilingual ASR Open-Source Multilingual SpeechRecognition
NLP Cloud serves high performance pre-trained or custom models for NER
Repo of Qwen2-Audio chat & pretrained large audio language model
Qwen3-Coder is the code version of Qwen3
Image processing in Python
Ready-to-use OCR with 80+ supported languages
An on-premises, OCR-free unstructured data extraction
A ranked list of awesome machine learning Python libraries
Advanced NLP with spaCy: A free online course
Framework for building AI-powered interactive digital humans and agent
Code release for Cut and Learn for Unsupervised Object Detection