State-of-the-art 2D and 3D Face Analysis Project
NLP Cloud serves high performance pre-trained or custom models for NER
OCR software, free and offline
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
WAFW00F allows one to identify and fingerprint Web App Firewall
Formula recognition based on LaTeX-OCR and ONNXRuntime
Image processing in Python
Library for OCR-related tasks powered by Deep Learning
Repo of Qwen2-Audio chat & pretrained large audio language model
Accurate × Fast × Comprehensive
Lightweight vault and password manager for Android
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Replace OpenAI GPT with another LLM in your app
OCR expert VLM powered by Hunyuan's native multimodal architecture
A library to generate LaTeX expression from Python code
Visual Causal Flow
Ready-to-use OCR with 80+ supported languages
Qwen3-Coder is the code version of Qwen3
A fast, powerful, and simple hierarchical vision transformer
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Pre-trained Deep Learning models and demos
UI Automation Framework for Games and Apps
Code release for Cut and Learn for Unsupervised Object Detection
Jittor is a high-performance deep learning framework
Open-Source AI Camera. Empower any camera/CCTV