High-Performance Face Recognition Library on PaddlePaddle & PyTorch
2D and 3D Face alignment library build using pytorch
State-of-the-art 2D and 3D Face Analysis Project
A Lightweight Face Recognition and Facial Attribute Analysis
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Real time face swap and one-click video deepfake
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Repo of Qwen2-Audio chat & pretrained large audio language model
Speech recognition module for Python
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Graphical User Interface Face Anonymization Tool
Open-source industrial-grade ASR models
Multilingual speech recognition and audio understanding model
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A self-hosted open source photo management service
Faster and easier training and deployments
Enhances Tesseract OCR output using LLMs (local or API)
Library for OCR-related tasks powered by Deep Learning
Create UIs for your machine learning model in Python in 3 minutes
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Integrating LLMs into structured NLP pipelines
An open-source photo thumbnail service by globo.com
Welcome the Era of One-shot Long-horizon Parsing
Image processing in Python