State-of-the-art 2D and 3D Face Analysis Project
GUI for a Vocal Remover that uses Deep Neural Networks
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Everything you need to build state-of-the-art foundation models
A very simple framework for state-of-the-art NLP
State-of-the-art diffusion models for image and audio generation
Native and Compact Structured Latents for 3D Generation
Tiny vision language model
A Lightweight Face Recognition and Facial Attribute Analysis
Optimizing inference proxy for LLMs
Train multi-step agents for real-world tasks using GRPO
Agent Skill for generating 2D sprite sheets and map, transparent PNG
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Modular Deep Reinforcement Learning framework in PyTorch
A high-throughput and memory-efficient inference and serving engine
Faster Whisper transcription with CTranslate2
1 min voice data can also be used to train a good TTS model
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Stable Diffusion web UI
Stable Diffusion built-in to Blender
State-of-the-art Parameter-Efficient Fine-Tuning
MTEB: Massive Text Embedding Benchmark
Image inpainting tool powered by SOTA AI Model
Foundation model for image generation