Fast-stable-diffusion + DreamBooth
A Customizable Image-to-Video Model based on HunyuanVideo
Official inference repo for FLUX.2 models
PyTorch implementation of JiT
Reference PyTorch implementation and models for DINOv3
Sharp Monocular Metric Depth in Less Than a Second
Diffusion Transformer with Fine-Grained Chinese Understanding
Tencent Hunyuan multimodal diffusion transformer (MM-DiT) model
Easy Docker setup for Stable Diffusion with user-friendly UI
RGBD video generation model conditioned on camera input
A latent text-to-image diffusion model
Lightweight multimodal translation model for 55 languages
Small 3B base multimodal model suited to custom AI on edge hardware
Compact 8B multimodal instruct model optimized for edge deployment
Efficient 14B multimodal instruct model for edge deployment, with FP8