Fast-stable-diffusion + DreamBooth
A Customizable Image-to-Video Model based on HunyuanVideo
Kaggle Python docker image
Official inference repo for FLUX.2 models
Easily compute clip embeddings and build a clip retrieval system
Structure-from-Motion and Multi-View Stereo
AI video generator optimized for low VRAM and older GPUs use
PyTorch implementation of JiT
Reference PyTorch implementation and models for DINOv3
Sharp Monocular Metric Depth in Less Than a Second
Diffusion Transformer with Fine-Grained Chinese Understanding
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-Sora: Democratizing Efficient Video Production for All
A Unified Framework for Image Customization
"Big Model" trains a visual multimodal VLM with 26M parameters
A Universal Customization Method for Single and Multi Conditioning
Easy Docker setup for Stable Diffusion with user-friendly UI
RGBD video generation model conditioned on camera input
Instant neural graphics primitives: lightning fast NeRF and more
A Customizable Image-to-Video Model based on HunyuanVideo
Point cloud diffusion for 3D model synthesis
A latent text-to-image diffusion model
Codebase for Image Classification Research, written in PyTorch
Convolutional Neural Networks to predict aesthetic quality of images
Lightweight multimodal translation model for 55 languages