Multimodal Diffusion with Representation Alignment
A unified library of SOTA model optimization techniques
Plug-in that makes it easy to generate Stable Diffusion images
HY-Motion model for 3D character animation generation
PyTorch implementation of JiT
Project Lyra: Open Generative 3D World Models
Official Python inference and LoRA trainer package
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Personalize Any Characters with a Scalable Diffusion Transformer
Code and models for the ICML 2024 paper NExT-GPT
Inference script for Oasis 500M
Tencent Hunyuan multimodal diffusion transformer (MM-DiT) model
Modular AI image and video generation web UI with extensible tools
All-in-one WebUI for AI generative image and video creation
Official PyTorch Implementation
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Official inference repo for FLUX.1 models
A Unified Framework for Image Customization
A SOTA open-source image editing model
A PyTorch library for implementing flow matching algorithms
High-Fidelity and Controllable Generation of Textured 3D Assets
State-of-the-art (SoTA) text-to-video pre-trained model
A Rust machine learning framework