A playground to generate images from any text prompt using Stable Diffusion
InvokeAI is a leading creative engine for Stable Diffusion models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Collection of CVPR 2026 Papers and Open Source Projects
Tokenizer-Free TTS for Multilingual Speech Generation
Autoregressive Model Beats Diffusion
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source multi-speaker long-form text-to-speech model
A unified library of SOTA model optimization techniques
Multimodal Diffusion with Representation Alignment
Plug-in that makes it easy to generate Stable Diffusion images
PyTorch implementation of JiT
HY-Motion model for 3D character animation generation
The desktop app for ComfyUI
Official Python inference and LoRA trainer package
Image inpainting tool powered by SOTA AI models
Project Lyra: Open Generative 3D World Models
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
Personalize Any Characters with a Scalable Diffusion Transformer
UI application to connect multiple AI models together
Modular AI image and video generation web UI with extensible tools
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
All-in-one WebUI for AI generative image and video creation
Code and models for the ICML 2024 paper NExT-GPT
Inference script for Oasis 500M