Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Removes backgrounds from pictures. Extension for webui
User-friendly AI Interface
A single Gradio + React WebUI with extensions for ACE-Step
Stable-diffusion-webui-pixelization
Image generation model with single-stream diffusion transformer
Towards Human-Level Text-to-Speech through Style Diffusion
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
RGBD video generation model conditioned on camera input
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Diffusion Transformer with Fine-Grained Chinese Understanding
Deep learning framework
A simple, secure MCP-to-OpenAPI proxy server
Open-source multi-speaker long-form text-to-speech model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multi-Platform Package Manager for Stable Diffusion
Multimodal Diffusion with Representation Alignment
Inference script for Oasis 500M
Next Generation AI One-Stop Internationalization Solution
Basic Machine Learning Natural Language Processing Roadmap
A Unified Framework for Image Customization
A PyTorch library for implementing flow matching algorithms
Official inference repo for FLUX.1 models
A Powerful Native Multimodal Model for Image Generation