Generating Immersive, Explorable, and Interactive 3D Worlds
A fast TTS architecture with conditional flow matching
MII makes low-latency and high-throughput inference possible
Marrying Grounding DINO with Segment Anything & Stable Diffusion
An open-source text-to-speech system built by inverting Whisper
A PyTorch library for implementing flow matching algorithms
Virtual AI anchor that combines state-of-the-art technology
Run the Stable Diffusion releases in a Docker container
Plug-n-play module turning text-to-image models into animation
Run GGUF models easily with a UI or API. One File. Zero Install.
Implementation of Dreambooth
Consistency Distilled Diff VAE
Chat-based assistant that understands tasks
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Let us control diffusion models
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Basaran, an open-source alternative to the OpenAI text completion API
Official PyTorch Implementation of "Scalable Diffusion Models"
A latent text-to-image diffusion model
A converter for seamless transformation of files, data, and media ...
MMGeneration is a powerful toolkit for generative models
Real-time AI music generation using stable diffusion techniques
Discord bot and Interface for Stable Diffusion
Scripthea is designed to streamline the crafting of prompts for T2I gen.