Stable Diffusion built-in to Blender
HY-Motion model for 3D character animation generation
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open-source multi-speaker long-form text-to-speech model
Personalize Any Characters with a Scalable Diffusion Transformer
Flutter-based cross-platform app integrating major AI models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
InvokeAI is a leading creative engine for Stable Diffusion models
Inference script for Oasis 500M
Multimodal Diffusion with Representation Alignment
Image inpainting tool powered by SOTA AI Model
A Unified Framework for Image Customization
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A Rust machine learning framework
A Fork from Github repository of Illyasviel's Forge
text and image to video generation: CogVideoX (2024) and CogVideo
UI application to connect multiple AI models together
Generating Immersive, Explorable, and Interactive 3D Worlds
A PyTorch library for implementing flow matching algorithms
Next Generation AI One-Stop Internationalization Solution
A Powerful Native Multimodal Model for Image Generation
Official code for Style Aligned Image Generation via Shared Attention
State-of-the-art Parameter-Efficient Fine-Tuning
A fast TTS architecture with conditional flow matching
Virtual AI anchor that combines state-of-the-art technology