TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Diffusion Transformer with Fine-Grained Chinese Understanding
EPUB to audiobook converter, optimized for Audiobookshelf
InvokeAI is a leading creative engine for Stable Diffusion models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Free, open-source crypto trading bot
Repo for SeedVR2 & SeedVR
A simple, secure MCP-to-OpenAPI proxy server
HY-Motion model for 3D character animation generation
Open-source multi-speaker long-form text-to-speech model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Image inpainting tool powered by SOTA AI models
Multimodal Diffusion with Representation Alignment
Personalize Any Characters with a Scalable Diffusion Transformer
State-of-the-art (SoTA) text-to-video pre-trained model
Official PyTorch Implementation
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Inference script for Oasis 500M
Generating Immersive, Explorable, and Interactive 3D Worlds
Operating LLMs in production
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
The official repo of the Qwen chat and pretrained large language models
A Unified Framework for Image Customization
A SOTA open-source image editing model
High-Fidelity and Controllable Generation of Textured 3D Assets