Create UIs for your machine learning model in Python in 3 minutes
Focus on prompting and generating
A single Gradio + React WebUI with extensions for ACE-Step
A simple, high-quality voice conversion tool focused on ease of use
Comprehensive Gradio WebUI for audio processing
Synchronized Translation for Videos
EPUB to audiobook converter, optimized for Audiobookshelf
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Chat with multiple PDFs locally
Unified Multimodal Understanding and Generation Models
Diffusion Transformer with Fine-Grained Chinese Understanding
Speech-AI-Forge is a project developed around TTS generation model
A fast TTS architecture with conditional flow matching
One-click deployment (including offline integration package)
The python library for real-time communication
Real-time voice interactive digital human
Time-lapse Video Generation Models as Metamorphic Simulators
Stable Diffusion web UI
SoTA open-source TTS
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Oobabooga - The definitive Web UI for local AI, with powerful features
An open-source RAG-based tool for chatting with your documents
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Text and image to video generation: CogVideoX and CogVideo