Create UIs for your machine learning model in Python in 3 minutes
Focus on prompting and generating
A single Gradio + React WebUI with extensions for ACE-Step
Comprehensive Gradio WebUI for audio processing
A simple, high-quality voice conversion tool focused on ease of use
Synchronized Translation for Videos
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
EPUB to audiobook converter, optimized for Audiobookshelf
Chat with multiple PDFs locally
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Diffusion Transformer with Fine-Grained Chinese Understanding
Speech-AI-Forge is a project developed around TTS generation model
A fast TTS architecture with conditional flow matching
One-click deployment (including offline integration package)
The python library for real-time communication
Unified Multimodal Understanding and Generation Models
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Stable Diffusion web UI
Time-lapse Video Generation Models as Metamorphic Simulators
SoTA open-source TTS
Real-time voice interactive digital human
An open-source RAG-based tool for chatting with your documents
From Images to High-Fidelity 3D Assets
A Web UI for easy subtitle using whisper model
Text and image to video generation: CogVideoX and CogVideo