Low-code app builder for RAG and multi-agent AI applications
GUI for a Vocal Remover that uses Deep Neural Networks
The most powerful and modular diffusion model GUI, api and backend
GUI Exploration Lab. One of the best GUI agent solutions
Powerful tool that lets you create and run intelligent agents
No-code multi-agent framework to build LLM Agents, workflows
Image polygonal annotation with Python
A state-of-the-art open visual language model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
UI-TARS-desktop version that can operate on your local personal device
Witness the aha moment of VLM with less than $3
Framework and no-code GUI for fine-tuning LLMs
Generate audiobooks from e-books
Convert AI papers to GUI
Real-time behaviour synthesis with MuJoCo, using Predictive Control
GUI/CLI tool for downloading Xiaohongshu
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Agent framework and applications built upon Qwen>=3.0
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Generate audiobooks from e-books, voice cloning & 1107+ languages
Enable AI to control your desktop, mobile and HMI devices
The AI toolkit for the AI developer
A single Gradio + React WebUI with extensions for ACE-Step
Meta Agents Research Environments is a comprehensive platform
StreamSpeech is a seamless model for offline speech recognition