Low-code app builder for RAG and multi-agent AI applications
GUI for a Vocal Remover that uses Deep Neural Networks
The most powerful and modular diffusion model GUI, api and backend
A state-of-the-art open visual language model
Powerful tool that lets you create and run intelligent agents
No-code multi-agent framework to build LLM Agents, workflows
Image polygonal annotation with Python
Witness the aha moment of VLM with less than $3
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Agent framework and applications built upon Qwen>=3.0
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Framework and no-code GUI for fine-tuning LLMs
The AI toolkit for the AI developer
GUI/CLI tool for downloading Xiaohongshu
Convert AI papers to GUI
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Generate audiobooks from e-books
Meta Agents Research Environments is a comprehensive platform
Generate audiobooks from e-books, voice cloning & 1107+ languages
StreamSpeech is a seamless model for offline speech recognition
Enable AI to control your desktop, mobile and HMI devices
A single Gradio + React WebUI with extensions for ACE-Step
AI-powered tool for developers, simplifying coding tasks
A graphical manager for ollama that can manage your LLMs