GUI for a Vocal Remover that uses Deep Neural Networks
Real-World Centric Foundation GUI Agents
The most powerful and modular diffusion model GUI, API and backend
GUI Exploration Lab. One of the best GUI agent solutions
ChatMCP is an AI chat client implementing the Model Context Protocol
A state-of-the-art open visual language model
Framework and no-code GUI for fine-tuning LLMs
A single Gradio + React WebUI with extensions for ACE-Step
Convert AI papers to GUI
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model
Agent framework and applications built upon Qwen>=3.0
Make your own story. User-friendly software for LLM roleplaying
GUI/CLI tool for downloading Xiaohongshu
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Witness the aha moment of VLM with less than $3
UI-TARS-desktop version that can operate on your local personal device
Generate audiobooks from e-books
MCP Aggregator, Orchestrator, Middleware, Gateway in one Docker container
Generate audiobooks from e-books, voice cloning & 1107+ languages
Image polygonal annotation with Python
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Local Groq Desktop chat app with MCP support
Shrimp Task Manager is a task tool built for AI Agents
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Enable AI to control your desktop, mobile and HMI devices