GUI for a Vocal Remover that uses Deep Neural Networks
Real-World Centric Foundation GUI Agents
The most powerful and modular diffusion model GUI, api and backend
GUI Exploration Lab. One of the best GUI agent solutions
A state-of-the-art open visual language model
Agent framework and applications built upon Qwen>=3.0
A single Gradio + React WebUI with extensions for ACE-Step
Framework and no-code GUI for fine-tuning LLMs
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Convert AI papers to GUI
Generate audiobooks from e-books
UI-TARS-desktop version that can operate on your local personal device
Witness the aha moment of VLM with less than $3
Make your own story. User-friendly software for LLM roleplaying
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Generate audiobooks from e-books, voice cloning & 1107+ languages
Image polygonal annotation with Python
GUI/CLI tool for downloading Xiaohongshu
MCP Aggregator, Orchestrator, Middleware, Gateway in one docker
Shrimp Task Manager is a task tool built for AI Agents
Local Groq Desktop chat app with MCP support
The AI toolkit for the AI developer
Enable AI to control your desktop, mobile and HMI devices
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning