Plugin for JADX to integrate MCP server
A single-file tkinter-based Ollama GUI project
GUI for a Vocal Remover that uses Deep Neural Networks
The most powerful and modular diffusion model GUI, api and backend
GUI Exploration Lab. One of the best GUI agent solutions
A state-of-the-art open visual language model
Fast stable diffusion on CPU and AI PC
A GUI Agent app based on UI-TARS to control your computer using AI
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Testing tool for modeling GUI transitions
Agent framework and applications built upon Qwen>=3.0
Make your own story. User-friendly software for LLM roleplaying
Real-World Centric Foundation GUI Agents
Generate audiobooks from e-books
UI-TARS-desktop version that can operate on your local personal device
A python library that makes AMR parsing, generation and visualization
Framework and no-code GUI for fine-tuning LLMs
A native desktop GUI for Claude Code
GUI/CLI tool for downloading Xiaohongshu
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A single Gradio + React WebUI with extensions for ACE-Step
Witness the aha moment of VLM with less than $3
Image polygonal annotation with Python
An open sourced end-to-end VLM-based GUI Agent