A single-file tkinter-based Ollama GUI project
A single Gradio + React WebUI with extensions for ACE-Step
A simple, high-quality voice conversion tool focused on ease of use
C++ inference library for multiple SVC/TTS
Software that uses AI to perform real-time voice conversion
GUI for a Vocal Remover that uses Deep Neural Networks
GUI Exploration Lab. One of the best GUI agent solutions
The most powerful and modular diffusion model GUI, API and backend
Fast Stable Diffusion on CPU and AI PC
A GUI Agent app based on UI-TARS to control your computer using AI
A GUI tool for extracting hard-coded subtitles (hardsub) from videos
Real-World Centric Foundation GUI Agents
Make your own story. User-friendly software for LLM roleplaying
A state-of-the-art open visual language model
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
A Python library that makes AMR parsing, generation and visualization
Framework and no-code GUI for fine-tuning LLMs
Clone a voice in 5 seconds to generate arbitrary speech in real-time
An open-source end-to-end VLM-based GUI Agent
UI-TARS-desktop version that can operate on your local personal device
A native desktop GUI for Claude Code
Agent framework and applications built upon Qwen>=3.0
Generate audiobooks from e-books
Witness the aha moment of VLM with less than $3
Generate audiobooks from e-books, voice cloning & 1107+ languages