GUI for a Vocal Remover that uses Deep Neural Networks
The most powerful and modular diffusion model GUI, api and backend
Real-World Centric Foundation GUI Agents
A python library that makes AMR parsing, generation and visualization
GUI Exploration Lab. One of the best GUI agent solutions
A state-of-the-art open visual language model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Image polygonal annotation with Python
Agent framework and applications built upon Qwen>=3.0
Witness the aha moment of VLM with less than $3
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Framework and no-code GUI for fine-tuning LLMs
Convert AI papers to GUI
An open sourced end-to-end VLM-based GUI Agent
UI-TARS-desktop version that can operate on your local personal device
Generate audiobooks from e-books, voice cloning & 1107+ languages
The AI toolkit for the AI developer
Generate audiobooks from e-books
A single Gradio + React WebUI with extensions for ACE-Step
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
AI-powered tool for developers, simplifying coding tasks
GUI/CLI tool for downloading Xiaohongshu
Enable AI to control your desktop, mobile and HMI devices
A simple screen parsing tool towards pure vision based GUI agent
Example client of oagi-python developed with Tauri