Image inpainting tool powered by SOTA AI Model
GUI/CLI tool for downloading Xiaohongshu
Advanced language and coding AI model
GUI for a Vocal Remover that uses Deep Neural Networks
GUI Exploration Lab. One of the best GUI agent solutions
The most powerful and modular diffusion model GUI, api and backend
Fast stable diffusion on CPU and AI PC
A state-of-the-art open visual language model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Real-World Centric Foundation GUI Agents
UI-TARS-desktop version that can operate on your local personal device
A python library that makes AMR parsing, generation and visualization
An open sourced end-to-end VLM-based GUI Agent
Framework and no-code GUI for fine-tuning LLMs
Browser userscript that enhances ChatGPT reliability and usability
Spark-TTS Inference Code
Convert AI papers to GUI
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Agent framework and applications built upon Qwen>=3.0
Witness the aha moment of VLM with less than $3
Stable Diffusion web UI
Generate audiobooks from e-books
Image polygonal annotation with Python
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Generate audiobooks from e-books, voice cloning & 1107+ languages