GUI for a Vocal Remover that uses Deep Neural Networks
The most powerful and modular diffusion model GUI, API, and backend
GUI Exploration Lab. One of the best GUI agent solutions
Fast stable diffusion on CPU and AI PC
A state-of-the-art open visual language model
Real-World Centric Foundation GUI Agents
UI-TARS-desktop version that can operate on your local personal device
A GUI tool for extracting hard-coded subtitles (hardsub) from videos
Agent framework and applications built upon Qwen>=3.0
Convert AI papers to GUI
GUI/CLI tool for downloading Xiaohongshu
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Witness the aha moment of VLM with less than $3
Generate audiobooks from e-books
Framework and no-code GUI for fine-tuning LLMs
A Python library for AMR parsing, generation, and visualization
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Image polygonal annotation with Python
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An open-source end-to-end VLM-based GUI agent
Generate audiobooks from e-books, voice cloning & 1107+ languages
Real-time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Hands-on Python tutorial with 50+ Python applications