Helps data scientists define testable self-documenting dataflows
A single-file tkinter-based Ollama GUI project
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Real-World Centric Foundation GUI Agents
An open sourced end-to-end VLM-based GUI Agent
A state-of-the-art open visual language model
Framework and no-code GUI for fine-tuning LLMs
Agent framework and applications built upon Qwen>=3.0
UI-TARS-desktop version that can operate on your local personal device
Generate audiobooks from e-books, voice cloning & 1107+ languages
GUI for a Vocal Remover that uses Deep Neural Networks
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
The most powerful and modular diffusion model GUI, api and backend
GUI Exploration Lab. One of the best GUI agent solutions
Robust Speech Recognition via Large-Scale Weak Supervision
Personal AI, On Personal Devices
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
The official Python library for the OpenAI API
Fast stable diffusion on CPU and AI PC
A high-throughput and memory-efficient inference and serving engine
Image inpainting tool powered by SOTA AI Model
Awesome multilingual OCR toolkits based on PaddlePaddle
AI tool that removes hardcoded subtitles and text from videos locally
AI Fully Automated Short Video Engine
Agent S: an open agentic framework that uses computers like a human