A single-file tkinter-based Ollama GUI project
The Python code to reproduce illustrations from Machine Learning Book
The Python Code Tutorials
GUI for a Vocal Remover that uses Deep Neural Networks
The most powerful and modular diffusion model GUI, api and backend
Python scraper based on AI
A python library that makes AMR parsing, generation and visualization
Fast stable diffusion on CPU and AI PC
Browser userscript that enhances ChatGPT reliability and usability
GUI Exploration Lab. One of the best GUI agent solutions
UI-TARS-desktop version that can operate on your local personal device
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A state-of-the-art open visual language model
Image polygonal annotation with Python
Real-World Centric Foundation GUI Agents
Witness the aha moment of VLM with less than $3
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Generate audiobooks from e-books
StreamSpeech is a seamless model for offline speech recognition
Elyra extends JupyterLab with an AI centric approach
Framework and no-code GUI for fine-tuning LLMs
Clone a voice in 5 seconds to generate arbitrary speech in real-time
An open sourced end-to-end VLM-based GUI Agent
GUI/CLI tool for downloading Xiaohongshu
Run Stable Diffusion on Mac natively