Multi-user UI for managing and running Stable Diffusion workflows tool
A simple, high-quality voice conversion tool focused on ease of use
OCR software, free and offline
Sunfish: a Python Chess Engine in 111 lines of code
Modular AI image and video generation web UI with extensible tools
The Clay Foundation Model - An open source AI model and interface
A single-file tkinter-based Ollama GUI project
Open-Sora: Democratizing Efficient Video Production for All
A research prototype of a human-centered web agent
A simple screen parsing tool towards pure vision based GUI agent
Evaluation and Tracking for LLM Experiments
Collect, organize, use, and share, all in OmniBox
A text-to-speech, speech-to-text and speech-to-speech library
Gen-AI Chat for Teams
Python Stream Processing
Image polygonal annotation with Python
Implementation of Recurrent Interface Network (RIN)
A simple native web interface that uses ChatTTS to synthesize text
Google Flights MCP and Python Library
AI-powered Jupyter spreadsheet that converts workflows into Python
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Browser userscript that enhances ChatGPT reliability and usability
Python library and CLI tool to interface with Google Translate
Unified terminal AI tool for exploring and editing codebases
A Model Context Protocol (MCP) server that enables secure interaction