Minimal CLI coding agent by Mistral
Reverse-engineered Python API for Google Gemini web app
The Agent-User Interaction Protocol
Multi-user UI for managing and running Stable Diffusion workflows tool
⚡ Building applications with LLMs through composability ⚡
Effortless data labeling with AI support from Segment Anything
A Web UI for easy subtitle using whisper model
LangChain powered shell command generator and runner CLI
Chat with multiple PDFs locally
AI-powered document analysis and tagging for Paperless-ngx
A unified framework for machine learning with time series
A simple screen parsing tool towards pure vision based GUI agent
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
A simple native web interface that uses ChatTTS to synthesize text
Query MCP enables end-to-end management of Supabase via chat interface
InvokeAI is a leading creative engine for Stable Diffusion models
Weaving the Digital Agent Galaxy
Generate audiobooks from e-books
A text-to-speech, speech-to-text and speech-to-speech library
Python Stream Processing
Google Gen AI Python SDK provides an interface for developers
Interface for OuteTTS models
The Clay Foundation Model - An open source AI model and interface
Give your AI agent eyes to see the entire internet
All-in-one native macOS AI chat application