Stable Diffusion web UI
Web interface for generating images using Stable Diffusion models
An MCP server that autonomously evaluates web applications
Create UIs for your machine learning model in Python in 3 minutes
InvokeAI is a leading creative engine for Stable Diffusion models
Reverse-engineered Python API for Google Gemini web app
A text-to-speech, speech-to-text and speech-to-speech library
A sound cloning tool with a web interface, using your voice
A simple native web interface that uses ChatTTS to synthesize speech from text
Synchronized Translation for Videos
Generate short videos with one click using an LLM
Low-level Python library used to interact with a Substra network
A research prototype of a human-centered web agent
SWE-agent takes a GitHub issue and tries to automatically fix it
ChatGPT interface with better UI
Adds support for Yandex Smart Home (Alice voice assistant)
Private chat with a local GPT over documents, images, video, etc.
A high-performance ML model serving framework that offers dynamic batching
Generate audiobooks from e-books, with voice cloning and support for 1107+ languages
Real-time voice interactive digital human
A neural network that transforms a design mock-up into static websites
Speech-AI-Forge is a project developed around TTS generation models
Deploy and share agents with open infrastructure
Get started with building Fullstack Agents using Gemini 2.5 & LangGraph
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training