Make websites accessible for AI agents
Automate native Android apps with AI using accessibility APIs
Browse the web, directly from Cursor etc.
An MCP server that autonomously evaluates web applications
Opensource browser using agents
Open-source, code-first Python toolkit for building, evaluating, etc.
An AI personal assistant for your digital brain
An open sourced end-to-end VLM-based GUI Agent
Use Microsoft Edge's online text-to-speech service from Python
Enable AI to control your desktop, mobile and HMI devices
A sound cloning tool with a web interface, using your voice
A simple native web interface that uses ChatTTS to synthesize text
Build Vision Agents quickly with any model or video provider
Tools like web browser, computer access and code runner for LLMs
On-device Speech-to-Intent engine powered by deep learning
Open-source MCP server that gives your coding agent
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Speech-AI-Forge is a project developed around TTS generation model
MCP integration platforms for AI agents to use tools at any scale
Open Source Computer Vision Library
Run GGUF models easily with a UI or API. One File. Zero Install.
Ainee - AI Notetaking and Learning Companion
Leading free and open-source liveliness check &face recognition system
StudioOllamaUI is a local, portable interface for Ollama