Make websites accessible for AI agents
Browse the web, directly from Cursor etc.
A simple, high-quality voice conversion tool focused on ease of use
Open-source browser-using agents
An AI personal assistant for your digital brain
Automate browser-based workflows with LLMs and Computer Vision
A library to communicate with ChatGPT, Claude, Copilot, Gemini
No fortress, purely open ground. OpenManus is Coming
Agent framework and applications built upon Qwen>=3.0
An MCP server that autonomously evaluates web applications
Use Microsoft Edge's online text-to-speech service from Python
A sound cloning tool with a web interface, using your voice
The most reliable AI agent framework that supports MCP
Qwen3-Coder is the code version of Qwen3
Open-source MCP server that gives your coding agent
AIHawk aims to ease the job hunt process by automating job applications
Tools such as a web browser, computer access, and a code runner for LLMs
UI-TARS-desktop version that can operate on your local personal device
A nearly-live implementation of OpenAI's Whisper
Speech-AI-Forge is a project developed around TTS generation models
Get started with building full-stack agents using Gemini 2.5 and LangGraph
Library for OCR-related tasks powered by Deep Learning
Automate native Android apps with AI using accessibility APIs
A simple native web interface that uses ChatTTS to synthesize text into speech
Python SDK for the Computer Use model Lux, developed by OpenAGI