Open-source MCP server that gives your coding agent
A nearly-live implementation of OpenAI's Whisper
Use Microsoft Edge's online text-to-speech service from Python
UI-TARS-desktop version that can operate on your local personal device
Your Personal AI Assistant; easy to install, deploy on local or coud
Tools like web browser, computer access and code runner for LLMs
Qwen3-Coder is the code version of Qwen3
Speech-AI-Forge is a project developed around TTS generation model
Fast-stable-diffusion + DreamBooth
Context-aware desktop AI assistant that understands screen content
Automate native Android apps with AI using accessibility APIs
Linkedin Automation Tool
The most reliable AI agent framework that supports MCP
A simple native web interface that uses ChatTTS to synthesize text
AI tool for real-time monitoring and analysis of Goofish listings
Gracefully face hCaptcha challenge with multimodal llms
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
AI tool converting video/audio into structured documents instantly
Stable Diffusion web UI
Use Claude Code's agent loop with DeepSeek V4 Pro, OpenRouter & more
A fast TTS architecture with conditional flow matching
Python SDK for the Computer Use model Lux, developed by OpenAGI
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Open Source Computer Vision Library
Visual localization made easy with hloc