A simple native web interface that uses ChatTTS to synthesize text
Qwen3-Coder is the code version of Qwen3
Tools like web browser, computer access and code runner for LLMs
Local AI coding agent CLI with multi-agent orchestration tools
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Automate native Android apps with AI using accessibility APIs
A nearly-live implementation of OpenAI's Whisper
A sound cloning tool with a web interface, using your voice
Linkedin Automation Tool
Speech-AI-Forge is a project developed around TTS generation model
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
AI tool converting video/audio into structured documents instantly
Modular AI image and video generation web UI with extensible tools
Stable Diffusion web UI
AI tool for real-time monitoring and analysis of Goofish listings
Gracefully face hCaptcha challenge with multimodal llms
Fast-stable-diffusion + DreamBooth
Context-aware desktop AI assistant that understands screen content
A fast TTS architecture with conditional flow matching
Python SDK for the Computer Use model Lux, developed by OpenAGI
Open Source Computer Vision Library
Visual localization made easy with hloc
TensorFlow documentation
Generate photo-realistic textures based on source images