Opensource browser using agents
Fast stable diffusion on CPU and AI PC
InvokeAI is a leading creative engine for Stable Diffusion models
Synchronized Translation for Videos
A sound cloning tool with a web interface, using your voice
AI-powered video clipping and highlight generation
A simple native web interface that uses ChatTTS to synthesize text
1 min voice data can also be used to train a good TTS model
Reverse-engineered Python API for Google Gemini web app
Python scraper based on AI
Advanced language and coding AI model
Generate short videos with one click using AI LLM
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Generating Immersive, Explorable, and Interactive 3D Worlds
Speech-AI-Forge is a project developed around TTS generation model
🐈 nanobot: The Ultra-Lightweight Clawdbot / OpenClaw
Time-lapse Video Generation Models as Metamorphic Simulators
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Spark-TTS Inference Code
A high-quality rapid TTS voice cloning model
Private chat with local GPT with document, images, video, etc.
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
LLM based autonomous agent that does online comprehensive research
Tongyi Deep Research, the Leading Open-source Deep Research Agent
One-click deployment (including offline integration package)