Generate audiobooks from e-books
Automate native Android apps with AI using accessibility APIs
GUI Exploration Lab. One of the best GUI agent solutions
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
End-to-end pipeline converting generative videos
Deploy and share agents with open infrastructure
Code to accompany "A Method for Animating Children's Drawings"
Agent S: an open agentic framework that uses computers like a human
Focus on prompting and generating
A Simple and Universal Swarm Intelligence Engine
One-click deployment (including offline integration package)
Generate short videos with one click using AI LLM
Python tool for browser-based interactive data apps in one file
Document Image Parsing via Heterogeneous Anchor Prompting”
Build multi-modal Agents with memory, knowledge, tools and reasoning
A reactive notebook for Python
Synthetic data generators for tabular and time-series data
Offline inference engine for art, real-time voice conversations
Multilingual speech recognition and audio understanding model
InvokeAI is a leading creative engine for Stable Diffusion models
Custom Chinese chatbot with Seq2Seq, GPT, and agent features
Industry leading face manipulation platform
TTS with kokoro and onnx runtime
Time-lapse Video Generation Models as Metamorphic Simulators