DeepMind model for tracking arbitrary points across videos & robotics
Enable AI to control your desktop, mobile and HMI devices
Outcome-driven agent development framework that evolves
A fast TTS architecture with conditional flow matching
A text-to-speech, speech-to-text and speech-to-speech library
Plug-and-play library to enable agents to call MCP and UTCP tools
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A React-based starter app for using the Live API over WebSockets
Datasets, transforms and models specific to Computer Vision
LiteRT-LM is Google's production-ready inference framework
AI generative media user experience highlighting use of APIs
Makes coding agents get smarter with every task
A multi-platform desktop application to evaluate and compare LLM
Chat with multiple PDFs locally
Persistent vector memory for AI assistants
Garry's Opinionated OpenClaw/Hermes Agent Brain
Ultralytics YOLO
MCP server for interfacing with Godot game engine
A general, third-party dependency-free, cross-platform
Convert files and web content into clean, usable Markdown easily
Agent framework for the JVM. Pronounced Em-BAY-bel
Cosmos-RL is a flexible and scalable Reinforcement Learning framework
C++ behavior tree library for robotics and AI decision systems
High-performance inference server and API layer for text embedding models
Open Source Speech Language Model