The Cradle framework is a first attempt at General Computer Control
A system for agentic LLM-powered data processing and ETL
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Get started with building Fullstack Agents using Gemini 2.5 & LangGraph
Open-source, local-first memory for any tool-capable LLM agent
Open-source framework for intelligent speech interaction
Persistent AI memory using local Markdown knowledge graphs
Audio foundation model excelling in audio understanding
Agents write Python code to call tools and orchestrate other agents
Project Lyra: Open Generative 3D World Models
Low-latency AI inference engine optimized for mobile devices
HivisionIDPhotos: a lightweight and efficient AI ID photo tool
AI Powered Knowledge Graph Generator
The SOTA Open-Source Browser Agent
Private chat with local GPT with documents, images, video, etc.
AI multi-agent platform for automated code security auditing
Open source demo platform where you can easily showcase your AI models
Build your own Cowork, AI Scientist and other SoTA Agents
Context management for Claude Code. Hooks maintain state via ledgers
Controllable & emotion-expressive zero-shot TTS
When LLM Meets Domain Experts
Official PyTorch Implementation
Controllable and fast Text-to-Speech for over 7000 languages
Towards Human-Sounding Speech
Bridging Reasoning and Action Prediction