Real-time voice interactive digital human
AI generative media user experience highlighting use of APIs
Framework for building neural networks
Real-World Centric Foundation GUI Agents
Scalable data pre processing and curation toolkit for LLMs
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Offline inference engine for art, real-time voice conversations
Stanford NLP Python library for many human languages
Model Context Protocol Server for Apache OpenDAL™
A Unified Framework for Text-to-3D and Image-to-3D Generation
The Agent-User Interaction Protocol
Public opinion analysis system
A self-hosted open source photo management service
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Open source multi-agent RAG over a knowledge graph
Lightweight demo to build a conversational AI search engine quickly
Low-latency AI inference engine optimized for mobile devices
Create prompt-friendly codebase digests from any Git repository URL
Extension of Google Research’s PaperBanana
Data Infrastructure providing an approach to multimodal AI workloads
Build multimodal language agents for fast prototype and production
95% token savings. 155x faster queries. 16 languages
The open-source data curation platform for LLMs
Framework for building AI-powered interactive digital humans and agent
The Clay Foundation Model - An open source AI model and interface