Structure-from-Motion and Multi-View Stereo
Self-hosted AI coding assistant
Sample code and notebooks for Generative AI on Google Cloud
Real-World Centric Foundation GUI Agents
Contexts Optical Compression
Run Local LLMs on Any Device. Open-source
Next Generation AI One-Stop Internationalization Solution
Lightning-fast, on-device TTS, running natively via ONNX
Java enterprise application development framework
Synchronized Translation for Videos
Safety reasoning models built-upon gpt-oss
MCP Aggregator, Orchestrator, Middleware, Gateway in one docker
Open-source, high-performance AI model with advanced reasoning
RGBD video generation model conditioned on camera input
Use Microsoft Edge's online text-to-speech service from Python
Open source platform for the machine learning lifecycle
Context data platform for building observable, self-learning AI agents
In-App assistant SDK to build a multimodal conversational UX websites
Claude Code action for GitHub PRs
A solution to build and deploy MCP agents and applications
Official implementation of DreamCraft3D
Workflow and speech recognition app
Python library and CLI tool to interface with Google Translate
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
NVIDIA Federated Learning Application Runtime Environment