The common language for platforms, agents and businesses.
Real-World Centric Foundation GUI Agents
Context data platform for building observable, self-learning AI agents
Democratizing Reinforcement Learning for LLMs
Generate blog articles from video or audio
Provider-agnostic, open-source evaluation infrastructure
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Pokee Deep Research Model Open Source Repo
Unified Multimodal Understanding and Generation Models
Volcano Engine Reinforcement Learning for LLMs
AI discovers 520000 stable inorganic crystal structures for research
DeepMind model for tracking arbitrary points across videos & robotics
Global weather forecasting model using graph neural networks and JAX
An alignment auditing agent capable of exploring alignment hypothesis
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
Renderer for the harmony response format to be used with gpt-oss
A Powerful Native Multimodal Model for Image Generation