New family of code large language models (LLMs)
Controllable & emotion-expressive zero-shot TTS
The common language for platforms, agents and businesses
Real-World Centric Foundation GUI Agents
Context data platform for building observable, self-learning AI agents
Democratizing Reinforcement Learning for LLMs
Generate blog articles from video or audio
When LLM Meets Domain Experts
Open-sourced unified customization model
Reproduction of Poetiq's record-breaking submission to ARC-AGI-1
Controllable and fast Text-to-Speech for over 7000 languages
A text-to-speech, speech-to-text and speech-to-speech library
A TTS model capable of generating ultra-realistic dialogue
Collections of robotics environments
Pokee Deep Research Model Open Source Repo
Unified Multimodal Understanding and Generation Models
Python examples of popular machine learning algorithms
Volcano Engine Reinforcement Learning for LLMs
An alignment auditing agent capable of exploring alignment hypotheses
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
Renderer for the harmony response format to be used with gpt-oss
A Powerful Native Multimodal Model for Image Generation
Designed for text embedding and ranking tasks