A Systematic Framework for Interactive World Modeling
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Bridging Reasoning and Action Prediction
100–200× Acceleration for Video Diffusion Models
Learn it. Build it. Ship it for others
Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Open Source Speech Language Model
Shared repository for open-sourced projects from the Google AI Lang
The official implementation of RAPTOR
From nobody to big model (LLM) hero
Mastering Applied AI, One Concept at a Time
AI memory OS for LLM and Agent systems
Quick illustration of how one can easily read books together with LLMs
Multimodal embedding and reranking models built on Qwen3-VL
A collection of open-source skills for AI coding agents
Project-scoped Lean workflow orchestrator from Math, Inc.
AI agent microservice
Build production-ready AI agents in both Python and Typescript
AI-powered video generation skill for OpenClaw
Decomposable Multiscale Mixing for Time Series Forecasting
Quickly get started with AI theory and practical applications
Agent framework that enables tool-use agent tasks
Unified KV Cache Compression Methods for Auto-Regressive Models
Scalable RL solution for advanced reasoning of language models
A simple, performant and scalable Jax LLM