DeepMind model for tracking arbitrary points across videos & robotics
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Integrate, train and manage any AI models and APIs with your database
Achieving 3×+ generation speedup on reasoning tasks
Learn how to develop, deploy and iterate on production-grade ML
Hypernetworks that adapt LLMs for specific benchmark tasks
A Survey of Large Language Models
100–200× Acceleration for Video Diffusion Models
A Foundation Model for Generalist Gaming Agents
The official PyTorch implementation of Google's Gemma models
Photorealistic Synthetic Dataset for Holistic Indoor Scene
An Open Source package that allows video game creators
Machine Learning Pipelines for Kubeflow
The easiest way to use deep metric learning in your application
GLM-4 series: Open Multilingual Multimodal Chat LMs
Towards Human-Sounding Speech
A unified framework for scalable computing
Learn to build your Second Brain AI assistant with LLMs
Official Implementation for "Recursive Multi-Agent Systems"
Master the fundamentals of machine learning, deep learning
Test-Time Reinforcement Learning
MiroThinker is an open-source deep research agent
Bridging LLM and Recommender System
Motion-controllable Video Generation via Latent Trajectory Guidance
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git