A Pragmatic VLA Foundation Model
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
DeepMind model for tracking arbitrary points across videos and in robotics
Qwen-Image is a powerful image generation foundation model
Pushing the Limits of Mathematical Reasoning in Open Language Models
Reproduction of Poetiq's record-breaking submission to ARC-AGI-1
Large Multimodal Models for Video Understanding and Editing
LLM-based audio editing model trained with reinforcement learning
Real-time behaviour synthesis with MuJoCo, using Predictive Control