Unified Multimodal Understanding and Generation Models
Open-weight, large-scale hybrid-attention reasoning model
Tool for exploring and debugging transformer model behaviors
Fast and Universal 3D reconstruction model for versatile tasks
ICLR2024 Spotlight: curation/training code, metadata, distribution
A Conversational Speech Generation Model
800,000 step-level correctness labels on LLM solutions to MATH problem
Learning to Act by Watching Unlabeled Online Videos
Reproduces results of "Fixing the train-test resolution discrepancy"