Visual Causal Flow
Native and Compact Structured Latents for 3D Generation
A multimodal model for brain response prediction
State-of-the-art LLM and coding model
Industrial-level controllable zero-shot text-to-speech system
Pokee Deep Research Model Open Source Repo
Large Multimodal Models for Video Understanding and Editing
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Official implementation of Watermark Anything with Localized Messages
General-purpose image editing model that delivers high-fidelity
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Self-evolving AI model for agents, coding, and complex workflows
High-precision 14B multimodal model built for advanced reasoning tasks
Stable fine-tuned Gemma model for structured, clear responses
Frontier-scale 675B multimodal instruct MoE model for enterprise AI