Agentic, Reasoning, and Coding (ARC) foundation models
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Visual Causal Flow
Code for running inference and finetuning with SAM 3 model
Easy Docker setup for Stable Diffusion with user-friendly UI
Clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
A Customizable Image-to-Video Model based on HunyuanVideo
Advanced language and coding AI model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
An experimental version of DeepSeek model
MiniMax M2.1, a SOTA model for real-world dev & agents.
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The official PyTorch implementation of Google's Gemma models
A Family of Open Foundation Models for Code Intelligence
Block Diffusion for Ultra-Fast Speculative Decoding
Large-language-model & vision-language-model based on Linear Attention
ICLR2024 Spotlight: curation/training code, metadata, distribution
MiniMax-M2, a model built for Max coding & agentic workflows
RGBD video generation model conditioned on camera input
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Open-weight, large-scale hybrid-attention reasoning model
Towards Real-World Vision-Language Understanding
Open Multilingual Multimodal Chat LMs