Agentic, Reasoning, and Coding (ARC) foundation models
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Visual Causal Flow
Code for running inference and finetuning with the SAM 3 model
Advanced language and coding AI model
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
An experimental version of the DeepSeek model
Open-source image generative foundation model
The official PyTorch implementation of Google's Gemma models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
MiniMax M2.1, a SOTA model for real-world dev & agents
Block Diffusion for Ultra-Fast Speculative Decoding
Clean and efficient FP8 GEMM kernels with fine-grained scaling
A Family of Open Foundation Models for Code Intelligence
ICLR 2024 Spotlight: curation/training code, metadata, distribution
MiniMax-M2, a model built for Max coding & agentic workflows
RGBD video generation model conditioned on camera input
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Towards Real-World Vision-Language Understanding
Open Multilingual Multimodal Chat LMs
DeepSeek LLM: Let there be answers