DeepSeek Coder: Let the Code Write Itself
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Image generation model with single-stream diffusion transformer
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
New set of lightweight state-of-the-art, open foundation models
Production-tested AI infrastructure tools
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer