ChatGLM-6B: An Open Bilingual Dialogue Language Model
DeepSeek Coder: Let the Code Write Itself
New set of lightweight state-of-the-art, open foundation models
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Production-tested AI infrastructure tools
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"