DeepSeek Coder: Let the Code Write Itself
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Image generation model with single-stream diffusion transformer
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Ling is a MoE LLM provided and open-sourced by InclusionAI
New set of lightweight state-of-the-art, open foundation models
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Production-tested AI infrastructure tools
Fine-tuning ChatGLM-6B with PEFT
llama.go is like llama.cpp in pure Golang
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Large-scale autoregressive pixel model for image generation by OpenAI