New set of lightweight state-of-the-art, open foundation models
Qwen3-Coder is the code version of Qwen3
CodeGeeX2: A More Powerful Multilingual Code Generation Model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Tiny vision language model
A Family of Open Foundation Models for Code Intelligence
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
ICLR 2024 Spotlight: curation/training code, metadata, distribution
A SOTA open-source image editing model
CLIP: predict the most relevant text snippet given an image
MiniMax-M2, a model built for Max coding & agentic workflows
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Facebook AI Research Sequence-to-Sequence Toolkit
Vision-language-action model for robot control via images and text
High-efficiency reasoning and agentic intelligence model
JetBrains’ 4B-parameter code model for completions
Kimi K2: 1T-param MoE model for advanced coding and agentic reasoning