Agentic, Reasoning, and Coding (ARC) foundation models
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Visual Causal Flow
Code for running inference and finetuning with SAM 3 model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Advanced language and coding AI model
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
An experimental version of DeepSeek model
The official repo of Qwen chat & pretrained large language model
Capable of understanding text, audio, vision, video
An Efficient Agentic Model for Computer Use
Open-source image generative foundation model
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
The official PyTorch implementation of Google's Gemma models
MiniMax M2.1, a SOTA model for real-world dev & agents.
Qwen-Image is a powerful image generation foundation model
Block Diffusion for Ultra-Fast Speculative Decoding
A Family of Open Foundation Models for Code Intelligence
ICLR2024 Spotlight: curation/training code, metadata, distribution
MiniMax-M2, a model built for Max coding & agentic workflows
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
DeepMind model for tracking arbitrary points across videos & robotics