Agentic, Reasoning, and Coding (ARC) foundation models
Visual Causal Flow
Code for running inference and finetuning with SAM 3 model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Designed for text embedding and ranking tasks
Advanced language and coding AI model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The official repo of Qwen chat & pretrained large language model
Capable of understanding text, audio, vision, video
An experimental version of DeepSeek model
An Efficient Agentic Model for Computer Use
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Open-source image generative foundation model
The official PyTorch implementation of Google's Gemma models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Block Diffusion for Ultra-Fast Speculative Decoding
ICLR2024 Spotlight: curation/training code, metadata, distribution
Large-language-model & vision-language-model based on Linear Attention
Qwen-Image is a powerful image generation foundation model
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
DeepMind model for tracking arbitrary points across videos & robotics
A SOTA open-source image editing model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
OCR expert VLM powered by Hunyuan's native multimodal architecture