Agentic, Reasoning, and Coding (ARC) foundation models
Code for running inference and finetuning with SAM 3 model
Visual Causal Flow
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Designed for text embedding and ranking tasks
Capable of understanding text, audio, vision, video
Advanced language and coding AI model
An experimental version of DeepSeek model
The official repo of Qwen chat & pretrained large language model
RGBD video generation model conditioned on camera input
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The official PyTorch implementation of Google's Gemma models
An Efficient Agentic Model for Computer Use
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Qwen-Image is a powerful image generation foundation model
Block Diffusion for Ultra-Fast Speculative Decoding
Towards Real-World Vision-Language Understanding
ICLR2024 Spotlight: curation/training code, metadata, distribution
Tongyi Deep Research, the Leading Open-source Deep Research Agent
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Dataset of GPT-2 outputs for research in detection, biases, and more
Open Multilingual Multimodal Chat LMs