Diversity-driven optimization and large-model reasoning ability
Code for running inference with the SAM 3D Body Model 3DB
RGBD video generation model conditioned on camera input
Visual Causal Flow
Agentic, Reasoning, and Coding (ARC) foundation models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Models for object and human mesh reconstruction
Multimodal Diffusion with Representation Alignment
Open-source deep-learning framework
An Efficient Agentic Model for Computer Use
DeepSeek Coder: Let the Code Write Itself
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
The official repo of Qwen chat & pretrained large language model
Multimodal embedding and reranking models built on Qwen3-VL
Tool for exploring and debugging transformer model behaviors
Global weather forecasting model using graph neural networks and JAX
Easy Docker setup for Stable Diffusion with user-friendly UI
Large-language-model & vision-language-model based on Linear Attention
A SOTA open-source image editing model
Repo of Qwen2-Audio chat & pretrained large audio language model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
An AI-powered security review GitHub Action using Claude
Qwen3-TTS is an open-source series of TTS models