Claude Code action for GitHub PRs
New set of lightweight state-of-the-art, open foundation models
Official inference repo for FLUX.2 models
Reference PyTorch implementation and models for DINOv3
A Systematic Framework for Interactive World Modeling
Industrial-level controllable zero-shot text-to-speech system
Inference script for Oasis 500M
PyTorch implementation of JiT
Foundation model for image generation
MiniMax M2.1, a SOTA model for real-world dev & agents.
Open-source deep-learning framework
Scaling Reinforcement Learning with LLMs
Multimodal embedding and reranking models built on Qwen3-VL
Video understanding codebase from FAIR for reproducing video models
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Continuous Autonomy for the AI SDK
Foundational Models for State-of-the-Art Speech and Text Translation
Research code artifacts for Code World Model (CWM)
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
RGBD video generation model conditioned on camera input
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open language model developed by NVIDIA as part of Nemotron-3 family