Claude Code action for GitHub PRs
Reference PyTorch implementation and models for DINOv3
New set of lightweight state-of-the-art, open foundation models
Official inference repo for FLUX.2 models
A Systematic Framework for Interactive World Modeling
Industrial-level controllable zero-shot text-to-speech system
PyTorch implementation of JiT
Inference script for Oasis 500M
Foundation model for image generation
MiniMax M2.1, a SOTA model for real-world dev & agents
Open-source deep-learning framework
Scaling Reinforcement Learning with LLMs
Multimodal embedding and reranking models built on Qwen3-VL
Video understanding codebase from FAIR for reproducing video models
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Continuous Autonomy for the AI SDK
Foundational Models for State-of-the-Art Speech and Text Translation
Research code artifacts for Code World Model (CWM)
RGBD video generation model conditioned on camera input
New family of code large language models (LLMs)
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open language model developed by NVIDIA as part of Nemotron-3 family