The most powerful local music generation model
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A theoretical reconstruction of the Claude Mythos architecture
Advanced language and coding AI model
Code for running inference and finetuning with SAM 3 model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Official inference repo for FLUX.2 models
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Repo for SeedVR2 & SeedVR
Genome modeling and design across all domains of life
Unified Multimodal Understanding and Generation Models
Towards Real-World Vision-Language Understanding
Renderer for the harmony response format to be used with gpt-oss
A trainable PyTorch reproduction of AlphaFold 3
Foundation model for image generation
This repository contains the official implementation of FastVLM
A Production-ready Reinforcement Learning AI Agent Library
A PyTorch library for implementing flow matching algorithms
Open-source framework for intelligent speech interaction
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Video understanding codebase from FAIR for reproducing video models
Large language model and vision-language model based on linear attention
4M: Massively Multimodal Masked Modeling
Language modeling in a sentence representation space