The most powerful local music generation model
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Wan2.1: Open and Advanced Large-Scale Video Generative Model
State of the art LLM and coding model
Kimi K2 is the large language model series developed by Moonshot AI
A theoretical reconstruction of the Claude Mythos architecture
Advanced language and coding AI model
Code for running inference and finetuning with SAM 3 model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Official inference repo for FLUX.2 models
Qwen3-omni is a natively end-to-end, omni-modal LLM
Flux 2 image generation model pure C inference
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Unified Multimodal Understanding and Generation Models
Renderer for the harmony response format to be used with gpt-oss
Fast, Sharp & Reliable Agentic Intelligence
Towards Real-World Vision-Language Understanding
Foundation model for image generation
This repository contains the official implementation of FastVLM
A PyTorch library for implementing flow matching algorithms
Clean and efficient FP8 GEMM kernels with fine-grained scaling
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Video understanding codebase from FAIR for reproducing video models
Large-language-model & vision-language-model based on Linear Attention