The most powerful local music generation model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
State of the art LLM and coding model
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Kimi K2 is the large language model series developed by Moonshot AI
Code for running inference and finetuning with SAM 3 model
A theoretical reconstruction of the Claude Mythos architecture
Advanced language and coding AI model
Official inference repo for FLUX.2 models
Flux 2 image generation model pure C inference
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Qwen3-omni is a natively end-to-end, omni-modal LLM
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Unified Multimodal Understanding and Generation Models
Fast, Sharp & Reliable Agentic Intelligence
Renderer for the harmony response format to be used with gpt-oss
Towards Real-World Vision-Language Understanding
This repository contains the official implementation of FastVLM
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Foundation model for image generation
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Video understanding codebase from FAIR for reproducing video models
4M: Massively Multimodal Masked Modeling