FlashMLA: Efficient Multi-head Latent Attention Kernels
An experimental version of the DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Advanced language and coding AI model
Port of Facebook's LLaMA model in C/C++
Agentic, Reasoning, and Coding (ARC) foundation models
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Image generation model with single-stream diffusion transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Code for running inference and finetuning with the SAM 3 model
Open-source, high-performance AI model with advanced reasoning
RGBD video generation model conditioned on camera input
Powerful AI language model (MoE) optimized for efficiency/performance
Official inference repo for FLUX.2 models
Python inference and LoRA trainer package for the LTX-2 audio–video
Multimodal model achieving SOTA performance
From Images to High-Fidelity 3D Assets
Kimi K2 is the large language model series developed by Moonshot AI
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Models for object and human mesh reconstruction
A Systematic Framework for Interactive World Modeling
Qwen3-TTS is an open-source series of TTS models
A Family of Open Sourced Music Foundation Models
MiniMax M2.1, a SOTA model for real-world dev & agents
Reference PyTorch implementation and models for DINOv3