Production-tested AI infrastructure tools
Renderer for the harmony response format to be used with gpt-oss
MiniMax-M2, a model built for Max coding & agentic workflows
Port of Facebook's LLaMA model in C/C++
From Vibe Coding to Agentic Engineering
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Official Python inference and LoRA trainer package
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Agentic, Reasoning, and Coding (ARC) foundation models
Code for running inference and finetuning with SAM 3 model
Official inference repo for FLUX.2 models
Advanced language and coding AI model
Python inference and LoRA trainer package for the LTX-2 audio–video
Kimi K2 is the large language model series developed by Moonshot AI
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Qwen3-TTS is an open-source series of TTS models
Open-source multi-speaker long-form text-to-speech model
An experimental version of DeepSeek model
Image generation model with single-stream diffusion transformer
Official inference repo for FLUX.1 models
Accurate × Fast × Comprehensive
Reference PyTorch implementation and models for DINOv3
A Family of Open Sourced Music Foundation Models