Mooncake is the serving platform for Kimi
oneAPI Deep Neural Network Library (oneDNN)
Serving system for machine learning models
High-Resolution Image Synthesis with Latent Diffusion Models
Fast ML inference & training for ONNX models in Rust
Recipes to train reward model for RLHF
A simple, performant and scalable Jax LLM
Metal programming in Julia
A massively parallel, high-level programming language
Interactive data visualizations and plotting in Julia
AI agents running research on single-GPU nanochat training
TensorRT LLM provides users with an easy-to-use Python API
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
Khronos Vulkan, OpenGL, and OpenGL ES Conformance Tests
Generate music based on natural language prompts using LLMs
Official inference framework for 1-bit LLMs
MiniMax-M2, a model built for Max coding & agentic workflows
NeurIPS2025 Spotlight] Quantized Attention
Unified Model Serving Framework
FFmpeg implements video cropping, watermarking, transcoding
PyTorch implementation of JiT
Open Source Wealth Management Software. Angular + NestJS + Prisma
Composable transformations of Python+NumPy programs
TT-NN operator library, and TT-Metalium low level kernel programming