Strong, Economical, and Efficient Mixture-of-Experts Language Model
Official inference repo for FLUX.2 models
Open-weight, large-scale hybrid-attention reasoning model
Official inference repo for FLUX.1 models
Safety reasoning models built-upon gpt-oss
MiniMax-M2, a model built for Max coding & agentic workflows
Track food, fitness, water, and health
Official inference library for Mistral models
Drop-in replacement for standard residual connections in Transformers
Local AI chat + coding agent for Apple Silicon, powered by Gemma 4
Bidirectional token-classification model for identifiable info
BitNet: Scaling 1-bit Transformers for Large Language Models
Alibaba's high-performance LLM inference engine for diverse apps
On the Structural Pruning of Large Language Models
UCCL is an efficient communication library for GPUs
Implementation for MatMul-free LM
Renderer for the harmony response format to be used with gpt-oss
Towards Real-World Vision-Language Understanding
Scientific Visualisation Made Easy
DeepSeek LLM: Let there be answers
Point cloud diffusion for 3D model synthesis
DE-based Weight Optimisation for Heterogeneous Ensemble
A fast implementation of LeCun's convolutional neural network