Gemma open-weight LLM library, from Google DeepMind
Strong, Economical, and Efficient Mixture-of-Experts Language Model
Instant voice cloning by MIT and MyShell. Audio foundation model
Official inference repo for FLUX.2 models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Open-weight, large-scale hybrid-attention reasoning model
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
Official inference repo for FLUX.1 models
Text and image to video generation: CogVideoX and CogVideo
Safety reasoning models built-upon gpt-oss
MiniMax-M2, a model built for Max coding & agentic workflows
Local Groq Desktop chat app with MCP support
Official inference library for Mistral models
Drop-in replacement for standard residual connections in Transformers
Local AI chat + coding agent for Apple Silicon, powered by Gemma 4
Bidirectional token-classification model for identifiable info
A light-weight and powerful meta-prompting, context engineering
Track food, fitness, water, and health
Bolt is a deep learning library with high performance
BitNet: Scaling 1-bit Transformers for Large Language Models
Alibaba's high-performance LLM inference engine for diverse apps
On the Structural Pruning of Large Language Models
UCCL is an efficient communication library for GPUs
Implementation for MatMul-free LM
Renderer for the harmony response format to be used with gpt-oss