Port of Facebook's LLaMA model in C/C++
Image generation model with single-stream diffusion transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
MiniMax M2.1, a SOTA model for real-world dev & agents.
DeepSeek LLM: Let there be answers
FlashMLA: Efficient Multi-head Latent Attention Kernels
Production-tested AI infrastructure tools
Collection of Gemma 3 variants that are trained for performance
Dataset of GPT-2 outputs for research in detection, biases, and more
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Code for reproducing key results in the paper
Agentic 123B coding model optimized for large-scale engineering
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Text-to-image model optimized for artistic quality and safe generation
Lightweight 24B agentic coding model with vision and long context
685B model with improved agents and consistency
Russian ASR model fine-tuned on Common Voice and CSS10 datasets