Run Local LLMs on Any Device. Open-source
Access large language models from the command line
MiniMax M2.1, a SOTA model for real-world dev & agents.
Curated list of datasets and tools for post-training
Inference code for CodeLlama models
Open-weight, large-scale hybrid-attention reasoning model
Framework that is dedicated to making neural data processing
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Training and serving large-scale neural networks