From Vibe Coding to Agentic Engineering
GLM-5: From Vibe Coding to Agentic Engineering
Open-source, high-performance AI model with advanced reasoning
Advanced language and coding AI model
157 models, 30 providers, one command to find what runs on hardware
Powerful AI language model (MoE) optimized for efficiency/performance
LLM-enabled investment tracker that consolidates market performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A high-performance inference engine for AI models
Agentic, Reasoning, and Coding (ARC) foundation models
Open source LLM engineering platform: LLM Observability, metrics, etc.
TokenSpeed is a speed-of-light LLM inference engine
How to optimize some algorithm in cuda
Open-source large language model family from Tencent Hunyuan
Universal LLM Deployment Engine with ML Compilation
Kimi K2 is the large language model series developed by Moonshot AI
Port of Facebook's LLaMA model in C/C++
Diversity-driven optimization and large-model reasoning ability
Fast, flexible LLM inference
High-performance inference framework for large language models
Redundancy-aware KV Cache Compression for Reasoning Models
The official repo of Qwen chat & pretrained large language model
Run Local LLMs on Any Device. Open-source
Open-source LLM load balancer and serving platform for hosting LLMs
MiniMax M2.1, a SOTA model for real-world dev & agents.