From Vibe Coding to Agentic Engineering
GLM-5: From Vibe Coding to Agentic Engineering
Open-source, high-performance AI model with advanced reasoning
Advanced language and coding AI model
Powerful AI language model (MoE) optimized for efficiency/performance
157 models, 30 providers, one command to find what runs on hardware
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
LLM-enabled investment tracker that consolidates market performance
Agentic, Reasoning, and Coding (ARC) foundation models
A high-performance inference engine for AI models
TokenSpeed is a speed-of-light LLM inference engine
Open source LLM engineering platform: LLM Observability, metrics, etc.
Universal LLM Deployment Engine with ML Compilation
Open-source large language model family from Tencent Hunyuan
Kimi K2 is the large language model series developed by Moonshot AI
How to optimize some algorithm in cuda
Port of Facebook's LLaMA model in C/C++
Diversity-driven optimization and large-model reasoning ability
Run Local LLMs on Any Device. Open-source
Fast, flexible LLM inference
The official repo of Qwen chat & pretrained large language model
Redundancy-aware KV Cache Compression for Reasoning Models
High-performance inference framework for large language models
Open-source LLM load balancer and serving platform for hosting LLMs
MiniMax M2.1, a SOTA model for real-world dev & agents.