Open-source, high-performance AI model with advanced reasoning
A high-throughput and memory-efficient inference and serving engine
TokenSpeed is a speed-of-light LLM inference engine
Diversity-driven optimization and large-model reasoning ability
Open-source large language model family from Tencent Hunyuan
A Simple and Universal Swarm Intelligence Engine
Advanced LLM-powered brute-force tool combining AI intelligence
High-performance inference framework for large language models
A simple, performant and scalable JAX LLM
High-performance Inference and Deployment Toolkit for LLMs and VLMs
slime is an LLM post-training framework for RL Scaling
Advanced language and coding AI model
Powerful AI language model (MoE) optimized for efficiency/performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Retrieval and Retrieval-augmented LLMs
Universal LLM Deployment Engine with ML Compilation
950-line, minimal, extensible LLM inference engine built from scratch
AirLLM 70B inference with a single 4GB GPU
A high-performance ML model serving framework that offers dynamic batching
Traditional Mandarin LLMs for Taiwan
SDG is a specialized framework
LightLLM is a Python-based LLM (Large Language Model) inference
BISHENG is an open LLM DevOps platform for next-generation apps
A course on LLM inference serving on Apple Silicon
The official repository for ERNIE 4.5 and ERNIEKit