A lightweight vLLM implementation built from scratch
System Level Intelligent Router for Mixture-of-Models at Cloud
Visual Causal Flow
Towards Human-Sounding Speech
Run a full local LLM stack with one command using Docker
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
From Vibe Coding to Agentic Engineering
Interface for OuteTTS models
Advanced language and coding AI model
Accurate × Fast × Comprehensive
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Renderer for the harmony response format to be used with gpt-oss
Open-source large language model family from Tencent Hunyuan
Multilingual Document Layout Parsing in a Single Vision-Language Model
MiniMax M2.1, a SOTA model for real-world dev & agents.
Ultra-Efficient LLMs on End Device
High-performance Inference and Deployment Toolkit for LLMs and VLMs
LightLLM is a Python-based LLM (Large Language Model) inference
A course of learning LLM inference serving on Apple Silicon
Mooncake is the serving platform for Kimi
New family of code large language models (LLMs)
MiniMax-M2, a model built for Max coding & agentic workflows
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
OpenAI’s compact 20B open model for fast, agentic, and local use
OpenAI’s open-weight 120B model optimized for reasoning and tooling