Kimi K2.7 Code
Coding-focused Kimi model for long-horizon agent workflows
...Architecturally, it uses a 1T-parameter Mixture-of-Experts design with 32B activated parameters, 61 layers, 384 experts, a 256K-token context window, and a MoonViT vision encoder. The model supports image and video input, native INT4 quantization, interleaved thinking, and multi-step tool calling. It also forces preserve-thinking mode by default, retaining full reasoning context across multi-turn interactions to improve coding-agent consistency. K2.7 Code is recommended for use through Kimi Code CLI and can be deployed with vLLM, SGLang, or KTransformers.