Agentic, Reasoning, and Coding (ARC) foundation models
MOSS‑TTS Family open‑source speech and sound generation model
Long-form streaming TTS system for multi-speaker dialogue generation
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Production-tested AI infrastructure tools
Large Multimodal Models for Video Understanding and Editing
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open Multilingual Multimodal Chat LMs
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201