A state-of-the-art open visual language model
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Fast stable diffusion on CPU and AI PC
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
Agentic, Reasoning, and Coding (ARC) foundation models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Advanced language and coding AI model
A Family of Open Sourced Music Foundation Models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Lets make video diffusion practical
Visual Causal Flow
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official inference repo for FLUX.2 models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Qwen3-TTS is an open-source series of TTS models
The official repo of Qwen chat & pretrained large language model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen-Image is a powerful image generation foundation model
Official inference repo for FLUX.1 models
State-of-the-art TTS model under 25MB
Renderer for the harmony response format to be used with gpt-oss
General-purpose image editing model that delivers high-fidelity
ChatGLM-6B: An Open Bilingual Dialogue Language Model