Qwen3-ASR is an open-source series of ASR models
Easy Docker setup for Stable Diffusion with user-friendly UI
Repo of Qwen2-Audio chat & pretrained large audio language model
Qwen-Image is a powerful image generation foundation model
26m function call model that runs on incredibly small devices
The Clay Foundation Model - An open source AI model and interface
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Sharp Monocular Metric Depth in Less Than a Second
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A series of math-specific large language models of our Qwen2 series
Programmatic access to the AlphaGenome model
Bidirectional token-classification model for identifiable info
Chinese and English multimodal conversational language model
Qwen3-omni is a natively end-to-end, omni-modal LLM
Open Source Speech Language Model
Open-source industrial-grade ASR models
Fast-stable-diffusion + DreamBooth
Tool for exploring and debugging transformer model behaviors
4M: Massively Multimodal Masked Modeling
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
A Pragmatic VLA Foundation Model
Ling is a MoE LLM provided and open-sourced by InclusionAI