Open-source industrial-grade ASR models
Audio foundation model excelling in audio understanding
Repo for Qwen2-Audio chat and pretrained large audio language models
Capable of understanding text, audio, vision, and video
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Qwen3-ASR is an open-source series of ASR models
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Multi-modal large language model designed for audio understanding