Qwen-Audio is a large audio-language model developed by Alibaba Cloud. It accepts several types of audio input (speech, natural sounds, music, singing) together with text input, and outputs text. An instruction-tuned version, Qwen-Audio-Chat, supports multi-round conversational interaction over audio and text input, including creative tasks and reasoning over audio. The model is trained with a multi-task framework covering more than 30 audio tasks and achieves strong performance across multiple benchmarks without task-specific fine-tuning. Its features include flexible multi-turn chat, audio understanding and reasoning, music appreciation, and tool usage (e.g. voice editing).

Features

  • Supports various audio types: speech, natural sounds, music, singing, etc.
  • Multi-task training framework covering 30+ audio tasks, designed to enable knowledge transfer across tasks while avoiding interference between them
  • Audio + text input and text output; Qwen-Audio-Chat enables multi-round dialogue over audio and text
  • Strong zero- or few-shot performance: achieves state-of-the-art results on multiple audio benchmarks (AISHELL-1, CochlScene, ClothoAQA, VocalSound) without task-specific fine-tuning
  • Flexibility: supports multiple-audio analysis, sound understanding and reasoning, creative tasks such as music appreciation, and external tool usage (e.g. voice editing)
  • Multilingual audio support covering many languages and dialects; voice chat modes; designed for flexible real-world audio interaction scenarios
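
The audio + text input and multi-round dialogue described above can be sketched in Python roughly as follows. This is a minimal sketch, not the project's reference code: it assumes the Hugging Face checkpoint name `Qwen/Qwen-Audio-Chat`, and it assumes that the checkpoint's remote code provides `tokenizer.from_list_format` and `model.chat`, as shown in the upstream README. Those two calls are not part of the core `transformers` API. The helper names `build_items`, `load_chat_model`, and `ask`, and the file name `clip.wav`, are hypothetical.

```python
from typing import Any, Dict, List, Optional, Tuple

# Assumed checkpoint name on the Hugging Face Hub.
MODEL_ID = "Qwen/Qwen-Audio-Chat"


def build_items(audio_paths: List[str], prompt: str) -> List[Dict[str, str]]:
    """Build the mixed audio/text item list: audio entries first, then the text prompt."""
    items = [{"audio": path} for path in audio_paths]  # local paths or URLs
    items.append({"text": prompt})
    return items


def load_chat_model() -> Tuple[Any, Any]:
    """Load tokenizer and model. Downloads weights; needs transformers, torch, accelerate."""
    # Imported lazily so the pure-Python helper above works without these packages.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, device_map="auto", trust_remote_code=True
    ).eval()
    return tokenizer, model


def ask(tokenizer: Any, model: Any, items: List[Dict[str, str]],
        history: Optional[list] = None) -> Tuple[str, list]:
    """Run one chat round; pass the returned history back in for the next round."""
    # from_list_format and chat are assumed to come from the checkpoint's remote code.
    query = tokenizer.from_list_format(items)
    response, history = model.chat(tokenizer, query=query, history=history)
    return response, history


if __name__ == "__main__":
    tokenizer, model = load_chat_model()
    # Round 1: audio plus a text question.
    reply, history = ask(tokenizer, model,
                         build_items(["clip.wav"], "What does the speaker say?"))
    print(reply)
    # Round 2: text-only follow-up that reuses the conversation history.
    reply, history = ask(tokenizer, model,
                         build_items([], "What emotion does the speaker convey?"), history)
    print(reply)
```

Passing several paths to `build_items` would correspond to the multiple-audio analysis mentioned in the feature list, under the same assumptions.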

License

Apache License V2.0

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Large Language Models (LLM), Python AI Models

Registered

2025-09-23