Repo of the Qwen2-Audio chat & pretrained large audio language models
Contexts Optical Compression
Audio foundation model excelling in audio understanding
Capable of understanding text, audio, vision, video
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Visual Causal Flow
Qwen3-ASR is an open-source series of ASR models
AI Suite for upscaling, interpolating & restoring images/videos