Repository for the Qwen2-Audio chat and pretrained large audio language models
Fast multimodal LLM for real-time voice interaction and AI apps
Curated collection of Amazing Python scripts
Virtual AI anchor that combines state-of-the-art technology
Chat & pretrained large audio language model proposed by Alibaba Cloud
Toolkit for conversational AI
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A natural language interface for computers
Toolkit for audio, music, and speech generation
General Speech Restoration