Large Audio Language Model built for natural interactions
AudioMuse-AI is an open-source Dockerized environment
Uses Qwen3-ASR, a local LLM, Whisper, and TEN-VAD
Code and models for the ICML 2024 paper NExT-GPT
Data infrastructure for multimodal AI workloads
Build multimodal language agents for fast prototyping and production
Large language model for live-stream sales hosts ("selling anchors")