AudioMuse-AI is an open-source Dockerized environment
Sample code and notebooks for Generative AI on Google Cloud
A set of AI-enabled effects, generators, and analyzers for Audacity
Audio foundation model excelling in audio understanding
AI tool converting video/audio into structured documents instantly
Open-source framework for intelligent speech interaction
Repo of Qwen2-Audio chat & pretrained large audio language model
Large Audio Language Model built for natural interactions
A gallery that showcases on-device ML/GenAI use cases
Chat & pretrained large audio language model proposed by Alibaba Cloud
Curated AI engineering notes on LLMs, generative models, and tools
LLM-based Reinforcement Learning audio edit model
Official Python inference and LoRA trainer package
Multi-modal large language model designed for audio understanding
Use an API to call the music generation AI of suno.ai
A blazing fast AI Gateway with integrated guardrails
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Spring AI Alibaba examples for building and testing AI apps
Implementation of AudioLM audio generation model in PyTorch
Multimodal Diffusion with Representation Alignment
Framework for building real-time voice and multimodal AI agents
A Systematic Framework for Interactive World Modeling
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Local-first AI Notepad for Private Meetings
A multimodal model for brain response prediction