AudioMuse-AI is an Open Source Dockerized environment
Audio foundation model excelling in audio understanding
A set of AI-enabled effects, generators, and analyzers for Audacity
Sample code and notebooks for Generative AI on Google Cloud
AI tool converting video/audio into structured documents instantly
Open-source framework for intelligent speech interaction
Repo of Qwen2-Audio chat & pretrained large audio language model
A gallery that showcases on-device ML/GenAI use cases
Chat & pretrained large audio language model proposed by Alibaba Cloud
Large Audio Language Model built for natural interactions
LLM-based Reinforcement Learning audio edit model
Curated AI engineering notes on LLMs, generative models, and tools
Multi-modal large language model designed for audio understanding
Official Python inference and LoRA trainer package
Use API to call the music generation AI of suno.ai
A blazing fast AI Gateway with integrated guardrails
A python tool that uses GPT-4, FFmpeg, and OpenCV
Spring AI Alibaba examples for building and testing AI apps
Implementation of AudioLM audio generation model in Pytorch
Multimodal Diffusion with Representation Alignment
Framework for building real-time voice and multimodal AI agents
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Systematic Framework for Interactive World Modeling
A Family of Open Sourced Music Foundation Models
Local-first AI Notepad for Private Meetings