GUI for a Vocal Remover that uses Deep Neural Networks
A text-to-speech, speech-to-text and speech-to-speech library
Repo of Qwen2-Audio chat & pretrained large audio language model
Open-source framework for intelligent speech interaction
Chat & pretrained large audio language model proposed by Alibaba Cloud
Audio foundation model excelling in audio understanding
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
LLM-based Reinforcement Learning audio editing model
Comprehensive Gradio WebUI for audio processing
A Web UI for easy subtitle generation using the Whisper model
Official Python inference and LoRA trainer package
Audiocraft is a library for audio processing and generation
A Family of Open-Source Music Foundation Models
Python Audio Analysis Library: Feature Extraction, Classification
Cloud-native open source data warehouse for analytics and AI queries
A lightweight audio-to-MIDI converter with pitch bend detection
48 kHz stereo neural audio codec for general audio
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A Python library for audio
Synchronized Translation for Videos
AudioMuse-AI is an open-source Dockerized environment
Taming Stable Diffusion for Lip Sync
An extremely simple tool for separating vocals and background music
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD