A text-to-speech, speech-to-text and speech-to-speech library
Audio foundation model excelling in audio understanding
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Large Audio Language Model built for natural interactions
GUI for a Vocal Remover that uses Deep Neural Networks
Official Python inference and LoRA trainer package
Download your Spotify playlists and songs along with album art
Python library for audio and music analysis
A cross-platform GUI wrapper for yt-dlp written in PySide6
A lightning fast audio upsampler
Dumb downloader that scrapes the web
Tokenizer-Free TTS for Multilingual Speech Generation
Award-Winning Open Source Video Editing Software
A simple app to get songs from YouTube in mp3 format with artist name
Audiocraft is a library for audio processing and generation
Python Audio Analysis Library: Feature Extraction, Classification
Audio Normalization for Python/ffmpeg
A Family of Open Sourced Music Foundation Models
Music player and music library manager for Linux, Windows, and macOS
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Open-source multi-speaker long-form text-to-speech model
Automagically synchronize subtitles with video
AudioMuse-AI is an Open Source Dockerized environment
A lightweight audio-to-MIDI converter with pitch bend detection