Open-source framework for intelligent speech interaction
Audio foundation model excelling in audio understanding
A text-to-speech, speech-to-text and speech-to-speech library
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Large Audio Language Model built for natural interactions
LLM-based Reinforcement Learning audio edit model
Multi-modal large language model designed for audio understanding
GUI for a Vocal Remover that uses Deep Neural Networks
Official Python inference and LoRA trainer package
Download your Spotify playlists and songs along with album art
A cross-platform GUI wrapper for yt-dlp written in PySide6
A lightning fast audio upsampler
Audiocraft is a library for audio processing and generation
Tokenizer-Free TTS for Multilingual Speech Generation
Python library for audio and music analysis
A Python library for audio
Transforming Multimodal Content into Captivating Multilingual Audio
Dumb downloader that scrapes the web
Award-Winning Open Source Video Editing Software
Python Audio Analysis Library: Feature Extraction, Classification
A Family of Open Sourced Music Foundation Models
Taming Stable Diffusion for Lip Sync
Multilingual speech recognition and audio understanding model
A lightweight audio-to-MIDI converter with pitch bend detection