Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio foundation model excelling in audio understanding
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Large Audio Language Model built for natural interactions
LLM-based Reinforcement Learning audio edit model
Multi-modal large language model designed for audio understanding
GUI for a Vocal Remover that uses Deep Neural Networks
Official Python inference and LoRA trainer package
Download your Spotify playlists and songs along with album art
A cross-platform GUI wrapper for yt-dlp written in PySide6
Python library for audio and music analysis
A lightning fast audio upsampler
Audiocraft is a library for audio processing and generation
Dumb downloader that scrapes the web
Tokenizer-Free TTS for Multilingual Speech Generation
A Python library for audio
Python Audio Analysis Library: Feature Extraction, Classification
Award-Winning Open Source Video Editing Software
A Family of Open Sourced Music Foundation Models
Taming Stable Diffusion for Lip Sync
Transforming Multimodal Content into Captivating Multilingual Audio
Music player and music library manager for Linux, Windows, and macOS
Multilingual speech recognition and audio understanding model