Audio foundation model excelling in audio understanding
Large Audio Language Model built for natural interactions
The official Python library for the Fish Audio API
Audio Plugin for Audio to MIDI transcription using deep learning
Official Python inference and LoRA trainer package
Compilation of authoritative information on audio and video streaming
A powerhouse of audio functionality for macOS, iOS, and tvOS
A Family of Open Sourced Music Foundation Models
Audio player that can play common audio formats
A tweak to enhance Spotify experience
Python Audio Analysis Library: Feature Extraction, Classification
HLS.js is a JavaScript library that plays HLS in browsers
Tokenizer-Free TTS for Multilingual Speech Generation
Audio Normalization for Python/ffmpeg
Code for openai.fm, a demo for the OpenAI Speech API
TTS model capable of streaming conversational audio in realtime
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Cross platform GUI tool for downloading videos from Bilibili sites
Simple and Fast Multimedia Library
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Extract audio and video content and organize it into a Markdown note
A lightning fast audio upsampler
Speakr is a personal, self-hosted web application
SOTA discrete acoustic codec models with 40/75 tokens per second
s&box is a modern game engine, built on Valve's Source 2