Audio foundation model excelling in audio understanding
Large Audio Language Model built for natural interactions
Audio Plugin for Audio to MIDI transcription using deep learning
Official Python inference and LoRA trainer package
Audiocraft is a library for audio processing and generation
Python Audio Analysis Library: Feature Extraction, Classification
Audio player that can play common audio formats
HLS.js is a JavaScript library that plays HLS in browsers
A Family of Open Sourced Music Foundation Models
A lightning fast audio upsampler
Simple and Fast Multimedia Library
Oboe is a C++ library that makes it easy to build high-performance
The missing YouTube Music macOS app
Code for openai.fm, a demo for the OpenAI Speech API
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A powerhouse of audio functionality for macOS, iOS, and tvOS
A tweak to enhance Spotify experience
Multilingual speech recognition and audio understanding model
Open-source multi-speaker long-form text-to-speech model
Implementation of AudioLM audio generation model in Pytorch
Extract audio and video content and organize it into a Markdown note
PersonaPlex code
A nearly-live implementation of OpenAI's Whisper
Stable diffusion for real-time music generation (web app)
s&box is a modern game engine, built on Valve's Source 2