Audio foundation model excelling in audio understanding
Large Audio Language Model built for natural interactions
Audio Normalization for Python/ffmpeg
Python Audio Analysis Library: Feature Extraction, Classification
Audiocraft is a library for audio processing and generation
A powerhouse of audio functionality for macOS, iOS, and tvOS
A lightning fast audio upsampler
Robust Speech Recognition via Large-Scale Weak Supervision
Give Claude the ability to watch and understand videos
AI tool converting video/audio into structured documents instantly
A JavaScript NES emulator
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Download videos from almost any website
Self-hosted collection of powerful web-based tools for everyday tasks
Open-source multi-speaker long-form text-to-speech model
The core of Membrane Framework, multimedia processing framework
AudioMuse-AI is an Open Source Dockerized environment
Flash + AIR sound effects generator. Based on Sfxr.
FFmpeg for browser, powered by WebAssembly
Improved AudioBookConverter based on freeipodsoftware release
Official repository for LTX-Video
Oboe is a C++ library that makes it easy to build high-performance
Automated YouTube Shorts pipeline
Self-hosted AI audio transcription