Audio foundation model excelling in audio understanding
A set of AI-enabled effects, generators, and analyzers for Audacity
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
GUI for a Vocal Remover that uses Deep Neural Networks
Python library for audio and music analysis
Functional programming language for signal processing
C++ Audio Plug-in Framework for desktop, mobile, xr and web
Audio Normalization for Python/ffmpeg
Python Audio Analysis Library: Feature Extraction, Classification
A professional video compression tool accessible to all
Fast and accurate automatic speech recognition (ASR) for edge devices
A lightning fast audio upsampler
A powerhouse of audio functionality for macOS, iOS, and tvOS
Automatic subtitle synchronization tool
Audiocraft is a library for audio processing and generation
A multimedia transcoded treasure chest / a FFmpeg case
Robust Speech Recognition via Large-Scale Weak Supervision
Give Claude the ability to watch and understand videos
AI tool converting video/audio into structured documents instantly
A JavaScript NES emulator
Data manipulation and transformation for audio signal processing
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Java ffmpeg and ffprobe command-line wrapper
Open speech-to-speech models and pipelines by Hugging Face toolkit AI