Audio Plugin for Audio to MIDI transcription using deep learning
Automatic Speech Recognition with Word-level Timestamps
Open Source AI Dictation App
Faster Whisper transcription with CTranslate2
Self-hosted AI audio transcription
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A private, local meeting notes assistant
A free, open source, and extensible speech-to-text application
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Fast and accurate automatic speech recognition (ASR) for edge devices
Crowdsourcing platform for full text transcription and tagging
A Web UI for easy subtitle using whisper model
Comprehensive Gradio WebUI for audio processing
Qwen3-ASR is an open-source series of ASR models
Generate blog articles from video or audio
A lightweight audio-to-MIDI converter with pitch bend detection
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper
Offline speech recognition API for Android, iOS, Raspberry Pi
AI-powered tool for generating, optimizing, and translating subtitles
PageLM is a community driven version of NotebookLM
A Family of Open Sourced Music Foundation Models
Synchronized Translation for Videos
A nearly-live implementation of OpenAI's Whisper
Multilingual speech recognition and audio understanding model
GLM-4-Voice | End-to-End Chinese-English Conversational Model