Uses Qwen3-ASR, a local LLM, Whisper, and TEN-VAD
Local-first AI Notepad for Private Meetings
Multilingual Automatic Speech Recognition with word-level timestamps
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
An open-source text-to-speech system built by inverting Whisper
Another Whisper wrapper, built fully in C++, with some neat features.
A2M is a desktop app that converts audio to MIDI in one click.
Task of transcribing piano recordings into MIDI files
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video
Create, save, copy, and edit Tengwar texts with this application.
For transcribing: writing down the content of a sound file