Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Local-first AI Notepad for Private Meetings
Multilingual Automatic Speech Recognition with word-level timestamps
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
An open-source text-to-speech system built by inverting Whisper
Another Whisper wrapper, built fully in C++, with some neat features.
Task of transcribing piano recordings into MIDI files
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video
For transcription: writing down the content of a sound file