Speech Note Linux app. Note taking, reading and translating
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Robust Speech Recognition via Large-Scale Weak Supervision
End-to-end speech processing toolkit
Translate the video from one language to another and embed dubbing
Fast and accurate automatic speech recognition (ASR) for edge devices
Toolkit for conversational AI
A free, open source, and extensible speech-to-text application
Underthesea - Vietnamese NLP Toolkit
Open Source Speech Language Model
Comprehensive Gradio WebUI for audio processing
OpenVINO™ Toolkit repository
Persian NLP Toolkit
Automatic Speech Recognition with Word-level Timestamps
Han Language Processing
Audio foundation model excelling in audio understanding
Faster Whisper transcription with CTranslate2
Use Microsoft Edge's online text-to-speech service from Python
Stanford CoreNLP, a Java suite of core NLP tools
Generate audiobooks from EPUBs, PDFs and text with captions
Open-source multi-speaker long-form text-to-speech model
AI-powered tool for generating, optimizing, and translating subtitles
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Towards Human-Sounding Speech
Fast multimodal LLM for real-time voice interaction and AI apps