Speech Note Linux app. Note taking, reading and translating
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Robust Speech Recognition via Large-Scale Weak Supervision
End-to-end speech processing toolkit
Fast and accurate automatic speech recognition (ASR) for edge devices
Toolkit for conversational AI
Translate the video from one language to another and embed dubbing
A free, open source, and extensible speech-to-text application
Underthesea - Vietnamese NLP Toolkit
OpenVINO™ Toolkit repository
Comprehensive Gradio WebUI for audio processing
Persian NLP Toolkit
Automatic Speech Recognition with Word-level Timestamps
Audio foundation model excelling in audio understanding
Generate audiobooks from EPUBs, PDFs and text with captions
Open Source Speech Language Model
Han Language Processing
Faster Whisper transcription with CTranslate2
Use Microsoft Edge's online text-to-speech service from Python
AI-powered tool for generating, optimizing, and translating subtitles
Stanford CoreNLP, a Java suite of core NLP tools
Open-source multi-speaker long-form text-to-speech model
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Fast multimodal LLM for real-time voice interaction and AI apps
Training data (data labeling, annotation, workflow) for all data types