Speech Note Linux app. Note taking, reading and translating
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Robust Speech Recognition via Large-Scale Weak Supervision
End-to-end speech processing toolkit
Translate the video from one language to another and embed dubbing
Toolkit for conversational AI
Fast and accurate automatic speech recognition (ASR) for edge devices
Underthesea - Vietnamese NLP Toolkit
A free, open source, and extensible speech-to-text application
OpenVINO™ Toolkit repository
Comprehensive Gradio WebUI for audio processing
Persian NLP Toolkit
Automatic Speech Recognition with Word-level Timestamps
Han Language Processing
Open Source Speech Language Model
Generate audiobooks from EPUBs, PDFs and text with captions
Audio foundation model excelling in audio understanding
Faster Whisper transcription with CTranslate2
Stanford CoreNLP, a Java suite of core NLP tools
Use Microsoft Edge's online text-to-speech service from Python
AI-powered tool for generating, optimizing, and translating subtitles
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Open-source multi-speaker long-form text-to-speech model
Fast multimodal LLM for real-time voice interaction and AI apps
Training data (data labeling, annotation, workflow) for all data types