Speech Note Linux app for note taking, reading and translating
Open speech-to-speech models and pipelines by Hugging Face
Robust Speech Recognition via Large-Scale Weak Supervision
Translate the video from one language to another and embed dubbing
End-to-end speech processing toolkit
Fast and accurate automatic speech recognition (ASR) for edge devices
A free, open source, and extensible speech-to-text application
Automatic Speech Recognition with Word-level Timestamps
Toolkit for conversational AI
Comprehensive Gradio WebUI for audio processing
Underthesea - Vietnamese NLP Toolkit
Persian NLP Toolkit
Generate audiobooks from EPUBs, PDFs and text with captions
OpenVINO™ Toolkit repository
Han Language Processing
Stanford CoreNLP, a Java suite of core NLP tools
Faster Whisper transcription with CTranslate2
Open Source Speech Language Model
Fast multimodal LLM for real-time voice interaction and AI apps
Use Microsoft Edge's online text-to-speech service from Python
Audio foundation model excelling in audio understanding
Open-source multi-speaker long-form text-to-speech model
AI-powered tool for generating, optimizing, and translating subtitles
Training data (data labeling, annotation, workflow) for all data types
Voice Recognition to Text Tool