Speech Note Linux app: note taking, reading, and translating
Robust Speech Recognition via Large-Scale Weak Supervision
End-to-end speech processing toolkit
Toolkit for conversational AI
Underthesea - Vietnamese NLP Toolkit
Persian NLP Toolkit
OpenVINO™ Toolkit repository
A free, open source, and extensible speech-to-text application
Han Language Processing
Generate audiobooks from EPUBs, PDFs and text with captions
Audio foundation model excelling in audio understanding
Comprehensive Gradio WebUI for audio processing
Use Microsoft Edge's online text-to-speech service from Python
Stanford CoreNLP, a Java suite of core NLP tools
Towards Human-Sounding Speech
Training data (data labeling, annotation, workflow) for all data types
Apache OpenNLP
The Classical Language Toolkit
Open-source multi-speaker long-form text-to-speech model
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Video translation and dubbing tool powered by LLMs
A suite of advanced multi-modal LLMs
Assistant SDK to build a multimodal conversational UX for Android
Controllable and fast Text-to-Speech for over 7000 languages
Industrial-strength Natural Language Processing (NLP)