Speech-to-text, text-to-speech, and speaker recognition
Speech to Text to Speech, sends text as OSC messages
High-quality multi-lingual text-to-speech library by MyShell.ai
Robust Speech Recognition via Large-Scale Weak Supervision
A free, open source, and extensible speech-to-text application
Offline speech recognition API for Android, iOS, Raspberry Pi
Comprehensive Gradio WebUI for audio processing
Speech recognition module for Python
A fast, local neural text to speech system
Stanford CoreNLP, a Java suite of core NLP tools
A robust, efficient, low-latency speech-to-text library
A generative speech model for daily dialogue
The behavior guidance framework for customer-facing LLM agents
Chuyển đổi văn bản thành giọng nói không giới hạn
Qwen3-omni is a natively end-to-end, omni-modal LLM
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Capable of understanding text, audio, vision, video
Toolkit for conversational AI
A speech-text foundation model for real time dialogue
State-of-the-art TTS model under 25MB
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Subtitle Creation Assistant
Transcribe any audio to text, translate and edit subtitles 100% locall
A modern ebook manager and reader with sync and backup
Persian NLP Toolkit