Bailing is a voice dialogue robot similar to GPT-4o
A robust, efficient, low-latency speech-to-text library
Uses Qwen3-ASR, a local LLM, Whisper, and TEN-VAD
A free, open-source, extensible speech-to-text application
Amica is an open-source interface for interactive communication
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM
Live Transcribe is an Android application
SIP Video Multiconference Media Server with WebRTC support