RealtimeSTT is a Python-based real-time speech-to-text engine that emphasizes low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, making it suitable for building voice assistants, meeting transcribers, or live caption systems.
Features
- Real-time transcription via microphone
- Wake-word and voice-activity detection
- Asynchronous callback architecture
- Nanosecond timing metadata
- CLI and server modes with VAD filters
- Low latency, suitable for live applications
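To make the voice-activity detection, segmentation, and callback features concrete, here is a minimal standard-library sketch. It is not RealtimeSTT's implementation or API: the names `rms`, `segment_speech`, `threshold`, and `max_silence_frames` are hypothetical, and a simple energy threshold stands in for whatever detector the library actually uses. Frames are assumed to be short lists of samples in the range -1.0 to 1.0.

```python
import math
import time
from typing import Callable, Iterable, List, Sequence, Tuple


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one audio frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def segment_speech(
    frames: Iterable[Sequence[float]],
    on_segment: Callable[[int, int, int], None],
    threshold: float = 0.1,          # hypothetical energy cutoff for "voiced"
    max_silence_frames: int = 2,     # hypothetical pause tolerated inside a segment
) -> None:
    """Group voiced frames into segments and report each one via a callback.

    on_segment receives (first_voiced_index, last_voiced_index, timestamp_ns),
    where timestamp_ns is a monotonic clock reading in nanoseconds.
    """
    start = None        # index of the first voiced frame in the open segment
    last_voiced = None  # index of the most recent voiced frame
    silence = 0         # consecutive unvoiced frames since last_voiced

    for i, frame in enumerate(frames):
        if rms(frame) >= threshold:
            if start is None:
                start = i
            last_voiced = i
            silence = 0
        elif start is not None:
            silence += 1
            if silence > max_silence_frames:
                # The pause is long enough: close the segment.
                on_segment(start, last_voiced, time.monotonic_ns())
                start = None
                silence = 0

    # Flush a segment that is still open when the input ends.
    if start is not None:
        on_segment(start, last_voiced, time.monotonic_ns())


# Usage: synthetic frames instead of a microphone.
loud, quiet = [0.5] * 4, [0.0] * 4
stream = [quiet, loud, loud, quiet, loud, quiet, quiet, quiet, loud]

segments: List[Tuple[int, int]] = []
segment_speech(stream, lambda first, last, t_ns: segments.append((first, last)))
print(segments)  # [(1, 4), (8, 8)]
```

The single quiet frame at index 3 is shorter than the tolerated pause, so it stays inside the first segment. The three quiet frames that follow exceed the pause and close it. A real-time system would pass each closed segment to a transcription model instead of collecting indices.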
Categories
Speech to Text
License
MIT License