RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.
Features
- Real-time transcription via microphone
- Wake-word and voice-activity detection
- Asynchronous callback architecture
- Nanosecond timing metadata
- CLI and server modes with VAD filters
- Low-latency suitable for live apps
Categories
Speech to TextLicense
MIT LicenseFollow RealtimeSTT
Other Useful Business Software
AI-powered service management for IT and enterprise teams
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of RealtimeSTT!