RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Features

  • Real-time transcription via microphone
  • Wake-word and voice-activity detection
  • Asynchronous callback architecture
  • Nanosecond timing metadata
  • CLI and server modes with VAD filters
  • Low-latency suitable for live apps

Project Samples

Project Activity

See All Activity >

Categories

Speech to Text

License

MIT License

Follow RealtimeSTT

RealtimeSTT Web Site

Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud Icon
Full-stack observability with actually useful AI | Grafana Cloud

Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Create free account
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of RealtimeSTT!

Additional Project Details

Programming Language

Python

Related Categories

Python Speech to Text Software

Registered

2025-07-03