WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, so you can target GPUs and different CPU architectures efficiently.

It can handle microphone input, pre-recorded audio files, and network streams such as RTSP and HLS, making it flexible for live events, monitoring, or accessibility workflows. On the server side, configuration options control the number of concurrent clients, the maximum connection time, and threading behavior, so deployments can be tuned for different environments. On the client side, you can set the source language, whether to translate the transcript into English, the model size, voice activity detection, and output recording behavior.
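As a rough sketch of how those client-side options come together, the snippet below connects the project's Python client to a locally running server. It is based on the usage pattern documented for `whisper_live.client.TranscriptionClient`; exact parameter names can differ between versions, so treat this as an illustration rather than a definitive reference.

```python
# Sketch: connecting WhisperLive's Python client to a local server.
# Assumes the `whisper-live` package is installed and a server is
# already listening on localhost:9090.
from whisper_live.client import TranscriptionClient

client = TranscriptionClient(
    "localhost",
    9090,
    lang="en",                 # source language of the audio
    translate=False,           # set True to translate speech into English
    model="small",             # Whisper model size
    use_vad=True,              # voice activity detection on the client
    save_output_recording=True,
    output_recording_filename="./output_recording.wav",
)

# Stream from the microphone:
client()
# Or transcribe a pre-recorded file instead:
# client("path/to/audio.wav")
```

The same client object can also be pointed at RTSP or HLS URLs, which is what makes it usable for monitoring live streams as well as local audio.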
Features
- Real-time Whisper transcription server with Python client for low-latency speech-to-text
- Multiple backends supported (Faster-Whisper, TensorRT, OpenVINO) for GPU and CPU acceleration
- Handles microphone, local audio files, RTSP streams, and HLS audio sources
- Optional translation mode to translate input speech into English
- Browser extensions for Chrome and Firefox plus an iOS client for direct device integration
- Docker-friendly deployment and tunable concurrency options (max clients, connection time, threads)
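To make the deployment and tuning options above concrete, here is one plausible way to install and launch the server from the command line. The flags follow the project's documented `run_server.py` interface, but they may vary by version, so check `python3 run_server.py --help` before relying on them.

```shell
# Install the package (pulls in the Faster-Whisper backend by default).
pip install whisper-live

# Start the transcription server on port 9090 with the Faster-Whisper backend.
# Assumed flags; consult --help for the options available in your version.
python3 run_server.py --port 9090 --backend faster_whisper
```

For containerized deployments, the project also publishes Docker images, in which case the equivalent step is a `docker run` that forwards port 9090 (and passes `--gpus all` when GPU acceleration is wanted).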