Voxtral Transcribe 2
Voxtral Transcribe 2 is a family of speech-to-text models from Mistral AI that delivers low-latency, high-quality audio transcription and speaker diarization with broad language support. The suite includes two models. Voxtral Mini Transcribe V2 is optimized for batch transcription, with features such as word-level timestamps, context biasing, and support for 13 languages. Voxtral Realtime is designed for live, streaming speech recognition, with latency configurable to below 200 ms for real-time applications. Both models achieve state-of-the-art transcription accuracy while running efficiently and economically, and Mini Transcribe V2 offers low error rates. Realtime is available as open source under the Apache 2.0 license, so developers can deploy it on edge devices or in private environments.
Speechmatics
Best-in-Market Speech-to-Text & Voice AI for Enterprises.
Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision across the widest range of languages, dialects, and accents.
Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights.
Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence.
🔹 Unmatched Accuracy – Superior transcription across languages & accents
🔹 Flexible Deployment – Cloud, on-prem, and hybrid
🔹 Enterprise-Grade Security – Full data control
🔹 Real-Time & Batch Processing – Scalable transcription
Otter.ai
Otter is where conversations live. Generate rich notes for meetings, interviews, lectures, and other important voice conversations with Otter, your AI-powered assistant. Teams big and small have the Otter advantage and trust Otter to transcribe their important conversations. Our shiny new release, Otter 2.0, adds more functionality to improve collaboration and productivity. The Teams plan includes capabilities designed especially for small and medium businesses and for teams in larger enterprises. Record and review in real time. Search, play, edit, organize, and share your conversations from any device. Record conversations using Otter on your phone or web browser. Import or sync recordings from other services. Integrate with Zoom. Get real-time streaming transcripts and, within minutes, rich, searchable notes with text, audio, images, speaker ID, and key phrases. Share or export voice notes to inform others and get everyone on the same page.
GPT-Realtime-Whisper
GPT-Realtime-Whisper is OpenAI’s streaming transcription model built for low-latency speech-to-text experiences in live products. It transcribes audio as people speak, helping voice-enabled apps feel faster, more responsive, and more natural, from captions that appear in the moment to meeting notes that keep up with the conversation. It makes live speech usable inside business workflows as it happens. Teams can power captions for meetings, classrooms, broadcasts, and events; generate notes and summaries while conversations are still in progress; build voice agents that need to understand users continuously; and create faster follow-up workflows for high-volume spoken interactions. It is part of a new generation of real-time voice models in the API that can reason, translate, and transcribe as people speak, moving real-time audio beyond simple call-and-response toward voice interfaces that can listen and take action as a conversation unfolds.