+
+

Related Products

  • Google Cloud Speech-to-Text
    355 Ratings
    Visit Website
  • Fathom
    7,471 Ratings
    Visit Website
  • LALAL.AI
    4,912 Ratings
    Visit Website
  • QEval
    30 Ratings
    Visit Website
  • Forethought
    166 Ratings
    Visit Website
  • Dialpad Connect
    4,168 Ratings
    Visit Website
  • Community Phone
    1,249 Ratings
    Visit Website
  • Assembled
    254 Ratings
    Visit Website
  • Evertune
    1 Rating
    Visit Website
  • DialerAI
    5 Ratings
    Visit Website

About

GPT-Realtime-Whisper is OpenAI’s streaming transcription model built for low-latency speech-to-text experiences in live products. It transcribes audio as people speak, helping voice-enabled apps feel faster, more responsive, and more natural, from captions that appear in the moment to meeting notes that keep up with the conversation. It makes live speech usable inside business workflows as it happens, so teams can power captions for meetings, classrooms, broadcasts, and events, generate notes and summaries while conversations are still in progress, build voice agents that need to understand users continuously, and create faster follow-up workflows for high-volume spoken interactions. It is part of a new generation of real-time voice models in the API that can reason, translate, and transcribe as people speak, moving real-time audio beyond simple call-and-response toward voice interfaces that can listen, translate, transcribe, and take action as a conversation unfolds.

About

Soniox develops highly accurate foundational speech models that transcribe, translate, and understand speech as it happens, and also provides the developer platform that makes it easy to integrate real-time voice intelligence into any application. Soniox Speech-to-Text API allows you to transcribe speech in 60+ languages in real-time with high accuracy - built for large scale. Soniox also provides regional data residency and is SOC 2 Type 2, GDPR and HIPAA compliant.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Live events technology teams that need low-latency speech-to-text for real-time captions, transcripts, and post-event content workflows

Audience

Medical transcription, call centers, speech translation, voice agents, media transcription, wearables

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

No images available

Pricing

$0.017 per minute
Free Version
Free Trial

Pricing

$0.10/hour of audio
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

OpenAI
Founded: 2015
United States
openai.com/index/advancing-voice-intelligence-with-new-models-in-the-api/

Company Information

Soniox
Founded: 2020
United States
soniox.com

Alternatives

Azure AI Speech

Azure AI Speech

Microsoft

Alternatives

Beey

Beey

NEWTON Technologies
Azure AI Speech

Azure AI Speech

Microsoft
SpokenData

SpokenData

ReplayWell
Utterly

Utterly

Semantic Bridge LLC

Categories

Categories

Integrations

OpenAI
OpenAI Whisper
gpt-realtime

Integrations

OpenAI
OpenAI Whisper
gpt-realtime
Claim GPT‑Realtime‑Whisper and update features and information
Claim GPT‑Realtime‑Whisper and update features and information
Claim Soniox and update features and information
Claim Soniox and update features and information