Whisper

OpenAI

About (Amazon Transcribe)

Amazon Transcribe makes it easy for developers to add speech-to-text capabilities to their applications. Audio data is virtually impossible for computers to search and analyze, so recorded speech must be converted to text before applications can use it. Historically, customers had to rely on transcription providers that required expensive contracts and were hard to integrate into their technology stacks. Many of these providers use outdated technology that adapts poorly to different scenarios, such as the low-fidelity phone audio common in contact centers, which results in poor accuracy. Amazon Transcribe uses a deep learning process called automatic speech recognition (ASR) to convert speech to text quickly and accurately. It can be used to transcribe customer service calls, automate subtitling, and generate metadata for media assets to create a fully searchable archive.

About (Whisper)

We’ve trained and are open-sourcing a neural net called Whisper that approaches human-level robustness and accuracy in English speech recognition. Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise, and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing. The Whisper architecture is a simple end-to-end approach, implemented as an encoder-decoder Transformer. Input audio is split into 30-second chunks, converted into a log-Mel spectrogram, and then passed into an encoder.
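The first preprocessing step described above, splitting input audio into 30-second chunks, can be sketched as follows. This is a minimal illustration, not Whisper's actual inference code: the function name `split_into_chunks`, the 16 kHz sample rate, and the zero-padding of the final chunk are assumptions made for the example, and the log-Mel spectrogram and encoder stages are not shown.

```python
from typing import List, Sequence

SAMPLE_RATE = 16_000   # assumed sample rate in Hz (not stated in the description)
CHUNK_SECONDS = 30     # chunk length given in the description


def split_into_chunks(samples: Sequence[float],
                      sample_rate: int = SAMPLE_RATE,
                      chunk_seconds: int = CHUNK_SECONDS) -> List[List[float]]:
    """Split mono audio samples into fixed-length chunks.

    The final chunk is zero-padded (an assumption) so that every chunk
    has exactly sample_rate * chunk_seconds samples.
    """
    chunk_len = sample_rate * chunk_seconds
    chunks: List[List[float]] = []
    for start in range(0, len(samples), chunk_len):
        chunk = list(samples[start:start + chunk_len])
        # Pad the last, shorter chunk with silence up to the full length.
        chunk.extend([0.0] * (chunk_len - len(chunk)))
        chunks.append(chunk)
    return chunks


if __name__ == "__main__":
    audio = [0.1] * (SAMPLE_RATE * 70)   # 70 seconds of dummy audio
    chunks = split_into_chunks(audio)
    print(len(chunks), len(chunks[0]))
```

With 70 seconds of input, the sketch yields three chunks of equal length, the last of which holds 10 seconds of audio followed by padding. Each chunk would then be converted to a log-Mel spectrogram before being passed to the encoder.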

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience (Amazon Transcribe)

Developers looking for automatic speech recognition and transcription software to add speech-to-text capabilities to their applications

Audience (Whisper)

Anyone looking for a tool to recognize speech automatically and improve text transcription

Support

Phone Support
24/7 Live Support
Online

API

Offers API

Pricing (Amazon Transcribe)

$0.00013
Free Version
Free Trial

Pricing (Whisper)

No information available.
Free Version
Free Trial

Training

Documentation
Webinars
Live Online
In Person

Company Information

Amazon
Founded: 1994
United States
aws.amazon.com/transcribe/

Company Information

OpenAI
United States
openai.com/blog/whisper/

Integrations

AWS AI Services
AWS App Mesh
Amazon
Amazon API Gateway
Amazon Ads
Amazon Kendra
Amazon Redshift
Amazon S3 Glacier
Amazon Simple Notification Service (SNS)
Amazon Web Services (AWS)
Azure AI Speech
Hyprnote
NoteVocal
OpenAI
ReByte
TurboScribe
Undrstnd
Unremot
Vocode
Waveloom
