Whisper-large-v3 is OpenAI’s multilingual automatic speech recognition (ASR) and speech translation model with 1.54 billion parameters, trained on about 5 million hours of audio: 1 million hours of weakly labeled data and 4 million hours pseudo-labeled with Whisper-large-v2. Built on a Transformer encoder-decoder architecture, it supports 99 languages and improves on earlier checkpoints in transcription accuracy, robustness to noise, and handling of diverse accents. Compared to previous versions, v3 takes a 128 Mel bin spectrogram as input (up from 80) and adds a language token for Cantonese, achieving a 10% to 20% error reduction over Whisper-large-v2 across a wide range of languages. It performs zero-shot transcription and speech translation into English, detects the spoken language automatically, and supports word-level timestamps and long-form audio processing. The model integrates with Hugging Face Transformers and supports optimizations such as batching, SDPA, and Flash Attention 2.
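To make the 128 Mel bin input concrete, here is a minimal sketch of the arithmetic behind the encoder's input shape. The 16 kHz sample rate, 160-sample hop, and 30-second window are assumptions taken from the commonly documented Whisper preprocessing settings, not from the description above, and the function names are hypothetical.

```python
# Assumed Whisper preprocessing constants (not stated in the description above).
SAMPLE_RATE = 16_000   # audio samples per second
HOP_LENGTH = 160       # samples between consecutive spectrogram frames
WINDOW_SECONDS = 30    # length of one audio window fed to the encoder


def frames_per_window(seconds=WINDOW_SECONDS):
    """Number of spectrogram frames produced for one window of audio."""
    return seconds * SAMPLE_RATE // HOP_LENGTH


def input_feature_shape(n_mels):
    """Shape (mel bins, frames) of the log-Mel input for one window."""
    return (n_mels, frames_per_window())


print(input_feature_shape(128))  # large-v3: 128 Mel bins
print(input_feature_shape(80))   # earlier checkpoints: 80 Mel bins
```

Under these assumptions, only the first dimension changes between v3 and earlier versions; the number of frames per window stays the same.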

Features

  • Supports transcription and translation in 99 languages
  • Trained on 5M+ hours of labeled and pseudo-labeled audio
  • High accuracy with improved robustness to noise and accents
  • Enables word- and sentence-level timestamps
  • Supports long-form audio via chunked or sequential processing
  • Compatible with PyTorch, JAX, and the Transformers pipeline API
  • Optimized with Flash Attention 2, SDPA, and torch.compile
  • Apache 2.0 licensed for flexible commercial and research use
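Several of the features above (transcription or translation, timestamps, chunked long-form processing, SDPA) map onto options of the Hugging Face Transformers pipeline. The following is a minimal sketch, assuming the checkpoint is published as `openai/whisper-large-v3` and that `torch` and `transformers` are installed. The helper names (`build_generate_kwargs`, `build_pipeline_kwargs`, `transcribe`) are hypothetical and only illustrate one way to organize the options.

```python
MODEL_ID = "openai/whisper-large-v3"  # assumed Hugging Face Hub identifier


def build_generate_kwargs(language=None, task="transcribe"):
    """Decoding options: task is 'transcribe' or 'translate' (into English).

    Leaving language as None lets the model detect the spoken language.
    """
    if task not in ("transcribe", "translate"):
        raise ValueError(f"unsupported task: {task!r}")
    kwargs = {"task": task}
    if language is not None:
        kwargs["language"] = language
    return kwargs


def build_pipeline_kwargs(chunked=False, batch_size=8, use_sdpa=False):
    """Construction options for the automatic-speech-recognition pipeline."""
    kwargs = {"model": MODEL_ID}
    if chunked:
        # Chunked long-form mode: split audio into 30 s chunks and batch them.
        kwargs["chunk_length_s"] = 30
        kwargs["batch_size"] = batch_size
    if use_sdpa:
        # Request PyTorch scaled dot-product attention when loading the model.
        kwargs["model_kwargs"] = {"attn_implementation": "sdpa"}
    return kwargs


def transcribe(audio_path, language=None, task="transcribe",
               word_timestamps=False, **pipeline_options):
    """Load the model and run it on one audio file (downloads the weights)."""
    # Imported lazily so the helpers above work without these packages.
    import torch
    from transformers import pipeline

    use_gpu = torch.cuda.is_available()
    asr = pipeline(
        "automatic-speech-recognition",
        torch_dtype=torch.float16 if use_gpu else torch.float32,
        device="cuda:0" if use_gpu else "cpu",
        **build_pipeline_kwargs(**pipeline_options),
    )
    return asr(
        audio_path,
        # "word" requests word-level timestamps; True requests segment-level.
        return_timestamps="word" if word_timestamps else True,
        generate_kwargs=build_generate_kwargs(language, task),
    )
```

A call such as `transcribe("sample.wav", language="english", word_timestamps=True)`, where `sample.wav` is a placeholder file name, would return a dictionary whose `"text"` entry holds the transcript and whose `"chunks"` entry holds the timestamped pieces. Leaving `chunked` at its default falls back to sequential long-form decoding, which tends to be slower than chunked mode.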

Categories

AI Models

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-06-27