Voxtral

Voxtral

Mistral AI
+
+

Related Products

  • Google Cloud Speech-to-Text
    361 Ratings
    Visit Website
  • RunPod
    206 Ratings
    Visit Website
  • QEval
    30 Ratings
    Visit Website
  • Gemini Enterprise Agent Platform
    961 Ratings
    Visit Website
  • Bright Data
    1,360 Ratings
    Visit Website
  • PackageX OCR Scanning
    46 Ratings
    Visit Website
  • Assembled
    254 Ratings
    Visit Website
  • Cloudflare
    2,002 Ratings
    Visit Website
  • Podium
    2,128 Ratings
  • Forethought
    167 Ratings
    Visit Website

About

Deploy accurate speech recognition at scale while continuously improving model performance by labeling data and training from a single console. We deliver state-of-the-art speech recognition and understanding at scale. We do it by providing cutting-edge model training and data-labeling alongside flexible deployment options. Our platform recognizes multiple languages, accents, and words, dynamically tuning to the needs of your business with every training session. The fastest, most accurate, most reliable, most scalable speech transcription, with understanding — rebuilt just for enterprise. We’ve reinvented ASR with 100% deep learning that allows companies to continuously improve accuracy. Stop waiting for the big tech players to improve their software and forcing your developers to manually boost accuracy with keywords in every API call. Start training your speech model and reaping the benefits in weeks, not months or years.

About

Voxtral models are frontier open source speech‑understanding systems available in two sizes—a 24 B variant for production‑scale applications and a 3 B variant for local and edge deployments, both released under the Apache 2.0 license. They combine high‑accuracy transcription with native semantic understanding, supporting long‑form context (up to 32 K tokens), built‑in Q&A and structured summarization, automatic language detection across major languages, and direct function‑calling to trigger backend workflows from voice. Retaining the text capabilities of their Mistral Small 3.1 backbone, Voxtral handles audio up to 30 minutes for transcription or 40 minutes for understanding and outperforms leading open source and proprietary models on benchmarks such as LibriSpeech, Mozilla Common Voice, and FLEURS. Accessible via download on Hugging Face, API endpoint, or private on‑premises deployment, Voxtral also offers domain‑specific fine‑tuning and advanced enterprise features.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Companies looking for Speech to Text (STT) API for real-time and batch transcriptions, on premise or in the cloud.

Audience

Developers and product teams requiring a solution to build multilingual voice interfaces and real‑time speech‑to‑action applications

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$0
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Deepgram
Founded: 2015
United States
deepgram.com

Company Information

Mistral AI
Founded: 2023
France
mistral.ai/news/voxtral

Alternatives

Alternatives

Azure AI Speech

Azure AI Speech

Microsoft
Azure AI Speech

Azure AI Speech

Microsoft

Categories

Categories

Transcription Features

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Speech Recognition Features

Audio Capture
Automatic Form Fill
Automatic Transcription
Call Analysis
Concatenated Speech
Continuous Speech
Customizable Macros
Multi-Languages
Specialty Vocabularies
Speech-to-Text Analysis
Variable Frequency
Voice Recognition

Integrations

Agentic.Market
Amazon Web Services (AWS)
Astro
Axis LMS
Bolna
ContactSwing
Creovai
Docker
Fluents.ai
Google Cloud Platform
Hugging Face
Hunch
MacWhisper
Mistral AI
Nova-3
Splutter AI
Submind
Utterly Voice
Vellum
Vocode

Integrations

Agentic.Market
Amazon Web Services (AWS)
Astro
Axis LMS
Bolna
ContactSwing
Creovai
Docker
Fluents.ai
Google Cloud Platform
Hugging Face
Hunch
MacWhisper
Mistral AI
Nova-3
Splutter AI
Submind
Utterly Voice
Vellum
Vocode
Claim Deepgram and update features and information
Claim Deepgram and update features and information
Claim Voxtral and update features and information
Claim Voxtral and update features and information