Voxtral

Mistral AI

About

SpeechText.AI is artificial intelligence software for speech-to-text conversion and transcription of audio and video, including podcasts, using domain-specific speech recognition. The workflow has four steps:

  • Upload: Upload audio or video files. The software supports various file formats and transcribes speech to text in any language.
  • Select domain: Choose an industry domain and audio type from predefined categories to improve recognition accuracy for domain-specific words.
  • Transcribe: The speech transcription engine uses deep neural network models to convert audio to text with close to human accuracy.
  • Edit & Export: Search, modify, and verify transcriptions using interactive editing tools, then export the content in different formats.

About

Voxtral models are open source speech-understanding systems available in two sizes: a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both are released under the Apache 2.0 license. They combine high-accuracy transcription with native semantic understanding: long-form context (up to 32K tokens), built-in Q&A and structured summarization, automatic language detection across major languages, and direct function calling to trigger backend workflows from voice. Voxtral retains the text capabilities of its Mistral Small 3.1 backbone and handles audio up to 30 minutes long for transcription or 40 minutes for understanding. According to Mistral AI, it outperforms leading open source and proprietary models on benchmarks such as LibriSpeech, Mozilla Common Voice, and FLEURS. The models are available as a download on Hugging Face, through an API endpoint, or as a private on-premises deployment; domain-specific fine-tuning and advanced enterprise features are also offered.
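The audio length limits quoted above (30 minutes for transcription, 40 minutes for understanding) mean that longer recordings have to be split before processing. The following is a minimal sketch of one way to plan that split. The function name `plan_chunks`, the task labels, and the equal-window strategy are illustrative assumptions, not part of any Voxtral or Mistral AI API.

```python
from math import ceil

# Limits taken from the product description, converted to seconds.
# The task labels are hypothetical names chosen for this sketch.
MAX_SECONDS = {
    "transcription": 30 * 60,
    "understanding": 40 * 60,
}


def plan_chunks(duration_s: float, task: str = "transcription") -> list[tuple[float, float]]:
    """Split a recording into equal (start, end) windows that each fit the task limit."""
    if task not in MAX_SECONDS:
        raise ValueError(f"unknown task: {task!r}")
    if duration_s <= 0:
        raise ValueError("duration must be positive")

    limit = MAX_SECONDS[task]
    # Smallest number of windows such that each window is within the limit.
    n = ceil(duration_s / limit)
    step = duration_s / n
    return [(i * step, min((i + 1) * step, duration_s)) for i in range(n)]


if __name__ == "__main__":
    # A 75-minute recording needs three windows for transcription
    # but only two for understanding.
    for task in MAX_SECONDS:
        print(task, plan_chunks(75 * 60, task))
```

Equal-length windows avoid a very short final chunk. A real pipeline would more likely cut at silences so that words are not split across chunk boundaries.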

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Anyone who needs a program to convert audio files to text

Audience

Developers and product teams requiring a solution to build multilingual voice interfaces and real‑time speech‑to‑action applications

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

$19 one-time payment
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

SpeechText.AI
Founded: 2019
Germany
speechtext.ai

Company Information

Mistral AI
Founded: 2023
France
mistral.ai/news/voxtral

Alternatives

  • SoapBox (Soapbox Labs)
  • Azure AI Speech (Microsoft)
  • Transcribe (Wreally)

Speech Recognition Features

Audio Capture
Automatic Form Fill
Automatic Transcription
Call Analysis
Concatenated Speech
Continuous Speech
Customizable Macros
Multi-Languages
Specialty Vocabularies
Speech-to-Text Analysis
Variable Frequency
Voice Recognition

Integrations

Hugging Face
LazyTyper
Mistral AI
Quickwork

Integrations

Hugging Face
LazyTyper
Mistral AI
Quickwork