Mistral OCR 3Mistral AI
|
VoxtralMistral AI
|
|||||
Related Products
|
||||||
About
Mistral OCR 3 is the third-generation optical character recognition model from Mistral AI designed to achieve a new frontier in accuracy and efficiency for document processing by extracting text, embedded images, and structure from a wide range of documents with exceptional fidelity. It delivers breakthrough performance with a 74% overall win rate over the previous generation on forms, scanned documents, complex tables, and handwriting, outperforming both enterprise document processing solutions and AI-native OCR tools. OCR 3 supports output in clean text, Markdown, or structured JSON with HTML table reconstruction to preserve layout, enabling downstream systems and workflows to understand both content and structure. It powers the Document AI Playground in Mistral AI Studio for drag-and-drop parsing of PDFs and images and integrates via API for developers to automate document extraction workflows.
|
About
Voxtral models are frontier open source speech‑understanding systems available in two sizes—a 24 B variant for production‑scale applications and a 3 B variant for local and edge deployments, both released under the Apache 2.0 license. They combine high‑accuracy transcription with native semantic understanding, supporting long‑form context (up to 32 K tokens), built‑in Q&A and structured summarization, automatic language detection across major languages, and direct function‑calling to trigger backend workflows from voice. Retaining the text capabilities of their Mistral Small 3.1 backbone, Voxtral handles audio up to 30 minutes for transcription or 40 minutes for understanding and outperforms leading open source and proprietary models on benchmarks such as LibriSpeech, Mozilla Common Voice, and FLEURS. Accessible via download on Hugging Face, API endpoint, or private on‑premises deployment, Voxtral also offers domain‑specific fine‑tuning and advanced enterprise features.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Enterprise developers and data teams who need high-fidelity document digitization and structured extraction to power automation, search, and AI workflows across complex documents
|
Audience
Developers and product teams requiring a solution to build multilingual voice interfaces and real‑time speech‑to‑action applications
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
$14.99 per month
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationMistral AI
Founded: 2023
France
mistral.ai/
|
Company InformationMistral AI
Founded: 2023
France
mistral.ai/news/voxtral
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|
|||||
|
|
||||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Adobe Acrobat Reader
HTML
Hugging Face
JSON
LazyTyper
Markdown
Mistral AI
Mistral AI Studio
Mistral Document AI
|
Integrations
Adobe Acrobat Reader
HTML
Hugging Face
JSON
LazyTyper
Markdown
Mistral AI
Mistral AI Studio
Mistral Document AI
|
|||||
|
|
|