DeepSeek-OCR

DeepSeek-OCR

DeepSeek
Pixtral Large

Pixtral Large

Mistral AI
+
+

Related Products

  • MyQ
    197 Ratings
    Visit Website
  • TinyPNG
    58 Ratings
    Visit Website
  • PackageX OCR Scanning
    48 Ratings
    Visit Website
  • CirrusPrint
    2 Ratings
    Visit Website
  • ONLYOFFICE Docs
    715 Ratings
    Visit Website
  • AthenaHQ
    38 Ratings
    Visit Website
  • MobiPDF (formerly PDF Extra)
    6,998 Ratings
    Visit Website
  • Evertune
    1 Rating
    Visit Website
  • MASV
    94 Ratings
    Visit Website
  • Foxit Document Workflow APIs
    6 Ratings
    Visit Website

About

DeepSeek-OCR is an open source model for Contexts Optical Compression, built to explore the boundaries of visual-text compression and investigate the role of vision encoders from an LLM-centric viewpoint. It is designed to compress long contexts through optical 2D mapping, using DeepEncoder as the core engine and DeepSeek3B-MoE-A570M as the decoder. DeepEncoder maintains low activations under high-resolution input while achieving high compression ratios, keeping the number of vision tokens manageable for document understanding. The model supports OCR and document parsing workflows for images and PDFs, with inference through vLLM or Transformers. Users can run image OCR with streaming output, process PDFs with high concurrency, or run batch evaluation for benchmarks. DeepSeek-OCR can convert documents to Markdown, perform free OCR without layouts, parse figures, describe images in detail, and locate referenced text inside an image.

About

Pixtral Large is a 124-billion-parameter open-weight multimodal model developed by Mistral AI, building upon their Mistral Large 2 architecture. It integrates a 123-billion-parameter multimodal decoder with a 1-billion-parameter vision encoder, enabling advanced understanding of documents, charts, and natural images while maintaining leading text comprehension capabilities. With a context window of 128,000 tokens, Pixtral Large can process at least 30 high-resolution images simultaneously. The model has demonstrated state-of-the-art performance on benchmarks such as MathVista, DocVQA, and VQAv2, surpassing models like GPT-4o and Gemini-1.5 Pro. Pixtral Large is available under the Mistral Research License for research and educational use, and under the Mistral Commercial License for commercial applications.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

AI researchers and document-processing engineers who need an open OCR model for efficient document parsing, Markdown conversion, and vision-text compression experiments

Audience

AI developers interested in a powerful multimodal model

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

No images available

Pricing

Free
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

DeepSeek
Founded: 2023
China
github.com/deepseek-ai/DeepSeek-OCR

Company Information

Mistral AI
Founded: 2023
France
mistral.ai/news/pixtral-large/

Alternatives

GLM-OCR

GLM-OCR

Z.ai

Alternatives

Aya Vision

Aya Vision

Cohere
DeepSeek-VL

DeepSeek-VL

DeepSeek
DeepSeek-V2

DeepSeek-V2

DeepSeek
Mistral Small

Mistral Small

Mistral AI
Ministral 3

Ministral 3

Mistral AI
DeepSeek-V4

DeepSeek-V4

DeepSeek
Mistral 7B

Mistral 7B

Mistral AI

Categories

Categories

Integrations

AI-FLOW
Airtrain
BlueGPT
Continue
DeepSeek
Echo AI
Groq
Humiris AI
Kiin
Lunary
Melies
Microsoft Foundry Agent Service
MindMac
Mistral AI
Pipeshift
Ragas
Superinterface
Symflower
bolt.diy
promptmate.io

Integrations

AI-FLOW
Airtrain
BlueGPT
Continue
DeepSeek
Echo AI
Groq
Humiris AI
Kiin
Lunary
Melies
Microsoft Foundry Agent Service
MindMac
Mistral AI
Pipeshift
Ragas
Superinterface
Symflower
bolt.diy
promptmate.io
Claim DeepSeek-OCR and update features and information
Claim DeepSeek-OCR and update features and information
Claim Pixtral Large and update features and information
Claim Pixtral Large and update features and information