DeepSeek-OCR
DeepSeek
|
DeepSeek R2
DeepSeek
|
|||||
About
DeepSeek-OCR is an open-source model for Contexts Optical Compression, built to explore the boundaries of visual-text compression and investigate the role of vision encoders from an LLM-centric viewpoint. It is designed to compress long contexts through optical 2D mapping, using DeepEncoder as the core engine and DeepSeek3B-MoE-A570M as the decoder. DeepEncoder maintains low activations under high-resolution input while achieving high compression ratios, keeping the number of vision tokens manageable for document understanding. The model supports OCR and document parsing workflows for images and PDFs, with inference through vLLM or Transformers. Users can run image OCR with streaming output, process PDFs with high concurrency, or run batch evaluation for benchmarks. DeepSeek-OCR can convert documents to Markdown, perform free OCR without layouts, parse figures, describe images in detail, and locate referenced text inside an image.
|
About
DeepSeek R2 is the anticipated successor to DeepSeek R1, a groundbreaking AI reasoning model launched in January 2025 by the Chinese AI startup DeepSeek. Building on R1’s success, which disrupted the AI industry with its cost-effective performance rivaling top-tier models like OpenAI’s o1, R2 promises a quantum leap in capabilities. It is expected to deliver exceptional speed and human-like reasoning, excelling in complex tasks such as advanced coding and high-level mathematical problem-solving. Leveraging DeepSeek’s innovative Mixture-of-Experts architecture and efficient training methods, R2 aims to outperform its predecessor while maintaining a low computational footprint, potentially expanding its reasoning abilities to languages beyond English.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI researchers and document-processing engineers who need an open OCR model for efficient document parsing, Markdown conversion, and vision-text compression experiments
|
Audience
Experts, engineers, and institutions looking for cutting-edge open-source reasoning models to tackle complex challenges in areas such as mathematics, programming, and AI-powered problem-solving
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company Information
DeepSeek
Founded: 2023
China
github.com/deepseek-ai/DeepSeek-OCR
|
Company Information
DeepSeek
Founded: 2023
China
deepseek.com
|
|||||
Integrations
DeepSeek
C
C++
Clojure
Elixir
Go
HTML
Java
JavaScript
Kotlin
|
Integrations
DeepSeek
C
C++
Clojure
Elixir
Go
HTML
Java
JavaScript
Kotlin
|