DeepSeek-OCRDeepSeek
|
||||||
Related Products
|
||||||
About
DeepSeek-OCR is an open source model for Contexts Optical Compression, built to explore the boundaries of visual-text compression and investigate the role of vision encoders from an LLM-centric viewpoint. It is designed to compress long contexts through optical 2D mapping, using DeepEncoder as the core engine and DeepSeek3B-MoE-A570M as the decoder. DeepEncoder maintains low activations under high-resolution input while achieving high compression ratios, keeping the number of vision tokens manageable for document understanding. The model supports OCR and document parsing workflows for images and PDFs, with inference through vLLM or Transformers. Users can run image OCR with streaming output, process PDFs with high concurrency, or run batch evaluation for benchmarks. DeepSeek-OCR can convert documents to Markdown, perform free OCR without layouts, parse figures, describe images in detail, and locate referenced text inside an image.
|
About
Docling is an easy-to-use, self-contained, MIT-licensed open source toolkit for converting messy documents into structured data and simplifying downstream document and AI processing. It can parse many popular document formats into a unified and richly structured Docling Document, including PDF, DOCX, PPTX, XLSX, HTML, Markdown, AsciiDoc, CSV, images, audio, and scanned pages through an OCR engine of the user’s choice. Docling detects tables, formulas, reading order, chunks, bounding boxes, page headers and footers, pictures, captions, code, list items, paragraphs, cells, and document structure, making extracted content easier to process, search, and ingest into AI, RAG, and agentic systems. It can export parsed documents to JSON, text, Markdown, HTML, and Doctags, giving developers flexible outputs for pipelines and applications. Docling stores and traverses components according to reading order, partitions documents into bite-sized contiguous text chunks.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
AI researchers and document-processing engineers who need an open OCR model for efficient document parsing, Markdown conversion, and vision-text compression experiments
|
Audience
AI engineers, data teams, and developers building RAG or document-intelligence systems who need an open-source toolkit to convert complex documents into structured, searchable, AI-ready data
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
Free
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationDeepSeek
Founded: 2023
China
github.com/deepseek-ai/DeepSeek-OCR
|
Company InformationDocling
United States
www.docling.ai/
|
|||||
Alternatives |
Alternatives |
|||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Markdown
DeepSeek
Google Sheets
HTML
JSON
Microsoft Excel
Model Context Protocol (MCP)
Python
|
Integrations
Markdown
DeepSeek
Google Sheets
HTML
JSON
Microsoft Excel
Model Context Protocol (MCP)
Python
|
|||||
|
|
|