DeepSeek-OCR 2

DeepSeek-OCR 2 is a second-generation optical character recognition (OCR) system aimed at better document understanding. Its central idea is a “visual causal flow” mechanism that lets the encoder reorder visual tokens to reflect a document's semantic structure instead of strict raster-scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning over visual tokens that mimics human visual scanning, which is intended to improve OCR on documents with rich spatial structure. The repository provides model code and inference scripts for running and benchmarking the system on both images and PDFs, with support for batch evaluation and optimized pipelines built on vLLM and transformers.
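To make the difference between raster-scan order and a structure-aware order concrete, here is a minimal toy sketch. It is not the model's mechanism: DeepSeek-OCR 2 is described as learning its token ordering inside the encoder, whereas this example uses a hand-written two-column heuristic. The `Patch` type, the `column_reading_order` function, and the `split_col` parameter are all hypothetical names introduced only for illustration.

```python
from typing import List, NamedTuple


class Patch(NamedTuple):
    """A toy stand-in for a visual token: grid position plus its text."""
    row: int
    col: int
    text: str


def raster_order(patches: List[Patch]) -> List[Patch]:
    # Strict raster scan: top-to-bottom, then left-to-right within a row.
    return sorted(patches, key=lambda p: (p.row, p.col))


def column_reading_order(patches: List[Patch], split_col: int) -> List[Patch]:
    # Hypothetical heuristic (an assumption, not the learned ordering):
    # read the whole left column first, then the whole right column.
    return sorted(patches, key=lambda p: (p.col >= split_col, p.row, p.col))


# A two-column page with two lines per column.
patches = [
    Patch(0, 0, "Left-1"), Patch(0, 2, "Right-1"),
    Patch(1, 0, "Left-2"), Patch(1, 2, "Right-2"),
]

raster = [p.text for p in raster_order(patches)]
reading = [p.text for p in column_reading_order(patches, split_col=2)]

print("raster :", raster)
print("reading:", reading)
```

On this two-column page, raster order interleaves lines from the two columns, while the column-aware order keeps each column's lines together. That interleaving is the kind of mismatch between scan order and semantic structure that the description above refers to.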

Features

Visual causal token ordering for improved OCR
Support for complex layouts and semantic structure
Inference scripts for images and PDF processing
Integration with vLLM and transformer backends
Batch evaluation and benchmark tools
Outputs suited for markdown and structured text

License

Apache License 2.0

Additional Project Details

Programming Language

Python

Related Categories

Python OCR Software, Python AI Models

Registered

7 hours ago
