character recognition source code free download

DeepSeek-OCR

Contexts Optical Compression

DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. ...

Downloads: 2 This Week

Last Update: 2026-01-27

See Project

GLM-OCR

Accurate × Fast × Comprehensive

GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...

Downloads: 23 This Week

Last Update: 2026-04-08

See Project

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents with rich spatial structure. ...

Downloads: 7 This Week

Last Update: 2026-02-03

See Project

Scribe.js

JavaScript OCR and text extraction for images and PDFs

Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. In...

Downloads: 9 This Week

Last Update: 2026-03-14

See Project

dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

dots.ocr is a cutting-edge multilingual document parsing system built on a unified vision-language model that combines layout detection, text recognition, and structural understanding into a single architecture. Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks. The model is designed to recognize virtually any human script, making it highly effective...

Downloads: 0 This Week

Last Update: 2026-03-24

See Project

VietOCR

Provides optical character recognition (OCR) solutions for Vietnamese language.

24 Reviews

Downloads: 183 This Week

Last Update: 2026-01-17

See Project

pdfsandwich

pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images (but no editable text) will be processed by optical character recognition (OCR) and the text will be added to each page invisibly "behind" the images. pdfsandwich is a command line tool which is supposed to be useful to OCR scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries:...

8 Reviews

Downloads: 348 This Week

Last Update: 2018-08-12

See Project

Unified Character Recognition

UCR is a project name for the development of an handwritten characters in Korean language. The goal is to create a UCR Library for handwriting as well as OCR from off-line, on-line data. And we have a plan to build a UCR library for mobile.

Downloads: 0 This Week

Last Update: 2014-07-02

See Project

COSI

The Common OCR Service Interface. COSI is an API that allows developpers to easily bring OCR (Optical Character Recognition) capabilities to image processing applications. COSI supports existing OCR tools such as Tesseract, GOCR or GNU Ocrad.

Downloads: 3 This Week

Last Update: 2014-06-14

See Project

Socr3

Socr3 is a plugin-oriented, open source platform upon which I'm building an OCR suite. The name Socr3 stands for "Open Source Optical Character Recognition, Reading, Rendering, and Exporting", and is subject to change in the future.

Downloads: 0 This Week

Last Update: 2016-11-29

See Project

Conjecture

Conjecture is a modular, extensible, open-source C++ framework for Optical Character Recognition (OCR). It is not a single OCR, but rather an extensible collection of OCRs that can be explored, compared, extended and modified within a unified environment

Downloads: 0 This Week

Last Update: 2012-12-27

See Project

OOCR (Open OCR)

OOCR is a open source character recognition program, it is used to convert images to editable text.

1 Review

Downloads: 1 This Week

Last Update: 2013-04-18

See Project

Neuronal Optical Character Recognition

It's a tool who shows the concepts of a type of neuronal networks (multi-layers percetron). It's not a real ocr, it's just a little didactical application.

Downloads: 0 This Week

Last Update: 2014-06-12

See Project

Csillag

Optical Character Recognition (OCR) software.

Downloads: 0 This Week

Last Update: 2014-04-27

See Project

Search Results for "character recognition source code"

14 projects for "character recognition source code" with 2 filters applied:

DeepSeek-OCR

GLM-OCR

DeepSeek-OCR 2

Scribe.js

dots.ocr

VietOCR

pdfsandwich

Unified Character Recognition

COSI

Socr3

Conjecture

OOCR (Open OCR)

Neuronal Optical Character Recognition

Csillag

Search Results for "character recognition source code"

14 projects for "character recognition source code" with 2 filters applied:

DeepSeek-OCR

GLM-OCR

DeepSeek-OCR 2

Scribe.js

dots.ocr

VietOCR

pdfsandwich

Unified Character Recognition

COSI

Socr3

Conjecture

OOCR (Open OCR)

Neuronal Optical Character Recognition

Csillag

Related Searches

Related Categories