java ocr extraction text free download

Umi-OCR

OCR software, free and offline

Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines.

Downloads: 43 This Week

Last Update: 2026-01-15

See Project

GLM-OCR

Accurate × Fast × Comprehensive

GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...

Downloads: 4 This Week

Last Update: 2026-04-08

See Project

Scribe.js

JavaScript OCR and text extraction for images and PDFs

Scribe.js is a JavaScript library that provides Optical Character Recognition (OCR) and text extraction capabilities for both images and PDF documents, aimed at developers who want to build OCR features directly into their applications. The library can take image files (such as PNG or JPEG) and recognize the text they contain, and it can also extract text from PDF files that either already contain text or are image-based scans, using modern web standards and WebAssembly under the hood. ...

Downloads: 1 This Week

Last Update: 2026-05-06

See Project

VietOCR

Provides optical character recognition (OCR) solutions for Vietnamese language.

24 Reviews

Downloads: 195 This Week

Last Update: 2026-01-17

See Project

chessPDFBrowser

Chess application whichs allows working with chess PDF books and PGNs.

Chess application which allows working with PDFs and PGNs. You can work with the chess games of the PDF and edit their tree of variants. Graphical environment. Standard PGN TAGs. PGN comments. Ocr like (Fen string detection from chess board position images). Connection to Uci chess engines (like stockfish). Position analysis, full game analysis. You can now play games against uci engines. pdf2pgn command line command included. Detailed documentation. Multilanguage...

1 Review

Downloads: 32 This Week

Last Update: 2026-04-04

See Project

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...

Downloads: 1 This Week

Last Update: 2026-03-27

See Project

OpenKM Document Management - DMS

Document Management System and Content Management System

OpenKM Community Edition is a free Document Management System (DMS) that helps businesses control the production, storage, management and distribution of electronic documents, boosting effectiveness and productivity. It integrates document management, collaboration and advanced search into one easy-to-use solution, including administration tools for user roles, access control, security levels, activity logs and automation setup. With OpenKM Community Edition you can: Collect information...

32 Reviews

Downloads: 344 This Week

Last Update: 2026-05-14

See Project

MyBox

Easy Tools of PDF, Image, File, Network, Data, and Medias

javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.

Downloads: 0 This Week

Last Update: 2026-02-10

See Project

Manga Rikai OCR

Manga Rikai is the first consumer-ready multi-page manga OCR/translation engine. It is a spiritual successor to Capture2Text, Visual Novel Reader, and Textractor. At the moment, the engine can capture and translate single text box, detect all text boxes in a page or as many pages as you want. Not only that, you can edit the text, save your progress, and even export your work as an HTML file. Got problems? Join our discord: https://discord.com/invite/BuNuanw

1 Review

Downloads: 6 This Week

Last Update: 2021-02-23

See Project

cbrTekStraktor

an application to automatically extract text from comic books.

cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes. The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. The text extraction is...

Downloads: 5 This Week

Last Update: 2017-06-14

See Project

OCR Web based

OCR web based for Browser Firefox & PC

...Finally, I wish to inform you that you can write or draw directly on the canvas to get the subsequent character recognition and text extraction

2 Reviews

Downloads: 0 This Week

Last Update: 2018-09-05

See Project

Eye

Eye is an experimental OCR (image-to-text) application.

2 Reviews

Downloads: 0 This Week

Last Update: 2014-09-27

See Project

TCR Neuroph -Text Character Recognition

TCR Neuroph - Text Character Recognition is java tool developed to recognize scanned text , using Java Neural Network Framework - Neuroph

Downloads: 0 This Week

Last Update: 2015-09-01

See Project

JOcrad

JOcrad is a graphical frontend for GNU/Ocrad written in Java. GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method.JOcrad supports italian and english languages, JPG,PNG and GIF images.

Downloads: 0 This Week

Last Update: 2014-05-10

See Project

Search Results for "java ocr extraction text"

Showing 14 open source projects for "java ocr extraction text"

Umi-OCR

GLM-OCR

Scribe.js

VietOCR

chessPDFBrowser

DocWire SDK

OpenKM Document Management - DMS

MyBox

Manga Rikai OCR

cbrTekStraktor

OCR Web based

Eye

TCR Neuroph -Text Character Recognition

JOcrad

Search Results for "java ocr extraction text"

Showing 14 open source projects for "java ocr extraction text"

Umi-OCR

GLM-OCR

Scribe.js

VietOCR

chessPDFBrowser

DocWire SDK

OpenKM Document Management - DMS

MyBox

Manga Rikai OCR

cbrTekStraktor

OCR Web based

Eye

TCR Neuroph -Text Character Recognition

JOcrad

Related Searches

Related Categories