java ocr extraction text free download

Umi-OCR

OCR software, free and offline

Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines.

Downloads: 43 This Week

Last Update: 2026-01-15


GLM-OCR

Accurate × Fast × Comprehensive

GLM-OCR is an open-source multimodal optical character recognition (OCR) model built on a GLM-V encoder–decoder foundation that brings robust, accurate document understanding to complex real-world layouts and modalities. Designed to handle text recognition, table parsing, formula extraction, and general information retrieval from documents containing mixed content, GLM-OCR excels across major benchmarks while remaining highly efficient with a relatively compact parameter size (~0.9B), enabling deployment in high-concurrency services and edge environments. ...

Downloads: 4 This Week

Last Update: 2026-04-08


Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitle (hardsub) from videos

...Uses local OCR recognition, so there is no need to set up or call any API or to access online OCR services such as Baidu and Ali; text recognition completes entirely on the local machine. Supports GPU acceleration, which gives higher accuracy and faster extraction. (CLI version) Users do not need to set the subtitle area manually; the project detects it automatically with a text detection model.

1 Review

Downloads: 41 This Week

Last Update: 2026-04-05


HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It is designed to unify the entire OCR pipeline (detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation) into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...

Downloads: 1 This Week

Last Update: 2026-05-11


OCR Reader

The tool supports template-based parsing, allowing structured output i

OCR Reader is a lightweight Windows utility designed to extract text from PDF files and images using OCR (Tesseract engine). The tool supports template-based parsing, allowing structured output into CSV or TXT without manual coding. Core components: Tesseract OCR engine, Poppler (PDF rendering), and a template-based extraction system. Homepage: https://martan1484.github.io/OCR_Reader

Downloads: 0 This Week

Last Update: 2026-04-17


Search Results for "java ocr extraction text"

Showing 5 open source projects for "java ocr extraction text"

