pdf ocr windows free download

Showing 36 open source projects for "pdf ocr windows"

View related business solutions

OCR C++ Clear Filters & Widen Search

Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Tesseract OCR

Open Source OCR Engine

Tesseract is an open source OCR or optical character recognition engine and command line program. OCR is a technology that allows for the recognition of text characters within a digital image. With the latest version of Tesseract, there is a greater focus on line recognition, however it still supports the legacy Tesseract OCR engine which recognizes character patterns.

5 Reviews

Downloads: 2,141 This Week

Last Update: 2025-12-26
See Project
2

PaddleOCR-json

OCR offline image text recognition command line windows program

PaddleOCR-json is an OCR engine based on the PaddleOCR project that provides a command-line interface and tools for extracting text from images and exporting results in structured JSON format. It wraps the PaddleOCR models, which are capable of detecting and recognizing text in a wide variety of languages and layouts, into a self-contained executable that can be run locally without needing a deep learning environment configured manually. This makes it practical for developers or system...

Downloads: 134 This Week

Last Update: 2026-01-15
See Project
3

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle

...PaddleOCR is easy to install and easy to use on Windows, Linux, MacOS and other systems.

Downloads: 63 This Week

Last Update: 2026-06-11
See Project
4

Hathi Download Helper

Download books from the hathitrust website in a fast and easy manner

2025-05-08 ====================== PLEASE NOTE ======================= Due to changes to the API of the hathirtust homepage, the HDH is no longer functional!! Please check the project Wiki for alternative methods. https://sourceforge.net/p/hathidownloadhelper/alternative/ ---------------------------------------------------------------------------------------------- Hathi Download Helper was a tool for downloading public domain books from hathitrust.org. E-Mail contact:...

8 Reviews

Downloads: 53 This Week

Last Update: 2026-03-13
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
5

DeepDetect

Deep Learning API and Server in C++14 support for Caffe, PyTorch

The core idea is to remove the error sources and difficulties of Deep Learning applications by providing a safe haven of commoditized practices, all available as a single core. While the Open Source Deep Learning Server is the core element, with REST API, and multi-platform support that allows training & inference everywhere, the Deep Learning Platform allows higher level management for training neural network models and using them as if they were simple code snippets. Ready for applications...

Downloads: 2 This Week

Last Update: 2026-03-27
See Project
6

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...

Downloads: 1 This Week

Last Update: 2026-06-06
See Project
7

OCR Manga Reader for Android

Android Manga reader with Japanese OCR and dictionary capabilities

OCR Manga Reader is a free and open source Android app that allows you to quickly OCR and lookup Japanese words in real-time. It does not have ads or telemetry/spyware and does not require an Internet connection. Supports both EDICT and EPWING dictionaries. Requires Android 4.0 (Ice Cream Sandwich) or higher. See http://ocrmangareaderforandroid.sourceforge.net/ for details.

3 Reviews

Downloads: 30 This Week

Last Update: 2023-10-07
See Project
8

VideoSubFinder

...It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) on video with saving info about timing positions. 2) Generation of cleared from background text images, which allows with usage of OCR programs (like FineReader, Subtitle Edit, Google Drive) to generate complete subtitles with original text and timing. For working of this program on Windows will be required "Microsoft Visual C++ Redistributable runtime libraries 2022": https://support.microsoft.com/en-us/help/2977003/the-latest-supported-visual-c-downloads Latest versions were built and tested on: Windows 10 x64, Ubuntu 20.04.5 LTS, openSUSE Leap 15.4, Arch Linux (EndeavourOS Cassini Nova 03-2023) For faster support in case of bug fixes please contact me in: https://vk.com/skosnits For donate: https://sourceforge.net/projects/videosubfinder/donate

18 Reviews

Downloads: 508 This Week

Last Update: 2023-05-01
See Project
9

Capture2Text

Quickly OCR part of the screen and save resulting text to clipboard

Capture2Text enables users to quickly OCR a portion of the screen using a keyboard shortcut. The resulting text will be saved to the clipboard by default. Supports 90+ languages including Chinese, English, French, German, Japanese, Korean, Russian, and Spanish. Portable and does not require installation. See http://capture2text.sourceforge.net for details.

89 Reviews

Downloads: 2,085 This Week

Last Update: 2022-03-19
See Project
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
10

gImageReader

A graphical frontend to tesseract-ocr

gImageReader is a simple Gtk/Qt front-end to tesseract. Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**:...

27 Reviews

Downloads: 158 This Week

Last Update: 2022-01-28
See Project
11

TNN

Uniform deep learning inference framework for mobile

TNN, a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. It also has many outstanding advantages such as cross-platform, high performance, model compression, and code tailoring. The TNN framework further strengthens the support and performance optimization of mobile devices on the basis of the original Rapidnet and ncnn frameworks. At the same time, it refers to the high performance and good scalability characteristics of the industry's...

Downloads: 0 This Week

Last Update: 2022-08-03
See Project
12

cuneiformplus

Fork of OCR software cuneiform

Fork of OCR software cuneiform Original software see: https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen Other Open Source OCR stuff see * Tesseract by Ray Smith (using the Leptonica image library) * GOCR * OCRAD

Downloads: 0 This Week

Last Update: 2020-12-08
See Project
13

nunn

This is an implementation of a machine learning library in C++17

nunn is a collection of ML algorithms and related examples written in modern C++17.

Downloads: 1 This Week

Last Update: 2019-10-25
See Project
14

openalpr

Automatic license plate recognition library

Deploy license plate and vehicle recognition with Rekor’s OpenALPR suite of solutions designed to provide invaluable vehicle intelligence which enhances business capabilities, automates tasks, and increases overall community safety! Rekor’s OpenALPR suite of solutions utilizes artificial intelligence and machine learning to greatly surpass legacy OCR solutions. Now, in real-time, users can receive a vehicle's plate number, make, model, color, and direction of travel. Rekor’s OpenALPR suite...

Downloads: 10 This Week

Last Update: 2021-06-08
See Project
15

Terese OCR verifier

Terese is a tool for proofreading OCR text. Terese tries to map the text back to the scanned image, and visually shows the differences. See the homepage for further details.

Downloads: 0 This Week

Last Update: 2015-05-20
See Project
16

yagf

YAGF is a tesseract and cuneiform wrapper and helper*

YAGF is a graphical front-end for cuneiform and tesseract OCR tools. With YAGF you can open already scanned image files or obtain new images via XSane (scanning results are automatically passed to YAGF). Once you have a scanned image you can prepare it for recognition, select particular image areas for recognition, set the recognition language and so on. Recognized text is displayed in a editor window where it can be corrected, saved to disk or copied to clipboard. YAGF also provides some...

2 Reviews

Downloads: 5 This Week

Last Update: 2016-11-25
See Project
17

tesseract-ocr alternative download

Alternative download for tesseract-ocr project

Alternative download for tesseract-ocr project

Downloads: 1,254 This Week

Last Update: 2014-05-15
See Project
18

CD+Graphics Magic

Timeline based editor for creating Compact Disc Subcode Graphics (also known as CD+G or CDG). Both karaoke and multimedia styles of content are supported. Please visit cdgmagic.sf.net for examples playable directly in the HTML5 CD+G player. CD+Graphics Scribe utility (separate download -- click "Browse All Files" above) can now convert existing CDG karaoke content to CMP (CD+Graphics Magic Project), LRC (Enhanced Lyrics), and ASS (Advanced SubStation Alpha) format.

4 Reviews

Downloads: 14 This Week

Last Update: 2013-07-25
See Project
19

CuneiDjVu

DjVu OCR based on CuneiForm

CuneiDjVu is a graphical frontend to a set of the Windows console utilities providing the DjVu OCR capability based on the CuneiForm-Linux OCR Engine

2 Reviews

Downloads: 1 This Week

Last Update: 2013-05-30
See Project
20

VedVarsha - Rain Of Knowledge

Vedvarsha is an application for 2 purposes: 1. Handwariting script recognition that extracts recognized letters into documents. 2. OCR (Optical Character Recogniton) that works only for non-cursive and isolated characters. It depends upon libsyntactic,

Downloads: 0 This Week

Last Update: 2014-06-09
See Project
21

zTranslator

zTranslator is a free and open-source software which translates text between more than 50 different languages (powered by Bing Translator, Google Translator…). It also supports OCR, which let you easy to capture any text on the screen to translate.

Downloads: 0 This Week

Last Update: 2013-04-11
See Project
22

Unified Character Recognition

UCR is a project name for the development of an handwritten characters in Korean language. The goal is to create a UCR Library for handwriting as well as OCR from off-line, on-line data. And we have a plan to build a UCR library for mobile.

Downloads: 0 This Week

Last Update: 2014-07-02
See Project
23

ocrlib

OCR c++ library of computer optical recognition methods. In library: contour recognition; contour vectorisation; matrix letters feature recognition; web based GUI; assembler core on SS3 instruction; xml support; detect page rotation and segmentation;

Downloads: 0 This Week

Last Update: 2016-07-23
See Project
24

YagpoOCRUnicode c++library

OCR c++ library. Include: contour recognition; vectorisation; matrix letter feature recognition; auto page segmentation and detect rotation; SS3 ASM core; XML base; web-based GUI; 99,6% printed Unicode text recognition; letter base up to 1200 letters.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
25

valen

Processing monocrome images to analyse the morphology. Input: image. Output: skeleton of the picture or class of the recognized object. May function as OCR.

Downloads: 0 This Week

Last Update: 2013-03-21
See Project