ocr free download - SourceForge

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files

OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
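As a rough illustration of the workflow described above, the following Python sketch assembles an invocation of the `ocrmypdf` command-line tool. It assumes `ocrmypdf` is installed and on the PATH; the helper names `build_ocrmypdf_command` and `add_text_layer` are hypothetical and only serve this example.

```python
import shutil
import subprocess
from typing import List, Optional


def build_ocrmypdf_command(src: str, dst: str, language: Optional[str] = None) -> List[str]:
    """Build the argument list for an ocrmypdf run (hypothetical helper)."""
    cmd = ["ocrmypdf"]
    if language:
        # -l selects the OCR language(s), for example "eng" or "eng+deu"
        cmd += ["-l", language]
    # Positional arguments: input PDF first, then output PDF
    cmd += [src, dst]
    return cmd


def add_text_layer(src: str, dst: str, language: Optional[str] = None) -> None:
    """Run ocrmypdf on src and write the searchable PDF to dst."""
    if shutil.which("ocrmypdf") is None:
        raise RuntimeError("ocrmypdf is not installed or not on PATH")
    # check=True raises CalledProcessError if ocrmypdf exits with an error
    subprocess.run(build_ocrmypdf_command(src, dst, language), check=True)
```

Calling `add_text_layer("scan.pdf", "scan_searchable.pdf", language="eng")` would then write a searchable copy of the scanned file, leaving the original untouched.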

Downloads: 112 This Week

Last Update: 2026-04-06


OCRBase

MD/.JSON Document OCR and structured data extraction API

OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput.
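The queue-based job model mentioned above can be sketched generically in Python. This is not OCRBase's API or implementation: `process_jobs` and the `extract` callback are hypothetical names, and the sketch only shows the general pattern of submitting documents as jobs that a pool of workers drains.

```python
import queue
import threading
from typing import Callable, Dict, List


def process_jobs(documents: List[str], extract: Callable[[str], str], workers: int = 2) -> Dict[str, str]:
    """Process each document through `extract` using a shared job queue."""
    jobs = queue.Queue()
    results: Dict[str, str] = {}
    lock = threading.Lock()

    def worker() -> None:
        while True:
            try:
                doc = jobs.get_nowait()  # take the next pending job
            except queue.Empty:
                return  # no jobs left, so this worker exits
            text = extract(doc)
            with lock:  # guard the shared results dictionary
                results[doc] = text

    # Submit every document as a job before starting the workers
    for doc in documents:
        jobs.put(doc)

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

In a real deployment the in-memory queue would typically be replaced by a persistent one so that jobs survive restarts; that choice is outside what the listing describes.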

Downloads: 0 This Week

Last Update: 1 day ago


Unredact

A simple tool for reading in poorly redacted documents

Unredact is a specialized tool that attempts to reconstruct redacted or obscured text in images, PDFs, or screenshots using a combination of image processing and generative AI inference to suggest plausible completions of blurred, black-boxed, or jumbled content. Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and linguistic patterns to produce candidate reconstructions. It accepts a variety of input formats, automatically identifies redacted regions, and then generates text suggestions that are presented alongside visual overlays so users can choose or refine outputs.

Downloads: 11 This Week

Last Update: 2026-02-03


OpenDataLoader PDF

PDF Parser for AI-ready data. Automate PDF accessibility

...The tool combines deterministic parsing methods with an optional hybrid AI-powered mode that improves extraction quality for difficult layouts such as multi-column documents, scanned files, and scientific papers. It includes built-in OCR capabilities supporting dozens of languages, making it suitable for digitizing low-quality or image-based PDFs. A key differentiator is its emphasis on accessibility automation, as it can generate tagged PDFs aligned with accessibility standards, significantly reducing manual remediation effort.

Downloads: 9 This Week

Last Update: 2026-04-03


NAPS2 - Not Another PDF Scanner

Scan documents to PDF and other file types, as simply as possible.

Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.

149 Reviews

Downloads: 753 This Week

Last Update: 2026-01-10


realwatermark

A Python application to add watermarks (text or image) to PDF files

A Python application that adds watermarks (text or image) to PDF files, converting them to images and back to PDF, with options for OCR and compression.

Downloads: 1 This Week

Last Update: 2025-01-27


Super PDF Editor (a Batch PDF Processor)

Create, Edit, Delete, Organize, Convert, Export, Secure & Sign PDF.

...The easy-to-use software is complete with editing tools for modifying PDF files your way. Most comprehensive, powerful, process-based and lightning-fast batch processor software. OCR PDF. PDF Imposition, Reverse Pages, Resize Page, Scale Page, Booklet, N-up Pages, Merge, Split by Page, Extract Page, Rotate Page, Replace Page, Insert Page, Delete Page. Export to Word, Excel. Password Protection, Remove Password, Watermark/Background. Your Privacy, Our Priority: Protect Your Data with Complete Confidence. ...

6 Reviews

Downloads: 33 This Week

Last Update: 2026-03-08


Super-PDF-Editor

World's most comprehensive, powerful, process-based PDF editor

World's most comprehensive, powerful, process-based and lightning-fast PDF reader, editor and batch processor. PDF editing with 60+ feature-rich tools and functions, such as OCR of PDFs and images with output as searchable PDF, text, hOCR, Box, or UNLV. Image enhancement before the OCR operation improves OCR performance. PDF imposition, etc. Super PDF Editor is best for bulk PDF processing, especially for the printing industry. Easy PDF imposition, booklets, n-up pages, and more. OCR works on any PDF file, including scanned PDFs. ...

3 Reviews

Downloads: 12 This Week

Last Update: 2023-02-02


Super-PDF-Editor-Lite

World's most comprehensive, powerful, process-based PDF editor

...Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate Page, Move Page, Printing, and Compress Page. Image enhancement before the OCR operation improves OCR performance. PDF imposition, etc. Super PDF Editor is best for bulk PDF processing, especially for the printing industry. Easy PDF imposition, booklets, n-up pages, and more. OCR works on any PDF file, including scanned PDFs. OCR also works on image files and supports multiple image formats. Automatic and manual image enhancement for better OCR accuracy and quality. ...

3 Reviews

Downloads: 1 This Week

Last Update: 2023-02-02


Merge PDF Files

It is a Windows library that merges standard PDFs into a final PDF

...You can pass the input PDFs by file name or as byte arrays, and receive the final PDF either saved to a file or returned as a byte array. Library calls can be synchronous or asynchronous. As a benchmark, the library was used to build a PDF from single-page scanned images processed by an OCR SDK (not included in the library; you can use any on the market): 20,000 images, with the OCR SDK producing single-page, text-searchable PDFs on 50 threads, in 80 minutes. The final searchable PDF was 800 MB. The download includes a sample that covers all possible scenarios (synchronous and asynchronous).

Downloads: 0 This Week

Last Update: 2020-02-12


PDF2EpubMaker

Convert PDF to epub by OCR

Qt application that converts PDF to EPUB format in several steps:
- convert PDF to PNG with libpoppler
- convert PNG to text with libtesseract
- remove hyphenation
- spell checking
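The hyphenation-removal step can be illustrated with a small Python sketch. This is not PDF2EpubMaker's code (the project is a Qt application); the function name `dehyphenate` and the regular expression are assumptions chosen for illustration, showing one simple way to rejoin words that OCR output splits across line breaks.

```python
import re

# A word character, a hyphen, optional trailing spaces, a newline,
# optional leading spaces, then another word character.
_HYPHEN_BREAK = re.compile(r"(\w)-[ \t]*\n[ \t]*(\w)")


def dehyphenate(text: str) -> str:
    """Rejoin words that were hyphenated across a line break."""
    # Keep the two surrounding characters, drop the hyphen and the newline
    return _HYPHEN_BREAK.sub(r"\1\2", text)
```

A rule this simple also merges genuinely hyphenated compounds that happen to break at the end of a line, which is one reason a spell-checking pass afterwards is useful.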

Downloads: 0 This Week

Last Update: 2019-12-25


iText®, a JAVA PDF library

PDF Library for Developers

...With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. The main project page for iText is now on GitHub, and all the latest releases, code samples, open source add-ons and tools, etc. can be found at https://github.com/itext/.

Downloads: 183 This Week

Last Update: 2024-06-01


Search Results for "ocr"

Showing 12 open source projects for "ocr"
