Search Results for "ocr application python"

Showing 60 open source projects for "ocr application python"

1

Umi-OCR

OCR software, free and offline

Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines. The software supports flexible usage patterns including screenshot capture OCR, batch...

Downloads: 81 This Week

Last Update: 2026-01-15
See Project
2

DeepSeek-OCR

Contexts Optical Compression

DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. ...

Downloads: 11 This Week

Last Update: 2025-10-25
See Project
3

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files

OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.

Downloads: 108 This Week

Last Update: 6 days ago
See Project
4

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle

PaddleOCR offers exceptional, multilingual, and practical Optical Character Recognition (OCR) tools that can help users train better models and put them into practice. Inspired by PaddlePaddle, PaddleOCR is an ultra-lightweight OCR system, with multilingual recognition, digit recognition, vertical text recognition, as well as long text recognition. It features a PPOCR series of high-quality pre-trained models, which includes: ultra-lightweight ppocr_mobile series models, general...

Downloads: 55 This Week

Last Update: 2026-01-20
See Project
5

EasyOCR

Ready-to-use OCR with 80+ supported languages

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic. EasyOCR is a Python module for extracting text from images. It is a general OCR that can read both natural scene text and dense text in documents. We currently support 80+ languages and are expanding. Second-generation models: multiple times smaller size, multiple times faster inference, additional characters and comparable accuracy to the first generation models. ...

Downloads: 31 This Week

Last Update: 2024-09-24
See Project
6

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...

Downloads: 6 This Week

Last Update: 2026-01-13
See Project
7

DocTR

Library for OCR-related tasks powered by Deep Learning

DocTR provides an easy and powerful way to extract valuable information from your documents. Seamlessly process documents for Natural Language Understanding tasks: we provide OCR predictors to parse textual information (localize and identify each word) from your documents. Robust 2-stage (detection + recognition) OCR predictors with pretrained parameters. User-friendly, 3 lines of code to load a document and extract text with a predictor. State-of-the-art performance on public document...

Downloads: 1 This Week

Last Update: 2025-07-09
See Project
8

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitles (hardsub) from videos

Extracts hard-coded subtitles from videos and generates SRT files. There is no need to apply for a third-party API, and text recognition can be performed locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating SRT files. It uses local OCR recognition, with no need to set up or call any API or to access online OCR services such as Baidu...

1 Review

Downloads: 65 This Week

Last Update: 2025-05-13
See Project
9

Paper2GUI

Convert AI papers to GUI

Convert AI papers to GUI, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology. Paper2GUI: An AI desktop app toolbox for ordinary people. It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame interpolation, video super-resolution, object detection, image stylization, OCR recognition and other fields. Supports Windows, Mac, and Linux systems. Paper2GUI:...

Downloads: 5 This Week

Last Update: 2024-09-20
See Project
10

Papermerge

Open Source Document Management System for Digital Archives

Papermerge is an open source document management system (DMS) primarily designed for archiving and retrieving your digital documents. Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats. Instantly find relevant information using full text, tags and metadata-based search. Papermerge is free and...

Downloads: 22 This Week

Last Update: 2025-07-24
See Project
11

AnyTXT Searcher

A Powerful Desktop Full-Text Search Engine, Just Like Local Google.

AnyTXT Searcher is a powerful file full-text search engine, a desktop search application for fast document retrieval. Just like a local disk Google search engine, much faster than Windows Search, it is your ideal desktop file content full-text search engine. It has a powerful document parsing engine built in, which extracts the text of commonly used file formats without installing any other software, and combines the built-in high-speed indexing system to store the metadata of the...

13 Reviews

Downloads: 5,061 This Week

Last Update: 2025-06-19
See Project
12

chessPDFBrowser

Chess application which allows working with chess PDF books and PGNs.

Chess application which allows working with PDFs and PGNs. You can work with the chess games in a PDF and edit their tree of variants. Graphical environment. Standard PGN tags. PGN comments. OCR-like features (FEN string detection from chess board position images). Connection to UCI chess engines (like Stockfish). Position analysis, full game analysis.

1 Review

Downloads: 53 This Week

Last Update: 2025-12-26
See Project
13

NAPS2 - Not Another PDF Scanner

Scan documents to PDF and other file types, as simply as possible.

Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.

149 Reviews

Downloads: 837 This Week

Last Update: 2026-01-10
See Project
14

bitfarm-Archiv Document Management - DMS

bitfarm-Archiv is a powerful Document Management (DMS), Enterprise Content Management (ECM) and Knowledge Management System (KMS) with Workflow Components. Help us! As we live in the internet age, the best way you can help is to write a short statement about your scenario and your use of the DMS, along with your experiences, and put it on your own website or in a blog or forum. It would help us most if you could also add a hyperlink to our site http://www.bitfarm-archiv.com. By this...

10 Reviews

Downloads: 8 This Week

Last Update: 2025-11-25
See Project
15

OpenKYC - FaceOnLive Community Project

FaceOnLive Open KYC: Streamlining Identity Verification with AI

Immerse yourself in the groundbreaking realm of the FaceOnLive Open KYC Project, a trailblazing endeavor at the forefront of redefining identity verification paradigms. With a commitment to leveraging the latest advancements in biometric technology, our platform presents a comprehensive solution encompassing cutting-edge features such as face recognition, face liveness detection, and ID document recognition. By seamlessly integrating these powerful tools, we empower businesses across...

149 Reviews

Downloads: 0 This Week

Last Update: 2024-04-02
See Project
16

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a standout C++20, AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...

Downloads: 8 This Week

Last Update: 2026-01-20
See Project
17

Super PDF Editor (a Batch PDF Processor)

Create, Edit, Delete, Organize, Convert, Export, Secure & Sign PDF.

Super PDF Editor - Powerful, superfast, lightweight PDF processor. All-in-one PDF solution, PDF editing with 80+ tools and functions. The easy-to-use software is complete with editing tools for modifying PDF files your way. Most comprehensive, powerful, process-based and lightning-fast batch processor software. OCR PDF. PDF Imposition, Reverse Pages, Resize Page, Scale Page, Booklet, N-up Pages, Merge, Split by page, Extract Page, Rotate Page. Replace Page, Insert Page, Delete Page....

6 Reviews

Downloads: 46 This Week

Last Update: 1 day ago
See Project
18

Nougat

Implementation of Nougat Neural Optical Understanding

Nougat is a multi-modal generative modeling framework that bridges vision and text modalities with structured generation control (e.g. layout, scene composition) rather than treating images as flat contexts. It combines object-centric modules with transformer-based reasoning to propose, refine, and render scenes in a generative pipeline. The architecture allows you to specify or prompt a layout (which objects should be where) and then the model fills in appearance, context, lighting, and...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
19

Screen Translate

An OCR translator tool made by utilizing tesseract & python-opencv

STL is an easy-to-use and light OCR translator tool that can be used to translate your screen. Made with Python by utilizing Tesseract and opencv-python. For a full view of the project you can check the GitHub repository: https://github.com/Dadangdut33/Screen-Translate REQUIREMENTS - Tesseract: https://github.com/UB-Mannheim/tesseract/wiki. Needed for the OCR.

3 Reviews

Downloads: 47 This Week

Last Update: 2023-02-08
See Project
20

OpenKM Document Management - DMS

Document Management System and Content Management System

OpenKM is an electronic document management system and record management system EDRMS (DMS, RMS, CMS). It provides a modern and flexible architecture that meets today's IT demands, based on open technology (Java, Tomcat, GWT, Lucene, Hibernate, Spring and jBPM), and is a powerful and scalable multiplatform application. OpenKM is a Web 2.0 application that works with Internet Explorer, Firefox, Safari and Opera. Can be configured in major DBMS like Oracle, PostgreSQL and MySQL among...

32 Reviews

Downloads: 515 This Week

Last Update: 2022-11-25
See Project
21

CCTV Frame Timestamp Extractor

CCTV Footage Timestamp Search Tool

...framestitch.py: Attempts to construct a video using frames extracted from an unplayable video. OCR.py: Performs image preprocessing and OCR on the extracted frames.

Downloads: 1 This Week

Last Update: 2022-07-14
See Project
22

LayoutParser

A Unified Toolkit for Deep Learning Based Document Image Analysis

With the help of state-of-the-art deep learning models, Layout Parser enables extracting complicated document structures using only several lines of code. This method is also more robust and generalizable as no sophisticated rules are involved in this process. A complete instruction for installing the main Layout Parser library and auxiliary components. Learn how to load DL Layout models and use them for layout detection. The full list of layout models currently available in Layout Parser....

Downloads: 0 This Week

Last Update: 2022-08-04
See Project
23

Ozyr

Ozyr is a simple and easy to use OCR snipping tool

Ozyr is a simple and easy to use OCR snipping tool to get text from images so you can copy and edit it. Source Code: https://github.com/PETEROLO291/Ozyr Installer: 117MB Program: 524MB Version: 1.0

1 Review

Downloads: 0 This Week

Last Update: 2022-04-13
See Project
24

e-Dokyumento

e-Dokyumento is a web-based Document Management System (DMS)

e-Dokyumento is an open source, web-based Document Management System (DMS) that automates the basic office document workflow, such as receiving, filing, routing, and approving, through capturing (scanning), digitizing (OCR reading), storing, tagging, and electronically routing and approving (e-signature) electronic documents. Demo: https://e-dokyumento.herokuapp.com/ https://edokyu.seillig.com/ (refer to Readme.md for the...

2 Reviews

Downloads: 12 This Week

Last Update: 2022-05-14
See Project
25

Paperless-ng

A supercharged version of paperless, scan, index and archive docs

Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...

Downloads: 0 This Week

Last Update: 2022-03-04
See Project
