Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "optical character recognition"

x

Sort By:

Relevance

OS

Linux 78
Windows 70
Mac 55
More...
BSD 35
ChromeOS 18
Desktop Operating Systems 4
Mobile Operating Systems 4

Category

Artificial Intelligence 52
Multimedia 24
Scientific/Engineering 12
Software Development 12
Education 8
System 6
Text Editors 6
Business 4
Formats and Protocols 3
Internet 3
Security 3
Communications 1
Social sciences 1

License

OSI-Approved Open Source 71
Creative Commons Attribution License 4
Public Domain 2
GNU Free Documentation License 1

Translations

English 38
Russian 5
Chinese (Simplified) 4
French 4
More...
Italian 3
Japanese 3
Polish 3
Chinese (Traditional) 2
Czech 2
German 2
Slovak 2
Spanish 2
Bulgarian 1
Catalan 1
Dutch 1
Hindi 1
Lithuanian 1
Persian 1
Portuguese 1
Romanian 1
Slovene 1
Tamil 1
Turkish 1
Ukrainian 1
Vietnamese 1

Programming Language

Java 21
C++ 19
C 18
Python 15
More...
PHP 9
C# 6
JavaScript 6
OCaml (Objective Caml) 2
Ruby 2
Ada 1
Assembly 1
Delphi/Kylix 1
MATLAB 1
Perl 1
Prolog 1
S/R 1
Scala 1
Swift 1
Tcl 1
Visual Basic 1
Visual Basic .NET 1

Status

Alpha 18
Production/Stable 18
Beta 14
Pre-Alpha 11
More...
Planning 5
Mature 4
Inactive 1

Showing 94 open source projects for "optical character recognition"

View related business solutions

Auth for GenAI | Auth0
Enable AI agents to securely access tools, workflows, and data with fine-grained control and just a few lines of code.

Easily implement secure login experiences for AI Agents - from interactive chatbots to background workers with Auth0. Auth for GenAI is now available in Developer Preview

Try free now
Simplify IT and security with a single endpoint management platform
Automate the hardest parts of IT

NinjaOne automates the hardest parts of IT, delivering visibility, security, and control over all endpoints for more than 20,000 customers. The NinjaOne automated endpoint management platform is proven to increase productivity, reduce security risk, and lower costs for IT teams and managed service providers. The company seamlessly integrates with a wide range of IT and security technologies. NinjaOne is obsessed with customer success and provides free and unlimited onboarding, training, and support.

Learn More
1

Tesseract OCR

Open Source OCR Engine

Tesseract is an open source OCR or optical character recognition engine and command line program. OCR is a technology that allows for the recognition of text characters within a digital image. With the latest version of Tesseract, there is a greater focus on line recognition, however it still supports the legacy Tesseract OCR engine which recognizes character patterns. Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports...

Downloads: 2,010 This Week

Last Update: 2025-05-25
See Project
2

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle

PaddleOCR offers exceptional, multilingual, and practical Optical Character Recognition (OCR) tools that can help users train better models and apply them into practice. Inspired by PaddlePaddle, PaddleOCR is an ultra lightweight OCR system, with multilingual recognition, digit recognition, vertical text recognition, as well as long text recognition. It features a PPOCR series of high-quality pre-trained models, which includes: ultra lightweight ppocr_mobile series models, general ppocr_server...

Downloads: 22 This Week

Last Update: 2025-06-05
See Project
3

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files

OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.

Downloads: 70 This Week

Last Update: 2025-05-27
See Project
4

Self-Operating Computer

A framework to enable multimodal models to operate a computer

.... The framework supports features like Optical Character Recognition (OCR) and Set-of-Mark (SoM) prompting to enhance visual grounding capabilities. It is designed to be compatible with macOS, Windows, and Linux (with X server installed), and is released under the MIT license.

1 Review

Downloads: 9 This Week

Last Update: 2025-02-28
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

Concordia

Crowdsourcing platform for full text transcription and tagging

Concordia is a platform for crowdsourcing transcription and tagging of text in digitized images. It was developed by the Library of Congress so that volunteers of all backgrounds could transcribe and tag digitized images of manuscripts and typed materials from the Library’s collections that could not otherwise be done by optical character recognition.

Downloads: 3 This Week

Last Update: 2025-05-09
See Project
6

Docspell

Assist in organizing your piles of documents

Docspell is a personal document organizer. Or sometimes called a "Document Management System" (DMS). You'll need a scanner to convert your papers into files. Docspell can then assist in organizing the resulting mess. It can unify your files from scanners, emails, and other sources. It is targeted for home use, i.e. families, households, and also for smaller groups/companies. You can associate tags, set correspondent,s and lots of other predefined and custom metadata. If your documents are...

Downloads: 11 This Week

Last Update: 2025-03-15
See Project
7

Tesseract.js

A pure Javascript Multilingual OCR

Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine. Tesseract.js' library supports more than 100 languages, automatic text orientation and script detection, a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser and on a server with NodeJS. Tesseract.js is a javascript library that gets words in almost any spoken language out of images. The main Tesseract.js functions (ex. recognize, detect) take an image...

Downloads: 15 This Week

Last Update: 2025-04-07
See Project
8

Best-of Machine Learning with Python

A ranked list of awesome machine learning Python libraries

This curated list contains 900 awesome open-source projects with a total of 3.3M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! General-purpose machine learning and deep learning...

Downloads: 4 This Week

Last Update: 6 days ago
See Project
9

Qualisys Unity SDK

Unity package for the C# (.NET) implementation for Qualisys Track

... bodies or any other Unity object. Qualisys provides a robust skeleton solver that lets you solve one or more actors in real-time. Capturing crouching, wrestling and lying on the floor has never been this straightforward. By combining skeleton solving with AIM, you can capture advanced setups in a simplified workflow. An FBX is the easiest way to read mocap data in external gaming or animation software. Our FBX files contain characters, skeletons, optical markers and actors.

Downloads: 0 This Week

Last Update: 2023-01-18
See Project
Gen AI apps are built with MongoDB Atlas
Build gen AI apps with an all-in-one modern database: MongoDB Atlas

MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.

Start Free
10

VietOCR

Provides optical character recognition (OCR) solutions for Vietnamese language.

24 Reviews

Downloads: 503 This Week

Last Update: 2025-06-08
See Project
11

tgcf

The ultimate tool to automate custom telegram message forwarding

The ultimate tool to automate custom telegram message forwarding. Live-syncer, Auto-poster, backup-bot, cloner, chat-forwarder, duplicator, ... Call it whatever you like! tgcf is an advanced telegram chat forwarding automation tool that can fulfill all your custom needs.

Downloads: 2 This Week

Last Update: 2024-09-19
See Project
12

Image To Text tools

ITTT is a Free tool designed to Scan and extract Text from Images.

Image To Text Tools is a 100% Free user-friendly tool designed to Scan and extract containing text in images into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.

Downloads: 61 This Week

Last Update: 2024-02-21
See Project
13

scantailor-experimental

Scan Tailor Experimental is an interactive post-processing tool

Scan Tailor Experimental is an interactive post-processing tool for scanned pages. You give it raw scans, and you get pages ready to be printed or assembled into a PDF or DJVU file. Scanning, optical character recognition, and assembling multi-page documents are out of scope of this project.

Downloads: 30 This Week

Last Update: 2024-11-27
See Project
14

KoboldCpp

Run GGUF models easily with a UI or API. One File. Zero Install.

KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models, inspired by the original KoboldAI. It's a single self-contained distributable that builds off llama.cpp and adds many additional powerful features.

Downloads: 113 This Week

Last Update: 4 days ago
See Project
15

Img2Txt

Img2Txt - Extract Text From Images using AI

Important: If you are sharing this program. Please Include the official Download Link What is Img2Txt? Img2Txt is a Python-based application packaged using PyInstaller that utilizes the power of pytesseract, an AI-powered optical character recognition (OCR) library, to extract text from images and convert it into plain text. The application features a simple and modern user-friendly interface created using customtkinter, allowing users to easily process images and obtain the text within...

1 Review

Downloads: 8 This Week

Last Update: 2023-08-15
See Project
16

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...

Downloads: 61 This Week

Last Update: 2025-05-23
See Project
17

realwatermark

A Python application to add watermarks (text or image) to PDF files

A Python application to add watermarks (text or image) to PDF files, converts them into image and back to PDF with options for OCR and compression.

Downloads: 1 This Week

Last Update: 2025-01-27
See Project
18

Windows Power Utilities

Windows Power Utilities adds automation tools to Windows

Utilities for Windows Power Users

Downloads: 5 This Week

Last Update: 2025-05-27
See Project
19

Dual Clip Translator

Translation of Selected text or Clipboard contents powered by Google. HotKeys Paste/Change Text auto translated. View in Balloon/Window the result of translation, besides being sent to the clipboard. Screen Capture of Desktop/Game > OCR > Translated.

5 Reviews

Downloads: 38 This Week

Last Update: 2023-05-26
See Project
20

queXF

Web based, Open Source alternative to Remark OMR or Teleform

queXF, a CADE (Computer Assisted Data Entry) Tool, processes filled paper forms that were created in queXML, such as survey questionnaires. queXF can be used as a web based, Open Source alternative to programs such as Cardiff Teleform and Remark OMR.

2 Reviews

Downloads: 7 This Week

Last Update: 2024-07-23
See Project
21

cuneimusicplus

Optical music recognition library

Optical music recognition library in C++/C

Downloads: 0 This Week

Last Update: 2023-02-07
See Project
22

tom_core

tom_core - a tool for automating events on a computer

tom_core is a software tool used for the automation of everything that happens on your computer. By using this application, you can easily record your activity on your computer, starting the recording at any moment that you choose. The application repeats all your clicks or drags, keystrokes, hotkeys, etc. All in exactly the timing and number of repetitions you need. The toolbox such as the optical recognition and voice control enables to branch out the recordings into complex forms...

Downloads: 0 This Week

Last Update: 2022-05-17
See Project
23

SwiftOCR

Fast and simple OCR library written in Swift

SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line long alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it exceeds. Tesseract...

Downloads: 4 This Week

Last Update: 2023-05-29
See Project
24

pdfsandwich

pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images (but no editable text) will be processed by optical character recognition (OCR) and the text will be added to each page invisibly "behind" the images. pdfsandwich is a command line tool which is supposed to be useful to OCR scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries...

8 Reviews

Downloads: 329 This Week

Last Update: 2018-08-12
See Project
25

Text Picker

Use your camera to identify and pick texts such as serial numbers.

The TextPicker uses your camera and optical character recognition to extract a text from what your camera sees. You must type a regex pattern (or choose one from the several pre-configured regex pattern). Only texts that match the pattern will be picked. This software is mainly used for recognizing serial numbers in currencies of the world. You can make other similar uses as well.

Downloads: 0 This Week

Last Update: 2018-06-10
See Project

Previous
You're on page 1
2
3
4
Next

Related Searches

tesseract-ocr

tesseract

ocr

tesseract-ocr-setup-3.02.02.exe

tesseract ocr

delphiocr

arabic ocr

pdf ocr

optical character recognition delphi

screen translator

Related Categories

Artificial Intelligence

Multimedia

Scientific/Engineering

Software Development

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
225 Broadway Suite 1600
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2025 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

Want the latest updates on software, tech news, and AI?

Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: