tesseract free download

Showing 35 open source projects for "tesseract"

1

Tesseract OCR

Open Source OCR Engine

Tesseract is an open source optical character recognition (OCR) engine and command-line program. OCR is a technology that recognizes text characters within a digital image. The latest version of Tesseract puts a greater focus on line recognition, but it still supports the legacy Tesseract OCR engine, which recognizes character patterns.

5 Reviews

Downloads: 2,817 This Week

Last Update: 2025-12-26
See Project
2

Tesseract.js

A pure JavaScript multilingual OCR

Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. The library supports more than 100 languages, automatic text orientation and script detection, and a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser or on a server with Node.js. It extracts words in almost any language from images.

Downloads: 22 This Week

Last Update: 2025-12-15
See Project
3

Paperless-ngx

A community-supported supercharged version of paperless

Paperless-ngx is a community-supported open-source document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper.

Downloads: 14 This Week

Last Update: 2026-04-27
See Project
4

JavaCV

Java interface to OpenCV, FFmpeg, and more

JavaCV uses wrappers from the JavaCPP Presets of commonly used libraries by researchers in the field of computer vision (OpenCV, FFmpeg, libdc1394, FlyCapture, Spinnaker, OpenKinect, librealsense, CL PS3 Eye Driver, videoInput, ARToolKitPlus, flandmark, Leptonica, and Tesseract) and provides utility classes to make their functionality easier to use on the Java platform, including Android. JavaCV also comes with hardware accelerated full-screen image display (CanvasFrame and GLCanvasFrame), easy-to-use methods to execute code in parallel on multiple cores (Parallel), user-friendly geometric and color calibration of cameras and projectors (GeometricCalibrator, ProCamGeometricCalibrator, ProCamColorCalibrator), detection and matching of feature points (ObjectFinder), a set of classes that implement direct image alignment of projector-camera systems (mainly GNImageAligner, ProjectiveTransformer, ProjectiveColorTransformer, ProCamTransformer, and ReflectanceInitializer), and more.

Downloads: 7 This Week

Last Update: 2026-02-22
See Project
5

Extractous

Fast and efficient unstructured data extraction

...For broader format support, the system combines its Rust core with ahead-of-time compiled Apache Tika shared libraries, which allows it to extend parsing coverage while still avoiding traditional server-based overhead. It also supports OCR for images and scanned documents through Tesseract, making it useful for document ingestion pipelines that include image-based or scanned inputs.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
6

LLM-Aided OCR Project

Enhances Tesseract OCR output using LLMs (local or API)

LLM Aided OCR is an open-source system designed to improve optical character recognition accuracy by combining traditional OCR tools with large language models. The project addresses common OCR challenges such as distorted text, unusual fonts, historical documents, and complex layouts that often produce inaccurate results with standard OCR pipelines. The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on...

Downloads: 0 This Week

Last Update: 2026-03-22
See Project
7

VietOCR

Provides optical character recognition (OCR) solutions for the Vietnamese language.

24 Reviews

Downloads: 139 This Week

Last Update: 2026-01-17
See Project
8

OculiX

Visual Automation IDE — automate anything you see on screen

OculiX is the evolution of SikuliX, actively maintained with the full agreement of its original creator RaiMan. Automate any desktop application using image recognition (OpenCV) and OCR (Tesseract + PaddleOCR). No access to source code or DOM required: if you can see it, you can automate it.

Key features:
- Guided step-by-step recorder with live code preview
- Image recognition via OpenCV 4.10
- Dual OCR: Tesseract (built-in) + PaddleOCR (neural, high precision)
- Local and remote automation via integrated VNC
- SSH tunnels via embedded JSch
- Cross-platform: Windows, macOS (Apple Silicon M1-M4), Linux
- Scripting: Jython, JRuby, Java, PowerShell, AppleScript
- Java 17 recommended (Java 8+ supported)
- Full CI/CD with automated builds for all platforms

Used worldwide for test automation, RPA, and visual regression testing. ...

Downloads: 39 This Week

Last Update: 5 days ago
See Project
9

Screen Translate

An OCR translator tool made by utilizing tesseract & python-opencv

STL is an easy-to-use, lightweight OCR translator tool that can be used to translate your screen. Made with Python using Tesseract and opencv-python. For a full view of the project, see the GitHub repository: https://github.com/Dadangdut33/Screen-Translate

Requirements:
- Tesseract: https://github.com/UB-Mannheim/tesseract/wiki. Needed for the OCR. Install it with all the language packs.
- LibreTranslate (optional, for offline translation support)
- Internet connection for translation if not using LibreTranslate

Setup tutorial: https://github.com/Dadangdut33/Screen-Translate#installation-and-setup

3 Reviews

Downloads: 27 This Week

Last Update: 2023-02-08
See Project
10

Convert-Screenshot-To-Text

...If you only need to recognize English, please only select English." -No installation required. It's ready to use as soon as you open it.- I have made a major upgrade to CSTT this time, including support for all Tesseract-supported languages, improved OCR accuracy, added multiple recognition modes, added keyboard shortcuts for canvas movement and zooming, and enabled users to adjust OCR settings. If you like it, please support me. Author: A_A Email: A_A_kent_leung@hotmail.com Donation: (Buy Me a Coffee) https://www.buymeacoffee.com/AAkent (PATREON) patreon.com/A_A_KENT (PAYPAL) https://www.paypal.com/paypalme/AAKENT

1 Review

Downloads: 4 This Week

Last Update: 2023-04-05
See Project
11

gImageReader

A graphical frontend to tesseract-ocr

gImageReader is a simple Gtk/Qt front-end to Tesseract. Features include:
- Import PDF documents and images from disk, scanning devices, clipboard and screenshots
- Process multiple images and documents in one go
- Manual or automatic recognition area definition
- Recognize to plain text or to hOCR documents
- Recognized text displayed directly next to the image
- Post-process the recognized text, including spellchecking
- Generate PDF documents from hOCR documents

Note: This page is only a mirror for the downloads. ...

27 Reviews

Downloads: 124 This Week

Last Update: 2022-01-28
See Project
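The hOCR output mentioned in the gImageReader entry is HTML in which each recognized word carries its bounding box in a `title` attribute. As a rough illustration only (not gImageReader's own code), the sketch below pulls words and boxes out of such a document using Python's standard library. The class name `HocrWords`, the helper `parse_hocr_words`, and the sample markup are hypothetical; the `ocrx_word` class and the `bbox x0 y0 x1 y1` property are assumed to follow the usual hOCR conventions.

```python
import re
from html.parser import HTMLParser


class HocrWords(HTMLParser):
    """Collect (text, (x0, y0, x1, y1)) for every ocrx_word span."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._bbox = None  # bbox of the word span currently open, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if "ocrx_word" in (attrs.get("class") or "").split():
            match = re.search(r"bbox (\d+) (\d+) (\d+) (\d+)", attrs.get("title") or "")
            if match:
                self._bbox = tuple(int(g) for g in match.groups())

    def handle_data(self, data):
        # The first non-blank text inside a word span is taken as the word.
        if self._bbox is not None and data.strip():
            self.words.append((data.strip(), self._bbox))
            self._bbox = None

    def handle_endtag(self, tag):
        if tag == "span":
            self._bbox = None  # drop the box if the span held no text


def parse_hocr_words(hocr):
    parser = HocrWords()
    parser.feed(hocr)
    return parser.words


# Hypothetical two-word hOCR fragment for illustration.
SAMPLE = (
    "<span class='ocr_line' title='bbox 0 0 200 120'>"
    "<span class='ocrx_word' title='bbox 36 92 96 116; x_wconf 95'>Hello</span> "
    "<span class='ocrx_word' title='bbox 109 92 173 116; x_wconf 90'>world</span>"
    "</span>"
)

if __name__ == "__main__":
    for text, bbox in parse_hocr_words(SAMPLE):
        print(text, bbox)
```

A list of words with boxes like this is the kind of data a tool would need to place a text layer over a page image when generating a PDF from hOCR.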
12

Screen Capture Image Text App Launcher

Run defined applications by detecting text in a captured screenshot

This application requires the Tesseract OCR engine to decode text in a captured screenshot. The resulting text file can be analysed for specific trigger words, each of which runs a defined application. Tesseract OCR is available for Windows users here: https://digi.bib.uni-mannheim.de/tesseract/ Information on its use is widely available; this Medium post provides an overview: https://medium.com/quantrium-tech/installing-and-using-tesseract-4-on-windows-10-4f7930313f82 During testing I used version 5 of the software. ...

Downloads: 0 This Week

Last Update: 2021-04-05
See Project
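The trigger-word idea in the Screen Capture Image Text App Launcher entry can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the project's code: the `TRIGGERS` table, the commands in it, and the function name `find_triggered_commands` are all hypothetical, and the OCR text is assumed to have been produced already.

```python
import re
import shlex

# Hypothetical mapping from a trigger word to the command it should launch.
TRIGGERS = {
    "invoice": "notepad.exe invoices.txt",
    "error": "eventvwr.exe",
}


def find_triggered_commands(ocr_text, triggers):
    """Return the argv lists of commands whose trigger word appears in the text.

    Matching is case-insensitive and on whole words only.
    """
    words = set(re.findall(r"[a-z0-9]+", ocr_text.lower()))
    return [shlex.split(cmd) for word, cmd in triggers.items() if word in words]


if __name__ == "__main__":
    sample = "Fatal ERROR: could not open file"
    for argv in find_triggered_commands(sample, TRIGGERS):
        # Pass argv to subprocess.Popen to actually start the program.
        print("would launch:", argv)
```

Whole-word matching is a deliberate choice here: OCR output is noisy, and substring matching would fire on fragments of unrelated words.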
13

cuneiformplus

Fork of OCR software cuneiform

Fork of the OCR software Cuneiform. For the original software, see https://launchpad.net/cuneiform-linux by Cognitive Technologies and Jussi Pakkanen.

Other open source OCR software:
* Tesseract by Ray Smith (using the Leptonica image library)
* GOCR
* OCRAD

Downloads: 3 This Week

Last Update: 2020-12-08
See Project
14

OCR Image Simply

Simple Windows application to OCR images

Probably the simplest Windows application to OCR images, using Tesseract 3.05.02. Languages recognized: German, English, French, Italian, Polish, Spanish. Just download the ZIP file, unzip the archive, and feel free to use it anywhere. The solution is published under the MIT license. A description can be found at: https://coolautomations.com/ocr-as-simple-as-it-can-be/

Downloads: 5 This Week

Last Update: 2020-11-03
See Project
15

BL3-MayhemMod

Automatically Re-roll Mayhem Modifiers in Borderlands 3

An AutoIt script for automatically re-rolling the Mayhem 10 modifiers in Borderlands 3. Uses Tesseract-OCR for text recognition.

Downloads: 0 This Week

Last Update: 2020-10-25
See Project
16

SwiftOCR

Fast and simple OCR library written in Swift

...As of now, SwiftOCR is optimized for recognizing short, one-line alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it excels. Tesseract is written in C++ and over 30 years old. To use it, you first have to write an Objective-C++ wrapper for it. The main issue that's slowing down Tesseract is the way memory is managed. ...

Downloads: 0 This Week

Last Update: 2023-05-29
See Project
17

neocr

Provides OCR solutions for Nepali, based on Tesseract 4.0.

NeOCR is free software based on Tesseract (Open Source OCR Engine) for the Windows operating system. It provides an easy, user-friendly interface to recognize text contained in images and PDF documents and convert it to editable text formats (.txt, .doc, .docx). This product is accessible to blind and visually impaired people (tested with NVDA and Narrator).

3 Reviews

Downloads: 6 This Week

Last Update: 2020-04-17
See Project
18

Tess4J

A Java JNA wrapper for Tesseract OCR API

9 Reviews

Downloads: 79 This Week

Last Update: 2018-05-26
See Project
19

JATI - Just Another Tesseract Interface

Another interface for Tesseract OCR to convert images to text.

Tesseract OCR is an open source, highly accurate image-to-text converter. However, Tesseract OCR provides only a command-line interface. JATI is just another interface to the Tesseract OCR engine, providing a GUI to convert an image to text. It can do batch conversion, including converting only a portion of the image into text.

3 Reviews

Downloads: 4 This Week

Last Update: 2018-08-31
See Project
20

cbrTekStraktor

An application to automatically extract text from comic books.

...It is based on the following 3 major algorithms:
- Binarization of color images (Niblack and other methods)
- Connected components
- K-Means clustering

Tesseract is used to perform optical character recognition on the extracted text. A subsequent version of the application will integrate with translation software in order to provide automated translation of comic book texts and re-insertion of translated texts.

Downloads: 2 This Week

Last Update: 2017-06-14
See Project
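The binarization step named in the cbrTekStraktor entry can be illustrated with Niblack's local threshold, where each pixel is compared with T = m + k·s, the mean and standard deviation of its neighbourhood. The sketch below is a plain-Python illustration of that rule, not the application's implementation; the function name, the 3x3 default window, and k = -0.2 are assumptions chosen for the example, and the input is assumed to be already converted to grayscale.

```python
from statistics import mean, pstdev


def niblack_binarize(gray, window=3, k=-0.2):
    """Binarize a grayscale image given as a list of rows of 0-255 ints.

    Each pixel is compared with T = local_mean + k * local_std, computed
    over a window x window neighbourhood clipped at the image border.
    Returns rows of 1 (brighter than T) and 0 (at or below T, i.e. ink).
    """
    height, width = len(gray), len(gray[0])
    r = window // 2
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            patch = [
                gray[j][i]
                for j in range(max(0, y - r), min(height, y + r + 1))
                for i in range(max(0, x - r), min(width, x + r + 1))
            ]
            threshold = mean(patch) + k * pstdev(patch)
            row.append(1 if gray[y][x] > threshold else 0)
        out.append(row)
    return out


if __name__ == "__main__":
    # A single dark pixel on a light background.
    image = [
        [200, 200, 200],
        [200, 20, 200],
        [200, 200, 200],
    ]
    for row in niblack_binarize(image):
        print(row)
```

Because the threshold adapts to each neighbourhood, this kind of rule copes with uneven page shading better than one global cut-off. In this sketch a completely flat window yields 0, since the pixel equals its own threshold.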
21

OCR-Using-Tesseract-Java-API

This paper presents the development and implementation of Optical Character Recognition (OCR) to translate images of typewritten or handwritten characters into an electronically editable format while preserving font properties. OCR does this by applying a pattern-matching algorithm. The recognized characters are stored in an editable format. OCR thus lets the computer read printed documents while discarding noise. Keywords- Optical Character Recognition, Image convert to character,...

Downloads: 0 This Week

Last Update: 2016-06-02
See Project
22

OCR For Visually Challenged Person

Provides a GUI for Tesseract OCR

It converts a scanned image into text, braille, and audio formats. The image should be scanned at 300 dpi or higher for better accuracy.

Downloads: 1 This Week

Last Update: 2015-05-24
See Project
23

Toxin OCR

Android OCR app using the Tesseract engine

Toxin Finder is an Android app that uses Google's Tesseract OCR engine to capture an image of a product's ingredient list and return a list of harmful ingredients.

Downloads: 0 This Week

Last Update: 2015-02-26
See Project
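The Toxin OCR entry describes turning an OCR'd ingredient list into a list of flagged ingredients. A minimal sketch of that matching step follows, assuming the text has already been recognized. The watch list, its entries, and the function name `flag_ingredients` are hypothetical; the app's actual list and matching logic are not given in the listing.

```python
import re

# Hypothetical watch list, for illustration only.
WATCH_LIST = {"sodium benzoate", "bha", "red 40"}


def flag_ingredients(ocr_text, watch_list):
    """Split an OCR'd ingredient list on commas/semicolons and return flagged items.

    Whitespace (including line breaks inserted by OCR) is collapsed and
    comparison is case-insensitive.
    """
    items = [
        re.sub(r"\s+", " ", part).strip().lower()
        for part in re.split(r"[,;]", ocr_text)
    ]
    return [item for item in items if item in watch_list]


if __name__ == "__main__":
    label = "Water, Sugar,  Sodium\nBenzoate; Red 40"
    print(flag_ingredients(label, WATCH_LIST))
```

Exact matching after normalization keeps the sketch simple; real OCR output often contains misread characters, so a practical version would likely need fuzzy matching.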
24

Sanskrit / Hindi - Tesseract OCR

Devanagari fonts traineddata for Tesseract OCR

See https://sourceforge.net/projects/tesseracthindi/files/OCRHindi_using_VietOCR_and_Tesseract.pdf/download for how to use the VietOCR GUI for OCR of Hindi and Sanskrit texts with tesseract-ocr.

Please see imagessan and imageshin at https://github.com/Shreeshrii/ for newer box/tiff pairs, traineddata files, OCR evaluation statistics, and ground truth files with images for Sanskrit and Hindi.

The following is OLD information, saved only for archival purposes. Tesseract OCR 3.02 provides hin.traineddata for recognizing text in Devanagari script. ...

2 Reviews

Downloads: 4 This Week

Last Update: 2017-02-17
See Project
25

lector

An interface to tesseract ocr

1 Review

Downloads: 0 This Week

Last Update: 2016-05-16
See Project