Search Results for "java ocr extraction text" - Page 2

Showing 58 open source projects for "java ocr extraction text"


Active filters: Artificial Intelligence, Windows

1

Common Resource Grep - crgrep

Common Resource Grep

CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources in any combination and to any depth, for example text within a document within a zip archive. Here you...

3 Reviews

Downloads: 1 This Week

Last Update: 2023-04-23
See Project
2

aseryla

Aseryla code repositories

This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/

Downloads: 0 This Week

Last Update: 2021-10-29
See Project
3

Manga Rikai OCR

Manga Rikai is the first consumer-ready multi-page manga OCR/translation engine. It is a spiritual successor to Capture2Text, Visual Novel Reader, and Textractor. At the moment, the engine can capture and translate a single text box, or detect all text boxes in a page or in as many pages as you want. You can also edit the text, save your progress, and export your work as an HTML file. Got problems? Join our Discord: https://discord.com/invite/BuNuanw

1 Review

Downloads: 6 This Week

Last Update: 2021-02-23
See Project
4

TIES

A smart search engine for medical documents

TIES (Text Information Extraction System) is a clinical text search engine that uses Natural Language Processing techniques to extract medical concepts from free-text clinical reports. It provides secure, de-identified access to this information and has built-in collaboration tools and honest broker functionality. It is licensed for academic use under the BSD license. For commercial use please contact Nexi at http://nexihub.com *** NOTICE: this software and forum are no longer...

1 Review

Downloads: 0 This Week

Last Update: 2019-09-09
See Project
5

Convolutional Recurrent Neural Network

Convolutional Recurrent Neural Network (CRNN) for image-based sequence

Convolutional Recurrent Neural Network provides an implementation of the Convolutional Recurrent Neural Network (CRNN) architecture, a deep learning model designed for image-based sequence recognition tasks such as optical character recognition and scene text recognition. The architecture combines convolutional neural networks for extracting visual features from images with recurrent neural networks that model sequential dependencies in the extracted features. This hybrid approach allows the...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
6

cbrTekStraktor

An application to automatically extract text from comic books.

cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can, however, also be used for scanlation or similar purposes. The application also lets you manually define text areas in CBR files, and it includes a simple graphical editor for further processing of the extracted text. The text extraction is...

Downloads: 5 This Week

Last Update: 2017-06-14
See Project
7

OCR Web based

Web-based OCR for the Firefox browser and PC

...Finally, I wish to inform you that you can write or draw directly on the canvas to get the subsequent character recognition and text extraction.

2 Reviews

Downloads: 0 This Week

Last Update: 2018-09-05
See Project
8

Ansj Chinese word segmentation

Ansj word segmentation

A true Java implementation of ict. Word segmentation is faster than in the open source version of ict. Features include Chinese word segmentation, name recognition, part-of-speech tagging, and user-defined dictionaries. This is a Java implementation of Chinese word segmentation based on n-Gram+CRF+HMM. The word segmentation speed reaches about 2 million words per second (tested on a MacBook Air), and the accuracy can exceed 96%. At present, it has realized the functions of Chinese word...

1 Review

Downloads: 0 This Week

Last Update: 2021-09-22
See Project
9

OCR For Visually Challenged Person

Provides a GUI for Tesseract OCR

It converts scanned images into text, braille, and audio formats. Images should be scanned at 300 dpi or higher for better accuracy.

Downloads: 4 This Week

Last Update: 2015-05-24
See Project
10

DJVU++

The complete DjVu solution, with OCR technology (Arabic, English).

DjVu++ is a user-friendly program used to manipulate DjVu files such as eBooks, with plenty of editing features. The program offers a free replacement for the proprietary PDF format, with similar resolution and smaller file size. DjVu++ also supports OCR to handle text in scanned books and images. The program shows good performance for English, as well as for Arabic, where it aims to lead free and commercial software in this area. The main features of the DjVu++ program are: o...

4 Reviews

Downloads: 5 This Week

Last Update: 2015-08-24
See Project
11

Vision2u

Free image processing software

Vision2u offers free image processing software for personal use and research. The primary image processing tasks can be carried out through simple operation of the software. Any webcam owner can perform basic measuring, counting, or monitoring tasks without high capital outlay.

Downloads: 0 This Week

Last Update: 2015-05-01
See Project
12

Eye

Eye is an experimental OCR (image-to-text) application.

2 Reviews

Downloads: 0 This Week

Last Update: 2014-09-27
See Project
13

FALCON - Text Search Java Project

JSON based text search Java Project

What is it? "Falcon Search" is a Java API and tool for searching inside documents. It was originally started to search the content of PDF files under the project "HAWK Search". Searching with this tool is query-based, not word-based as in most document search tools or document readers. It also handles jumbled word order within a query and spelling mistakes. Commonly used techniques in this project are Natural Language...

Downloads: 0 This Week

Last Update: 2014-04-18
See Project
14

TML - Text Mining Library for LSA & CMM

TML is a Java Library for LSA and extracting Concept Maps from text

TML has moved to http://www.villalon.cl/tml.html and the code to https://github.com/villalon/tml

3 Reviews

Downloads: 0 This Week

Last Update: 2013-08-05
See Project
15

TextProcessor

A Java package to preprocess text datasets for posterior text analysis

The TextProcessor Java package is a text processing toolkit that provides frequently used text processing functions such as stemming, removing stop-words, generating a term vocabulary, and calculating the term-doc frequency matrix. Basic topic mining models such as LDA and sparse NMF are also supported. The package can also generate feature files from a given text dataset in LDA and LIBSVM formats for subsequent procedures such as classification or clustering. The toolkit is also...

Downloads: 0 This Week

Last Update: 2015-11-23
See Project
16

RapidMiner Information Extraction Plugin

The Information Extraction Plugin allows the use of information extraction techniques within RapidMiner. It can be seen as an interface between natural language and IE or data mining methods, extracting interesting information from documents.

Downloads: 0 This Week

Last Update: 2015-08-07
See Project
17

G-Asks

G-Asks is a question generation system, developed by the LATTE (Learning and Affect Technologies Engineering) research group at The University of Sydney. It uses Natural Language Processing techniques and machine learning algorithms to generate specific trigger questions. If you use this software in a publication, please cite the paper 2. 1. Ming Liu and Rafael A. Calvo (2012) “Using Information Extraction to Generate Trigger Question for Academic Writing Support”, 11th International Conference...

Downloads: 0 This Week

Last Update: 2013-04-29
See Project
18

DBpedia Spotlight

DBpedia Spotlight is a tool for annotating mentions of DBpedia resources in natural language text. The source code is now hosted on GitHub: https://github.com/dbpedia-spotlight

1 Review

Downloads: 0 This Week

Last Update: 2013-06-04
See Project
19

BioEvent

This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text.

Downloads: 0 This Week

Last Update: 2013-04-25
See Project
20

text-analysis

This project aims to implement the following text mining techniques in Java: text language detection, keyword and keyphrase extraction, text classification, text clustering, single- or multiple-document summarization, and plagiarism detection.

Downloads: 0 This Week

Last Update: 2014-05-20
See Project
21

TextMarker

TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full-featured editor for the rule language and the build process for UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-29
See Project
22

SEMANTIXS

SEMANTIXS is a semantic information extraction system that can extract, represent and visualize domain-specific information from free text in the form of complex (and simple) relationships. See http://www.cs.iastate.edu/~semantix/ for more info.

Downloads: 0 This Week

Last Update: 2013-05-02
See Project
23

iracema

An information extraction library implementing modern algorithms for the extraction of named entities from text.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
24

moara

Moara is a biological text mining tool consisting of a Java library and some auxiliary MySQL databases for gene/protein training, extraction of mentions, and their further normalization and disambiguation.

Downloads: 0 This Week

Last Update: 2013-04-10
See Project
25

TCR Neuroph -Text Character Recognition

TCR Neuroph - Text Character Recognition is a Java tool developed to recognize scanned text, using the Java neural network framework Neuroph.

Downloads: 0 This Week

Last Update: 2015-09-01
See Project


© 2026 Slashdot Media. All Rights Reserved.
