JIRA plugin for Pentaho Data Integration
an application to automatically extract text from comic books.
An Arabic collocation extraction tool
Adhoc Data Exploration - Live & Easy
Ansj word segmentation
General-Purpose PDF Library for Java and .NET
Personalized Search Engine for Your Files
Personalized Search Engine for Commonly Used Files
Provides GUI for Tessaract OCR
The DjVu complete solution,with OCR Technology(Arabic ,English).
Minimal offline PDF to ePUB converter for Android
A RESTFul/JSON Web Service for text and metata extraction
free image processing software
JSON based text search Java Project
Detexter is an app designed to extract text from PDF files.
TML is a Java Library for LSA and extracting Concept Maps from text
A Java package to preprocess text datasets for posterior text analysis
An OCR assistant for visually impaired people
Annotation Tool to Extract Endangered Animals from Text Resources
Java Based Heavy-duty utilitity to process large delimited text files
Automatic Arabic Domain-Relevant Term Extraction