@Note2 - A workbench for Biomedical Text Mining
PDF Library for Developers
Convolutional Recurrent Neural Network (CRNN) for image-based sequence
Java-based framework for extracting information from Arabic text
JIRA plugin for Pentaho Data Integration
An application to automatically extract text from comic books.
An Arabic collocation extraction tool
Web-based OCR for the Firefox browser & PC
Ad Hoc Data Exploration - Live & Easy
Fast Parallel Async HTTP/SSH/TCP/UDP/Ping Client Java Library
Ansj word segmentation
General-Purpose PDF Library for Java and .NET
Personalized Search Engine for Your Files
Personalized Search Engine for Commonly Used Files
Provides a GUI for Tesseract OCR
The complete DjVu solution, with OCR technology (Arabic, English).
Minimal offline PDF to ePUB converter for Android
A RESTful/JSON Web Service for text and metadata extraction
Free image processing software
Mining knowledge from text data
JSON-based text search Java project
Detexter is an app designed to extract text from PDF files.
TML is a Java Library for LSA and extracting Concept Maps from text
A Java package to preprocess text datasets for subsequent text analysis