Fast Parallel Async HTTP/SSH/TCP/UDP/Ping Client Java Library
Ansj word segmentation
General-Purpose PDF Library for Java and .NET
Personalized Search Engine for Your Files
Personalized Search Engine for Commonly Used Files
The DjVu complete solution, with OCR Technology (Arabic, English).
A RESTful/JSON Web Service for text and metadata extraction
Mining knowledge from text data
JSON-based text search Java Project
Detexter is an app designed to extract text from PDF files.
TML is a Java Library for LSA and extracting Concept Maps from text
A Java package to preprocess text datasets for subsequent text analysis
Annotation Tool to Extract Endangered Animals from Text Resources
Java-Based Heavy-duty utility to process large delimited text files
Automatic Arabic Domain-Relevant Term Extraction