An extremely fast implementation of Aho Corasick algorithm
A smart search engine for medical documents
The Binary Usenet Tool
MITIE: library and tools for information extraction
@Note2 - A workbench for Biomedical Text Mining
PDF Library for Developers
JIRA plugin for Pentaho Data Integration
Particle Image Velocimetry
Lightweight Java web crawler framework with jQuery-style extraction
Literate programming for eclipse
An Arabic collocation extraction tool
Natural Language Processing (NLP) for the Masses
an application to automatically extract text from comic books.
Adhoc Data Exploration - Live & Easy
Ansj word segmentation
Statistical phrase-based machine translation system
Pdf images extractor
General-Purpose PDF Library for Java and .NET
Personalized Search Engine for Your Files
Personalized Search Engine for Commonly Used Files