Self-hostable alternative to Google Timeline
ANT4DOCBOOK is an ANT task for DOCBOOK
Award-winning modern data processing SDK in C++20
Computation and Visualization environment
The most accurate natural language detection library for Java
Machine learning software to solve data mining problems
Search engine and data mining applications and ClueWeb datasets.
A C library for parsing/normalizing street addresses around the world
Innovative text document search. http://dynaq.opendfki.de for details.
@Note2 - A workbench for Biomedical Text Mining
Data and Text Mining Software for Everyone
A wrapper for the famous SentiWordNet, a resource for opinion mining
GNAT recognizes gene names in text and maps them to NCBI Entrez Gene
DSTK - DataScience ToolKit for All of Us
Weka wrapper for the SGM toolkit for text classification and modeling.
Offline stemmer for Gujarati , which is one of 22 Indian languages.
cost estimation and management accounting, using neural networks
Mining knowledge from text data
TML is a Java Library for LSA and extracting Concept Maps from text