Python binding to the Apache Tika™ REST services
Lambda architecture on Apache Spark, Apache Kafka for real-time
DSTK - DataScience ToolKit for All of Us
Weka wrapper for the SGM toolkit for text classification and modeling.
Intelligent SEO keyword mining and prediction tool
A forensic file identification tool using neural networks
Non-disjoint grouping of documents based on a word-sequence approach
Neural network implementation in Java