Python binding to the Apache Tika™ REST services
Lambda architecture on Apache Spark, Apache Kafka for real-time
DSTK - DataScience ToolKit for All of Us
Weka wrapper for the SGM toolkit for text classification and modeling.
Supervised Ranking of Contigs in de novo Assemblies
Workflow Designer, Hive Editor, Pig Editor, File System Browser
A forensic file identification tool using neural networks
Non-disjoint groupping of Documents based on word sequence approach
neural network implementation in java