A collection of ACO algorithms for the data mining classification task
MYRA is a collection of Ant Colony Optimization (ACO) algorithms for the data mining classification task. It includes popular rule induction and decision tree induction algorithms. The algorithms can be run from the command line or called from your own Java code. They are built on a modular architecture, so they can be extended to incorporate different procedures or use different parameter values.
This project is now hosted at: https://github.com/febo/myra
Web-as-corpus tools in Java.
* Simple Crawler (and also integration with Nutch and Heritrix)
* HTML cleaner to remove boilerplate code
* Language recognition
* Corpus builder
JRete is a rule engine written in Java. Advantages over other expert system shells and artificial intelligence (AI) APIs: rules are coded in the Java language, data can be computed across a network with multiple JRete instances, automatic data persistence to a database, event-fire direc
Eureka is software for information processing. It can be used by scientists, students, journalists, or writers to organize their work. Eureka can work with multiple sources of information: web pages, HTML content indexing, book notes.