iScore measures the interestingness of news articles in a limited user environment. It is an online learning algorithm that combines a large set of disparate features to classify documents. To download the source code, please use subversion.
R2HTML is a package for R, which includes HTML function and methods to output R objects to HTML. Allow for redirection on the fly. A driver for Sweave allows to parse HTML flat files containing R code and to automatically write the corresponding outputs.
Automatic generation of documentation on Delphi projects from source code. Distinctive features are exact parsing gathering lots of information and a division of the parser and configurable generators (HTML, Win- & HTML-Help, PDF, LaTeX, XMI export)
NDAD is the UK National Digital Archive of Datasets. This project contains the source code for the bespoke software which drives the public website and many aspects of the data curation/archiving process.
Parallel IBDWS is an upgrade from IBDWS that utilizes parallel processing. Also, the incorporation of DNA sequence data has been added since the last code release
febrl-gen is a Java-based frontend to Febrl, an open-source data linkage system written in Python. Users can configure the parameters of a linkage project through the frontend, and febrl-gen will generate a Febrl-ready configuration file.
Poor Man's HPC is a framework that allows distributing and running code on a server farm. pmHPC is a scaled down and simplified version of distributed computing projects such as SETI, so is a perfect fit for enthusiasts and universities.
Comprehensive & high performance data distribution &transformation system. Features a simple, user-friendly event driven scripting interface transparently generates & execs highly efficient Perl/C code. Uses:ETL,datawarehousing,statistics,data-cleansing.
K-automaton is a new parsing (syntactic analysis) machine isomorphous to language. Implemented in Java. Can generate Java code from grammars described in EBNF.
A code for fast multi-dimensional density estimation . Instead of assuming an a-priori metric definition, it calculates a locally adaptive metric for each data point by using, a Shannon Entropy based, binary space partitioning scheme.
NOTE: Use Fltk_Contour insted this code! FMesh is a simply scientific data visualization tool based on OpenGL. FMesh can generate 2D and 3D graph of contour and color maps for any (x,y,z) scattered data set, many other options are included.
JRete is a rule engine written in Java. Advantages over other expert system shell and artificial intelligence(AI) API - code rules in java language, data may compute accross network with multiple JRete, auto data persistence to database, event-fire direc
Recon-Calc is a source code feature location tool based on the Software Reconnaissance technique. It allows you to name features of your system and then, based on coverage profiles gathered, you can find where they are implemented in the source code.
Uni-wordsplit aimed to provide a unicode(lexical analysis/word splitter) system.Especially designed for CJK(China/Japan/Korea) users.
The Code based on Mozilla-XPCOM code.
SNNSraster is a utility for quick ANN analysis of raster GIS maps with the use of Stuttgart Neural Network Simulator trained network files. It was developed to read and write binary raster files.
SNNSraster is a project of the Geography Laboratory of the University of Siena. The code was developed by Giancarlo Macchi Jánica between 2006 and 2007. SNNSraster's fundamental objective is to improve the ability to integrate the use of artificial neural networks in GIS environments.
MultiBoost is a C++ implementation of the multi-class AdaBoost algorithm. AdaBoost is a powerful meta-learning algorithm commonly used in machine learning. The code is well documented and easy to extend, especially for adding new weak learners.
DotPlot is an Eclipse plug-in to graphically compare word sequences of any type of text. Matches will be plotted as dots on a graph. Similarities in thousands of lines of text or code will result in typical textures and diagonals in the plot.
"mASN1" - mini ASN.1 framework is a light ASN.1 framework written in C# for .NET
framework. It can be used for creating classes that model ASN.1 types and
are capable of encoding/decoding themselves to BER/DER codes. PER support is planned.
This is a implementation of the 'enhanced Topic-based Vector Space Model' (eTVSM) using the python language. A Java-Version and maybe other java-code contributions are planned.
An open-source package for analysis of neurophysiological data-- offline (vs real-time) processing of single-neuron spike trains or EEG data associated with behavior and memory processes. Currently MATLAB heavy, with some Windows specific code.
Framework for storing and editing Forestry-related XML data, with a specialised graphical interface for each object type. Plug-in API for adding functionality. Coded in Java, uses WebStart for distribution, swing GUI, ~100k lines of code.
CloneAnalyzer is a tool for software quality analysis. It allows you to find, display and inspect clones, which are fragments of duplicated source code resulting from lack of proper reuse. It can be used as a plugin for Eclipse and on the command line.