Maui is a multi-purpose automatic topic indexing algorithm. Given a document, Maui automatically identifies its topics. Depending on the task topics are tags, keywords, keyphrases, vocabulary terms, descriptors or Wikipedia titles.
weka outlier is an implementation of outlier detection algorithms for WEKA.
CODB (Class Outliers: Distance-Based) Algorithm is the first algorithm developed using WEKA framework.
An experimental unsupervised learning method for improvising jazz melodies, based on restricted Boltzmann machines (RBMs) layered into Deep Belief Networks (forms of neural networks).
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Extensible framework that enables productive design, analysis, and execution of arbitrary-sized neural-networks or system on a distributed, scalable, high-throughput runtime platform. Enables synapse-oriented-programming.
Blunder is an automated tool for analyzing chained exceptions in Java. It's usefull for classify, generate a customized error message and a list for possible solutions.
A project aims to develop a system which trains LDA model in distributed enviorenment. I studied Hadoop based solution and found that Hadoop is not fit for distributed LDA training case. In this project I implement a platform based on socket.
A suite of machinelearning benchmarks where each agent must solve a lot of different tasks without recompilation. This means that the programmers cannot manually specify topologies or adjust parameters to specific tasks.
Dataset Retrieval through Intelligent Agents (DARIA): is an Open Source project for facilitating the construction of ARFF data set files for use with WEKA or any such MachineLearning/Data Mining Software through the use of Intelligent Agents.
T-Rex (Trainable Relation Extraction) is a highly configurable machinelearning-based Information Extraction from Text framework, which includes tools for document classification, entity extraction and relation extraction.
TimeSleuth discovers temporal decision rules. It also judges the (a)causality of the rules. TimeSleuth can discover rules that involve time: {if (rainy_yesterday = true) then rainy_today = true}, or {if (rainy_tomorrow = true) then rainy_today = true}.
vbullmin is a data miner bot for vBulletin boards. vbullmin can get all Forums, Topics, Post and Users from a vBulletin. It can be export this values with phpbb2 database schema. It's a sample for MachineLearning. It's using patterns for getting data.