A tool that determines the winner of a vote through many different systems. Amongth the supported systems are: Borda count, Instant-runoff voting, Condorcet method, Ranked pairs, Cloneproof schwartz sequential droping / beatpath winner.
Enrich and query corpora in the TEI-XML vocabulary. CorpusReader manage very large corpora and corpora containing milestone annotation. It provides tools for enriching corpora with output of linguistic parsers, and for extracting quantitative information
Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
The system searches synonyms (and related words) in Wikipedia. WikIDF generates index database of Wikipedia (for Russian, English, and German). The continuation of this project is "wikokit" at code.google.com
z0mg antihax will be a server-side anti-cheat solution for first-person shooter games. It combats cheating in general rather than specific exploits by employing heuristics and analysis of player behaviour instead of scanning the client systems.
Feating constructs a classification ensemble comprising a set of local models. It is effective at reducing the error of both stable and unstable learners, including SVM. For details see the paper at http://dx.doi.org/10.1007/s10994-010-5224-5.
A collection of compact, powerful programs and functions, written in C, for the analysis and transformation of data. No object or structure dependencies, no fancy interfaces - just good tools.
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Executable program that measures sizes and other properties of colonies arrayed in a grid format (intended for 768, 384, or 96 colonies on agar plates) from jpeg images
This is a MATLAB toolbox for the quality control and scoring of EMAP and SGA genetic interaction data. It includes a graphical user interface and some automatic plot-generating tools.
Ferda is a user friendly data mining tool. It is a modular distributed multiplatform framework based on Internet Communications Engine. Ferda is very powerful in working with association rules.
rSIGNAL - ruby Simple Inter-Interface Generic Aggregation Layer. It allows external data sources to be aggregated, store the aggregated data and print the aggregated data out in a transformed format to a file (or integrated HTTP server) on demand.
The NITE XML Toolkit supports the creation, analysis, and browsing of annotated multimodal, text, or spoken language corpora, and represents both timing and rich linguistic structure. It contains libraries for developers and some end user tools.
Siafu simulates individual agents and their context, from home to city-wide scenarios. As a developer, you use the API to write your simulation for the purposes of data-set generation, test or visualization, optionally hooking it to your own application.
Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
Windows application for data mining with self-organizing neural networks. It works with text data files and has convenient user interface. Is solves data mining tasks of class Recognition, continuous value Prediction, and time series Forecast.
CAMEL (Content-based Audio and Music Extraction Library) is an easy-to-use C++ framework developed for content-based audio and music analysis. The framework provides a set of tools for easy Segmentation, Feature Extraction, Domain Extraction, etc.
Regexp testing tool allows to apply group of regexps to huge arrays of data (millions or so) in order to investigate search or search/replacement possibilities of regexp group.
Methods and testing of methods for automatic analysis of in situ cyclic
voltammetry data.
This, at least initially, is the code from my masters thesis, which was
done as a contribution to a larger project called Aevum. Aevum is being
developed at t
Logbus-ng consists in a set of tools to aid developers perform Log Analysis in all the stages: log generation, collection, distribution, storage and analysis. It is designed specifically for Field Failure Data Analysis in critical distributed systems
Content Addressable Memory, Multi-Variate Statistics, Data Mining Includes analyzing datasets, extracting patterns, creating empirical expert system. Computes joint probabilities and implements a "belief" as the solution of an equilibrium equation
QtStock allows to do fundamental analysis of companies and companies shares according to the National Association of Investors Corporation (NAIC). It is completely written in C++ with Qt.