OpenForecast is a package of general-purpose forecasting models written in Java that can be applied to any data series. No knowledge of forecasting is required: the package selects the most appropriate algorithm for your data.
Echoproc is scientific analysis software for extracting physical ice sheet characteristics from radio echograms produced by experiments such as those of the Center for Remote Sensing of Ice Sheets (https://www.cresis.ku.edu/).
A collection of compact, powerful programs and functions, written in C, for the analysis and transformation of data. No object or structure dependencies, no fancy interfaces - just good tools.
Ferda is a user-friendly data mining tool. It is a modular, distributed, multiplatform framework based on the Internet Communications Engine. Ferda is particularly strong at working with association rules.
Parsers for biological data based on scanner generators such as Flex (C), Re2c (C), JFlex (Java), and Ifickle (Tcl). These scanner generators make the parsers easier to maintain and develop, and faster, than handwritten scanners. Scanner output is SQL.
Enrich and query corpora in the TEI-XML vocabulary. CorpusReader manages very large corpora and corpora containing milestone annotation. It provides tools for enriching corpora with the output of linguistic parsers, and for extracting quantitative information
The system searches for synonyms (and related words) in Wikipedia. WikIDF generates an index database of Wikipedia (for Russian, English, and German). The continuation of this project is "wikokit" at code.google.com.
Cougar Squared is a new Java library for machine learning and data mining research, supporting the research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
rSIGNAL - Ruby Simple Inter-Interface Generic Aggregation Layer. It aggregates external data sources, stores the aggregated data, and on demand prints it in a transformed format to a file (or an integrated HTTP server).
Siafu simulates individual agents and their context, from home to city-wide scenarios. As a developer, you use the API to write your simulation for data-set generation, testing, or visualization, optionally hooking it to your own application.
This is a MATLAB toolbox for the quality control and scoring of EMAP and SGA genetic interaction data. It includes a graphical user interface and some automatic plot-generating tools.
Logbus-ng is a set of tools that helps developers perform log analysis at every stage: log generation, collection, distribution, storage, and analysis. It is designed specifically for Field Failure Data Analysis in critical distributed systems
Methods and testing of methods for automatic analysis of in situ cyclic voltammetry data. This, at least initially, is the code from my master's thesis, which was done as a contribution to a larger project called Aevum. Aevum is being developed at t
A regexp testing tool that applies a group of regexps to very large data sets (millions of items or so) in order to investigate the search or search-and-replace capabilities of the regexp group.
Library for capturing, storing, and visualizing timeseries data.
JTimeSeries has moved to GitHub.
Please go to https://github.com/JTimeSeries/jtimeseries
The SourceForge copy has not been maintained since Sep 2012.
A Java library to assist with capturing and storing timeseries data/metrics. Provides facilities to publish timeseries data across a network, a lightweight server to persist series data, and client user interface components for real-time visualization
ngram is a module that computes the similarity between two strings. It differs from Python's "difflib.SequenceMatcher" in that it cares more about the size of both strings. ngram is a port and extension of the Perl module called "String::Trigram"
JGraph is the most powerful, lightweight, feature-rich, and thoroughly documented open-source graph component available for Java. See the project homepage at www.jgraph.com for information and downloads.
Ontea - Pattern-based Semantic Annotation Platform. Ontea finds or creates semantic metadata from text or documents using pattern-based approaches. The implementation currently includes regular expression (regex) patterns
Data mines the voting record and other actions of Members of the UK Parliament. Extracts information from the parliament website and stores it in a database. Provides tools to analyse the information, producing statistics and tables about the MPs.
PerfParse. Storage and analysis of binary performance data produced by Nagios. High-quality, accurate graphs of live data from standard Nagios plugins. Permanent history of plugin results with advanced analysis tools.
Contextor is a lightweight, simple-to-use Java-based library that helps developers and researchers work with the general concept of a resource. For example, resources can be text resources, web resources, images, and videos.