Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.
Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
The system searches synonyms (and related words) in Wikipedia. WikIDF generates index database of Wikipedia (for Russian, English, and German). The continuation of this project is "wikokit" at code.google.com
Parallel Colt is a multithreaded version of Colt - a library for high performance scientific computing in Java. It contains efficient algorithms for dataanalysis, linear algebra, multi-dimensional arrays, Fourier transforms, statistics and histogramming
H-mito is a mitochondrial DNA haplogroup prediction tool based on the phylogenetic tree http://www.phylotree.org/. Ancillary scripts: mitoP.py to extract mutation lists and clustal-2-fasta.zip to join, align through clustalw and convert data.
WordNetLMF converts WordNet (http://wordnet.princeton.edu/) lexicographer files into KYOTO-LMF, the LMF dialect used in the KYOTO project (http://www.kyoto-project.eu/).
PocketAnalyzerPCA combines a geometric algorithm for detecting pockets in proteins with Principal Component Analysis and clustering. This enables visualization and analysis of pocket conformational distributions of large sets of protein structures.
Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
A population-based method for DNA copy number analysis: recurrent copy number aberration indentification in multiple samples (with no need of single-sample calling). Developed for a quick analysis of high resolution and large population data.
Open Source Particle Image Velocimetry and PIV Analysis
OpenPIV is an initiative of scientists to develop a software, algorithms and methods for the state-of-the-art experimental tool of Particle Image Velocimetry (PIV) which are free, open source, and easy to operate.
Small tools to convert binary data format from spectrometers and oscilloscopes. Now the translators for * Starnford SR430 * LeCROY oscilloscope are included.
IMAS is a Visual Analytics system for the discovery of knowledge in genomic information. IMAS enables genomic scientists to rapidly analyze and visualize a set of microbial genomic sequences using tools such as Glimmer, Blast, Clustal-W, Primer3.
A database and a web front end for physiologic data on animal feeding, developed with PostgreSQL and Django at NESCent (http://www.nescent.org) for Mammalian Feeding Working Group (http://www.feedexp.org).
Current development of the codebase is on Github at http://github.com/NESCent/feedingdb.
Windows application for data mining with self-organizing neural networks. It works with text data files and has convenient user interface. Is solves data mining tasks of class Recognition, continuous value Prediction, and time series Forecast.
rSIGNAL - ruby Simple Inter-Interface Generic Aggregation Layer. It allows external data sources to be aggregated, store the aggregated data and print the aggregated data out in a transformed format to a file (or integrated HTTP server) on demand.
Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
The Protein Geometry Database hosts the development code for a flexible database for searching protein geometry, as well as a library for accessing this data for protein modeling & refinement programs.
Siafu simulates individual agents and their context, from home to city-wide scenarios. As a developer, you use the API to write your simulation for the purposes of data-set generation, test or visualization, optionally hooking it to your own application.
This is a MATLAB toolbox for the quality control and scoring of EMAP and SGA genetic interaction data. It includes a graphical user interface and some automatic plot-generating tools.
This project hosts tools used for analysis of Gaussian Mixture Distributions (GMDs) which are used for statistical signal processing. The tools are libraries for implementing GMD operations and programs used to analyze properties of GMDs.
Logbus-ng consists in a set of tools to aid developers perform Log Analysis in all the stages: log generation, collection, distribution, storage and analysis. It is designed specifically for Field Failure DataAnalysis in critical distributed systems