Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.
Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
SEMANTIXS is a semantic information extraction system that can extract, represent and visualize domain-specific information from free-text in the form of complex (and simple) relationships. Refer - http://www.cs.iastate.edu/~semantix/ for more info.
TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
OpenForecast is a package of general purpose, forecasting models written in Java that can be applied to any data series. No knowledge of forecasting is required. Using this package, the most appropriate algorithm for your data will be selected.
K people is a software platform capable of integrating with enterprise systems. It enables workers to improve productivity, while giving management the the means to plan and monitor activity.
A cross-platform Java program to turn a simple Geiger counter into the equivalent of a state-of-the art digital model, plus some – using your laptop computer.
MediaWikiRevisionsExtractor extracts the history of a particular wiki page, computes the modifications made between each revisions and finally, stores the whole set of modifications in a file.
Knime (http://www.knime.org) nodes for sequence bioinformatics. Sequime is an eclipse plug-in for the KNIME data mining platform, providing additional nodes for reading, processing and visualizing sequence information.
Parsers for biological data based on scanner generators like Flex (C), Re2c(C), Jflex (Java) and Ifickle (Tcl). This scanner generators are providing easier maintainance, development and higher speed than hand written scanners. Scanner output is SQL.
Enrich and query corpora in the TEI-XML vocabulary. CorpusReader manage very large corpora and corpora containing milestone annotation. It provides tools for enriching corpora with output of linguistic parsers, and for extracting quantitative information
Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
The system searches synonyms (and related words) in Wikipedia. WikIDF generates index database of Wikipedia (for Russian, English, and German). The continuation of this project is "wikokit" at code.google.com
Feating constructs a classification ensemble comprising a set of local models. It is effective at reducing the error of both stable and unstable learners, including SVM. For details see the paper at http://dx.doi.org/10.1007/s10994-010-5224-5.
The NITE XML Toolkit supports the creation, analysis, and browsing of annotated multimodal, text, or spoken language corpora, and represents both timing and rich linguistic structure. It contains libraries for developers and some end user tools.
Executable program that measures sizes and other properties of colonies arrayed in a grid format (intended for 768, 384, or 96 colonies on agar plates) from jpeg images
Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
Siafu simulates individual agents and their context, from home to city-wide scenarios. As a developer, you use the API to write your simulation for the purposes of data-set generation, test or visualization, optionally hooking it to your own application.
Regexp testing tool allows to apply group of regexps to huge arrays of data (millions or so) in order to investigate search or search/replacement possibilities of regexp group.
library for capturing, storing and visualizing timeseries data
The JTimeSeries has moved to github
Please go to https://github.com/JTimeSeries/jtimeseries
The SourceForge copy has not been maintained since Sep 2012
A java library to assist with capturing and storing timeseries data/metrics. Provides facilities to publish timeseries data across a network, a lightweight server to persist series data, and client user interface components for real time visualization
OpenSHORE is an XML based Semantic Document Repository (SDR) with a free definable meta model that builds up a semantic network from sections and relations in documents. The acronym SHORE means Semantic Hypertext Object Repository.