Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.
Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
Start Free Trial
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.
Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
Open data mining platform. Provides common architecture for algorithms of various types. Efficient processing of arbitrarily large volumes of data thanks to data streaming. Weka and Rseslib partially integrated. (www.debellor.org)
SPIDR (Space Physics Interactive Data Resource) is a distributed database and application server network, built to select, visualize and model historical space weather data. SPIDR is a web-application and a grid of data mining web-services.
The Varro toolkit is a system for identifying and frequently recurring unordered subtrees in semi-structured data. It is mostly for linguistics but has applications in semi-structured data mining too.
Compliant and Reliable File Transfers Backed by Top Security Certifications
Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.
Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
Moara is a biological text mining tool and consists of a Java library and some auxiliary MySQL databases for gene/protein training and extraction of mentions and its further normalization and disambiguation.
A java tool for anytime and interactive sequence mining. Aims at providing users with a way of analyzing her activity traces and extract activity schemes from them.
The Minervan project aims at aiding intelligent software development. It integrates reporting, analysis and data mining to support better decision making.
This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
LAITOR is a text mining software developed to find co-occurrence of biological entities (gene/protein terms) together with biointeractions and concepts term from customized dictionaries.
Decima is a database that was designed to support time-series data mining. It consists of PostgreSQL custom type definition, implementation of GiST index for that type and snowflake database schema.
PMML-compliant scoring engine and analytic toolkit
...The new project page is augustus.googlecode.com. New releases of the project are not currently being released to sourceforge.
Augustus is designed for statistical and data mining models and produces and consumes models with 10,000s of segments.
Versions of Augustus support PMML 3, 4.0.1, and 4.1.
The ProM Import Framework allows to extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
The Fiber project seeks to create a modular open source text mining tool that provides a contextual foundation for analysis in the dissemination of large quantities of text data.
TextMine is for the Perl hacker who is grappling with the problems of managing unstructured text from various sources. You can use these text mining tools to search the Web, index text, extract entities, categorize your e-mail, and summarize documents.
@Note2 is now available in www.anote-project.org
@Note is a Biomedical Text Mining workbench that integrates current Biomedical Text Mining (BioTM) methods and provides biologists with intuitive tools capable of supporting their bibliographic searches and further literature curation.
ESSE is a flexible, efficient and easy to use search engine for data mining in environmental data archives. ESSE will help you find useful data even if you don't know exactly what you are looking for.
The Databionics ESOM Tools offer many data mining tasks using Emergent Self-Organizing Maps. Visualization, clustering, and classification of high-dimensional data using databionics principles can be performed interactively or automatically.
Crow - Computational Representation Of Whatever. A platform for the integration and mining of complex and distributed data. Represents cross-linked semantic web documents as a network of software objects and offers easy ways to filter, and sort them.
Data mining tool for the extraction of spatio-temporal frequent patterns ("Trajectory patterns" or "T-patterns") from GPS-like trajectories of a set of moving objects. Work performed within the European project GeoPKDD - www.geopkdd.eu