Showing 257 open source projects for "data mining"

View related business solutions
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    ht://Check is more than a link checker. It's particularly suitable for checking broken links, anchors and web accessibility barriers, but retrieved data can also be used for Web structure mining. Uses a MySQL backend. Derived from ht://Dig.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    SPIDR (Space Physics Interactive Data Resource) is a distributed database and application server network, built to select, visualize and model historical space weather data. SPIDR is a web-application and a grid of data mining web-services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    The Varro toolkit is a system for identifying and frequently recurring unordered subtrees in semi-structured data. It is mostly for linguistics but has applications in semi-structured data mining too.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Isolation Forest detects data-anomalies using binary trees. Platform: R (www.r-project.org) Reference: Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou, “Isolation Forest”, IEEE International Conference on Data Mining 2008 (ICDM 08)
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    ADWIN is an adaptive sliding window algorithm for detecting change and keeping updated statistics from a data stream, and use it as a black-box in place or counters in learning and mining algorithms initially not designed for drifting data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    This project was an exercise for a Software Engineering course. The software is incomplete, and there are no plans to complete it. The Concurrent Versions Data Mining System (CVDMS) is a Web application designed to provide data mining of CVS repositories in the form of statistics and visualizations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    A tool to aid in the execution of Systematic Mapping Studies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    A suite of data mining algorithms implemented in C++ for use in a distributed environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    WSRF-compliant tools and services for data mining in grid computing environments, based on: Globus Toolkit 4, Condor and Triana workflow system. Learn more at: http://www.datamininggrid.org Copyright (c) 2008 DataMiningGrid Consortium.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Datacleaning Open Source
    A group a subprojects for Data Cleaning projects, mainly as a step of a Data Mining Project. Visit www.datacleaningopensource.com to review our current applications or if you want to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The Minervan project aims at aiding intelligent software development. It integrates reporting, analysis and data mining to support better decision making.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Wiki .NET
    A set of libraries and programs to access MediaWiki based websites' data through the MediaWiki API. Also includes visual tools to aid editing wiki-markup pages. Created using the .Net framework and C#
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Excel to Arff converter. You can easily convert the excel datas will be used data mining process to arff file format and then easily analyze your datas and results using WEKA Data Mining Utility.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    Catalyst based application intended to be a robot fitting tool, character planner, mining yield planner and eventually also a killboard/corporation management tool for Perpetuum Online. This is the base project, without any non-free data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Webstats Solr is an attempt to make Apache Access log easier to Data Mine. By adding a powerful Search Engine (SOLR) as a Backend and using Java Script and HTML and maybe PHP I hope to out date AWStats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LABnovo offers a method for laboratory housekeeping by correlating the lab processes that ultimately lead to the production of sequences. The metadata collected enables further data-mining in a community-centric environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Decima is a database that was designed to support time-series data mining. It consists of PostgreSQL custom type definition, implementation of GiST index for that type and snowflake database schema.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Augustus

    PMML-compliant scoring engine and analytic toolkit

    ...The new project page is augustus.googlecode.com. New releases of the project are not currently being released to sourceforge. Augustus is designed for statistical and data mining models and produces and consumes models with 10,000s of segments. Versions of Augustus support PMML 3, 4.0.1, and 4.1.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Facilitates data mining/natural language processing experiments to be executed on weblogs, such as classification, clustering and rating. As part of these experiments, it is possible to apply Latent Semantic Analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The ProM Import Framework allows to extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    BF-2PkA is an algorithm for anonymization of sequence data. It transforms a dataset in a k-anonymous dataset. Moreover, the sequential pattern mining results are preserved very well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A mini data CUBE to do some embedded data analysis and mining.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Dataset Retrieval through Intelligent Agents (DARIA): is an Open Source project for facilitating the construction of ARFF data set files for use with WEKA or any such Machine Learning/Data Mining Software through the use of Intelligent Agents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The Fiber project seeks to create a modular open source text mining tool that provides a contextual foundation for analysis in the dissemination of large quantities of text data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Vademecum is a Data Mining suite with a novel help function, semi-automated protocol, consequently modular construction that is also equipped with the most elaborated Data Mining methods.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB