Showing 257 open source projects for "data mining"

  • 1

    SCaVis

    Scientific Computation and Visualization Environment

    SCaVis is an environment for scientific computation, data analysis and data visualization for scientists, engineers and students. The program is fully multiplatform (100% Java) and integrated with Java and a number of scripting languages: Jython (Python), Groovy, JRuby, BeanShell. SCaVis can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. ...
    Downloads: 0 This Week
  • 2
    Gen3D

    A tool for 3D genome and chromosome structural model construction

    Gen3D is an application designed to determine three-dimensional genome and chromosome models. It uses chromosomal contact data to construct three-dimensional conformations. This method can generate three-dimensional chromosomal models satisfying a large portion of chromosomal contacts. The software is developed in Prof. Jianlin Cheng's Bioinformatics, Data Mining and Machine Learning Laboratory in the Computer Science Department at the University of Missouri - Columbia, USA. ...
    Downloads: 0 This Week
  • 3
    PROPER is a MATLAB package for visual evaluation of ranking classifiers in biological big data mining studies. It is an efficient tool for optimizing and comparing state-of-the-art ranking classifiers by generating over 20 different high-quality two- and three-dimensional performance curves.
    Downloads: 1 This Week
  • 4
    The c2001 spatio-temporal mining library

    An open source spatio-temporal data mining library

    Current functions: 1. The General Association Rule Mining Framework (GARMF) library, which supports mining association rules from transactions (boolean, weighted, fuzzy), spatial datasets (vector and raster) and spatio-temporal datasets (raster snapshots). It also supports incremental mining. 2. The Rule Filtering Library (RFL), a library for rule evaluation. 3. DAP-Shell, a GUI shell for GARMF and RFL, will also be provided. However, I will pay no further attention to it.
    Downloads: 0 This Week
  • 5
    CNV Explorer

    Interactive Copy Number Analysis for Cancer Genomics

    CNV Explorer for Gene discovery: https://arraycgh.shinyapps.io/CNV_Explorer/ CNV Explorer for survival analysis: https://arraycgh.shinyapps.io/CNV_Explorer_Survival/ CNV_Explorer is a data mining tool for cancer researchers interested in the structural and copy number changes that happen in cancer genomes. Huge volumes of genomics data from nearly every cancer type are now freely available and several online databases have begun to collate and store this information. However, current tools focus on individual gene queries rather than chromosomal and region-based queries more relevant to some researchers. ...
    Downloads: 0 This Week
  • 6
    Cenobi

    Cost estimation and management accounting using neural networks

    Cenobi is designed for management accountants, not (only) for statisticians and data mining experts. Carefully arranged default settings make sure you can concentrate on Cenobi's many accounting features rather than worrying about setting up artificial neural networks or genetic algorithms, which are the main machine learning tools under Cenobi's hood. Cenobi's main benefits are: - Ease of use - By using artificial neural networks to estimate cost relationships, Cenobi is able to reflect the non-linear cost behavior that occurs in most business processes...
    Downloads: 0 This Week
  • 7
    Appliance Usage Model Manager

    A tool to extract and handle appliance usage models

    The Appliance Usage Model Manager is a tool for modeling device demand by means of machine learning and data mining techniques. We propose a general-purpose interface for handling such models, which can be used to query the various types of models that can be learned from energy consumption datasets. The tool provides a hub where models can be loaded and used in the HEMS market simulator. Please check the documentation in the file section for further information on using the tool.
    Downloads: 0 This Week
  • 8
    Mr.FSM

    Large-Scale Frequent Subgraph Mining in MapReduce

    This is the program used in the following paper: Wenqing Lin, Xiaokui Xiao, and Gabriel Ghinita. Large-Scale Frequent Subgraph Mining in MapReduce. In Proceedings of the 30th IEEE International Conference on Data Engineering (ICDE), pages 844-855, 2014. Please cite the paper if you choose to use the program. If you have any problems, please report them to {wlin1 at ntu dot edu dot sg}.
    Downloads: 0 This Week
  • 9

    libVMR

    VMR - machine learning library

    libVMR is a class library written in Java which implements a code generator for the group method of data handling (GMDH). The library is intended for users with machine learning skills. libVMR provides an effective framework for the research and development of data mining and predictive analytics. libVMR is based on the most popular neural network model with a higher generalization ability from kernel tricks - vector machine by Reshetov (VMR).
    Downloads: 0 This Week
  • 10

    FlexDM

    Simple, parallel and fault-tolerant data mining using WEKA

    FlexDM has moved to http://madiflannery.github.io/FlexDM/
    Downloads: 0 This Week
  • 11
    QuazarCoin
    QuazarCoin (QCN) is a new cryptocurrency based on CryptoNote that uses the CryptoNight algorithm. QuazarCoin protects your data and privacy with the help of completely anonymous transactions with ring signatures.
    Downloads: 0 This Week
  • 12

    VSViewer3D

    3D Virtual Screening Viewer

    This project is an interactive viewer for 3D virtual screening data.
    Downloads: 1 This Week
  • 13

    Iris Classifier

    The classifier for iris flowers (data mining)

    This implements the KNN algorithm.
    Downloads: 0 This Week
  • 14

    Persica-A new Persian corpus for NLP

    This project presents a new corpus for news text analysis in Persian

    The lack of a multi-application text corpus, despite the surge in text data, is a serious bottleneck in text mining and natural language processing, especially for the Persian language. This project presents a new corpus for news article analysis in Persian called Persica. News analysis includes news classification, topic discovery and classification, category classification and many more procedures. Dealing with news has special requirements, first of all a valid and reliable corpus on which to perform the experiments. ...
    Downloads: 0 This Week
  • 15
    giServer

    giServer: the easy-to-use and extensible batch and integration server

    ...Instead of using complex XML configuration files, an elaborate GUI for batch job management is included. Some possible usage scenarios are: - Automatic processing of incoming data files - Big Data applications - Process automation - Data Mining/Aggregation applications - Automatic Reporting - Processing and analysis of database records
    Downloads: 0 This Week
  • 16
    MRA

    A general recommender system with basic models and MRA

    Multi-categorization Recommendation Adjusting (MRA) optimizes the results of traditional (basic) recommendation models by introducing objective category information and exploiting the fact that users habitually prefer certain categories. Besides this, the improved model has two advantages: 1) it can easily be applied to any kind of existing recommendation model, and 2) a controller is set in this improved model to provide...
    Downloads: 0 This Week
  • 17

    webtextanalysis

    Mining knowledge from text data

    This project aims to implement the following text mining techniques in Java: text language detection, keyword and keyphrase extraction, text classification, text clustering, single- or multiple-document summarization, and plagiarism detection.
    Downloads: 0 This Week
  • 18
    Graph-RAT
    Graph-RAT is a database abstraction layer designed to make it easy to use a large library of graph-analysis routines on a database as well as add new kinds of algorithms to data mining.
    Downloads: 0 This Week
  • 19

    cmotif

    A method for mining conditional phosphorylation motifs

    Phosphorylation motifs represent position-specific amino acid patterns around the phosphorylation sites in a set of phosphopeptides. The discovery of phosphorylation motifs is valuable work in bioinformatics. Although several algorithms have been proposed to uncover phosphorylation motifs, the problem of efficiently discovering a set of significant motifs with sufficiently high coverage and non-redundancy remains unsolved. In this paper, we propose an algorithm called C-Motif...
    Downloads: 0 This Week
  • 20

    Cost-sensitive Classifiers

    Adaboost extensions for cost-sensitive classification

    Adaboost extensions for cost-sensitive classification: CSExtension 1 CSExtension 2 CSExtension 3 CSExtension 4 CSExtension 5 AdaCost Boost CostBoost Uboost CostUBoost AdaBoostM1. Implementation of all the listed algorithms of the cluster "cost-sensitive classification". They are meta algorithms which require base algorithms, e.g. Decision Tree. Moreover, a voting criterion is also required, e.g. the minimum expected cost criterion. Input also requires loading an arff file and a...
    Downloads: 0 This Week
  • 21
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 3 This Week
  • 22

    RedLDA

    Redundancy Aware LDA Gibbs Sampler

    Redundancy-Aware Topic Modeling. Copy-paste redundancy and data duplication are prevalent in many corpora. This redundancy has a negative impact on the quality of text mining, and of topic modeling in particular. This is a software package implementing a novel variant of Latent Dirichlet Allocation (LDA) topic modeling, Red-LDA, which takes into account the inherent redundancy of corpora when modeling content.
    Downloads: 0 This Week
  • 23

    CyberSpy

    A cyberintelligence turn-based strategy game

    CyberSpy is a turn-based strategy game in the style of Civilization. The player takes on the role of the head of a highly secretive intelligence agency tasked with using data-mining to prevent security threats. Players must learn to balance the privacy and security of their citizens in order to succeed.
    Downloads: 0 This Week
  • 24
    GISBox by Citiviz

    Ready-to-use appliance for GIS, Data Mining and Analysis, Modelling...

    GISBox by Citiviz is a ready-to-use virtual appliance containing an assortment of powerful, free and open source GIS, Data Mining and Analysis, Modelling and Programming software and packages. The pre-configured and tested GISBox lets you try out, within a few minutes, a variety of software used by data scientists, engineers, physicists, mathematicians, geomatics engineers, and geographers from around the globe, without installing anything other than the Oracle VirtualBox™ software (https://www.virtualbox.org/) on your Linux, Windows or Mac computer. ...
    Downloads: 1 This Week
  • 25
    BINow

    Complete data mining solution.

    A multidimensional data exploitation platform that extracts data and delivers it to different levels of business users and applications, with high availability and high-speed access to the data.
    Downloads: 0 This Week
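Entry 13 above (Iris Classifier) says only that it implements the KNN algorithm. As a rough illustration of what k-nearest-neighbour classification involves, here is a minimal sketch in Python. It is not taken from that project: the function name `knn_predict`, the choice of Euclidean distance, and the toy measurements are all assumptions made for this example.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (features, label) pairs; `query` is a feature tuple.
    Euclidean distance is assumed here; other metrics are equally valid.
    """
    # Sort training points by distance to the query and keep the k closest.
    neighbours = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    # Count the labels among those neighbours and return the most common one.
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical toy data (petal length, petal width), not real dataset rows.
train = [
    ((1.4, 0.2), "setosa"),
    ((1.3, 0.2), "setosa"),
    ((1.5, 0.3), "setosa"),
    ((4.7, 1.4), "versicolor"),
    ((4.5, 1.5), "versicolor"),
    ((4.9, 1.5), "versicolor"),
]

print(knn_predict(train, (1.6, 0.25)))
```

With two classes, an odd `k` avoids tied votes; with more classes a tie-breaking rule would be needed.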