Showing 54 open source projects for "data"

View related business solutions
  • Go from Data Warehouse to Data and AI platform with BigQuery Icon
    Go from Data Warehouse to Data and AI platform with BigQuery

    Build, train, and run ML models with simple SQL. Automate data prep, analysis, and predictions with built-in AI assistance from Gemini.

    BigQuery is more than a data warehouse—it's an autonomous data-to-AI platform. Use familiar SQL to train ML models, run time-series forecasts, and generate AI-powered insights with native Gemini integration. Built-in agents handle data engineering and data science workflows automatically. Get $300 in free credit, query 1 TB, and store 10 GB free monthly.
    Try BigQuery Free
  • Ship AI Apps Faster with Vertex AI Icon
    Ship AI Apps Faster with Vertex AI

    Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

    Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
    Try Vertex AI Free
  • 1

    Chordalysis

    Log-linear analysis (data modelling) for high-dimensional data

    ...However, due to its exponential nature, previous approaches did not allow scale-up to more than a dozen variables. We present here Chordalysis, a log-linear analysis method for big data. Chordalysis exploits recent discoveries in graph theory by representing complex models as compositions of triangular structures, also known as chordal graphs. Chordalysis makes it possible to discover the structure of datasets with thousands of variables on a standard desktop computer. Associated papers at ICDM 2013, ICDM 2014 and SDM 2015 can be found at http://www.francois-petitjean.com/Research/ YourKit is supporting Chordalysis open source project with its full-featured Java Profiler. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MODLEM

    MODLEM

    rule-based, WEKA compatible, Machine Learning algorithm

    This project is a WEKA (Waikato Environment for Knowledge Analysis) compatible implementation of MODLEM - a Machine Learning algorithm which induces minimum set of rules. These rules can be adopted as a classifier (in terms of ML). It is a sequential covering algorithm, which was invented to cope with numeric data without discretization. Actually the nominal and numeric attributes are treated in the same way: attribute's space is being searched to find the best rule condition during rule induction. In result numeric attribute's conditions are more precise and closely describe the class. This algorithm contains some aspects of Rough Set Theory: the class definition can be described accordingly to its lower or upper approximation. ...
    Leader badge
    Downloads: 18 This Week
    Last Update:
    See Project
  • 3

    libVMR

    VMR - machine learning library

    ...The library has been designed to learn from data sets. Typical applications here are pattern recognition ( binary classification).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 99.99% Uptime for MySQL and PostgreSQL on Google Cloud Icon
    99.99% Uptime for MySQL and PostgreSQL on Google Cloud

    Enterprise Plus edition delivers sub-second maintenance downtime and 2x read/write performance. Built for critical apps.

    Cloud SQL Enterprise Plus gives you a 99.99% availability SLA with near-zero downtime maintenance—typically under 10 seconds. Get 2x better read/write performance, intelligent data caching, and 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server with built-in vector search for gen AI apps. New customers get $300 in free credit.
    Try Cloud SQL Free
  • 5
    LPCforSOS is a machine learning framework with a special focus on structured output spaces and pairwise learning. It supports currently multiclass, ordinal, hierarchical, multi-label and label ranking classification settings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Consilium Sentence Suggestions Tools

    Consilium Sentence Suggestions Tools

    Consilium – User Defined sentence Suggestion Tool.

    ...But in reality style of writing changes person to person. While our aim is to provide a sentence suggestion tool which will give suggestion to complete the sentence according previously enterd data by the user. Output or suggestion for same sentence or word will change person to person according to previously entered data by that person. So, it will be very easy to type any document, sms, mail, chatting etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    HW SOM

    HW SOM

    SOM - Self-Organizing Maps of Teuvo Kohonen

    It's a "Hello World" implementation of SOM (Self-Organizing Map) of Teuvo Kohonen, otherwise called as the Kohonen map or Kohonen artificial neural networks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    NN Image Recognition (with source-code)

    This is ANN trained application to predict digits from 0 - 9.

    ...I trained ANN with 100 samples of each digit. It takes input of 20x20 pixel image and predicts it with Neural Network. It may predict wrong digit due to very low sample data but it work 90% correctly. Note: JRE 1.6 is required to run this application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DocCO

    DocCO

    Non-disjoint groupping of Documents based on word sequence approach

    ...It offers the possibility to make non disjoint clustering of documents using both vectorial and sequential representation (word sequence approach based on WSK kernel). All data format supported by WEKA could be used in DocCO. Data could be loaded from files, from databases or from specified URL. All the preprocessing techniques implemented in WEKA could be used before performing the learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit for Your Google Cloud Projects Icon
    $300 in Free Credit for Your Google Cloud Projects

    Build, test, and explore on Google Cloud with $300 in free credit. No hidden charges. No surprise bills.

    Launch your next project with $300 in free Google Cloud credit—no hidden charges. Test, build, and deploy without risk. Use your credit across the Google Cloud platform to find what works best for your needs. After your credits are used, continue building with free monthly usage products. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10
    feed4weka is an open library that enriches weka (http://www.cs.waikato.ac.nz/ml/weka/), an open source project for data analysis. It integrates new classification and clustering algorithms, and adds the coclustering and outlier detection frameworks
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TreeLiker

    TreeLiker

    TreeLiker is a collection of fast algorithms for working with complex

    TreeLiker is a collection of fast algorithms for working with complex structured data in relational form. The data can, for example, describe large organic molecules such as proteins or groups of individuals such as social networks or predator-prey networks etc. The algorithms included in TreeLiker are unique in that, in principle, they are able to search given sets of relational patterns exhaustively, thus guaranteeing that if some good pattern capturing an important feature of the problem exists, it will be found. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    AdPreqFr4SL

    Adaptive Prequential Learning Framework

    The AdPreqFr4SL learning framework for Bayesian Network Classifiers is designed to handle the cost / performance trade-off and cope with concept drift. Our strategy for incorporating new data is based on bias management and gradual adaptation. Starting with the simple Naive Bayes, we scale up the complexity by gradually updating attributes and structure. Since updating the structure is a costly task, we use new data to primarily adapt the parameters and only if this is really necessary, do we adapt the structure. The method for handling concept drift is based on the Shewhart P-Chart. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MACSY

    MACSY

    Modular Architecture for Cognitive Systems

    Macsy is a framework for developing modular agents. Data is organised in blackboards. Computations are performed by modules that annotate the data in the blackboards. Modules communicate indirectly through the annotations that they leave in the blackboards. The framework enables the development of decentralised software agents for a plethora of applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    cognity

    A neural network library for Java.

    Cognity is an object-oriented neural network library for Java. It's goal is to provide easy-to-use, high level architecture for neural network computations along with reasonable performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    HSSVM(Hyper-Sphere Support Vector Machines) is a software for solving multi-classification problem, implemented by Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A Machine Learning and Data Retrieval Framework
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ICT-Alive
    The aim of ALIVE is to develop new approaches to the engineering of flexible, adaptable distributed service-oriented systems based on the adaptation of social coordination and organisation mechanisms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    KeplerWeka adds the functionality of the open-source machine learning and data mining workbench WEKA to the free and open-source, scientific workflow application, Kepler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This project aims to develop a method to identify communities in a social network according to some point of view.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SAIM allows to interlink knowledge bases in the Semantic Web. It focuses on instance matching of very large knowledge bases available as SPARQL endpoints. SAIM uses machine learning techniques and is compatible with SILK.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Text annotation application (Tapp) is a stand alone software component that facilitates the quick annotation of text files for the purpose of creating labelled data for training, testing, and deploying machine learning models
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Program to performing the complete cycle of neural networks analysis: preparing data, choosing neural network (CasCor, MP, LogRegression, PNN), learning of network, monitoring learning state, ROC-analysis, optimization of network parameters using GA.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Sanchay
    Sanchay is a collection of tools and APIs for language researchers. It has some implementations of NLP algorithms, some flexible APIs, several user friendly annotation interfaces and Sanchay Query Language for language resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    E-learning Miner, formerly DŽEMUj is a tool for data mining from e-learning data. Aimed for teachers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Blunder is an automated tool for analyzing chained exceptions in Java. It's usefull for classify, generate a customized error message and a list for possible solutions.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB