Showing 39 open source projects for "pdf data mining"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 1
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • 2

    TabuVis

    An Interactive Visualisation for Tabular Data

    TabuVis is a comprehensive visual analysis tool that provides a flexible, customizable and interactive visualization for tabular (or multidimensional) data. It utilizes scatter-plot visualization approach to provide comprehensive and interactive views for different attribute mappings. It provides single scatter-plot and multiple scatter-plots together with Map (GIS) capability. The project is available at: http://staff.scem.uws.edu.au/~vinh/projects/TabuVis/ Related publications: https://www.sciencedirect.com/science/article/pii/S2468502X20300358 https://dl.acm.org/doi/pdf/10.1145/2968220.2968248 https://link.springer.com/content/pdf/10.1007/s11432-013-4870-1.pdf https://dl.acm.org/doi/pdf/10.1145/2397696.2397705
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    Divide and Conquer Treemaps

    Divide and Conquer Treemap Visualisation

    The project develops new new techniques and algorithms for quickly partitioning and visualizing very large hierarchical structures within a variety of arbitrary shapes and space using Divide and Conquer approach for Treemaps. Related publications: https://www.sciencedirect.com/science/article/pii/S1045926X1500066X https://dl.acm.org/doi/pdf/10.1145/2493102.2493112
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    ADaMSoft

    Open Source and data mining software

    ADaMSoft is a free and Open Source Data Mining software developed in Java. It contains data management methods and it can create ready to use reports. It can read data from several sources and it can write the results in different formats.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 5
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    VIKAMINE is a flexible environment for visual analytics, data mining and business intelligence - implemented in pure Java. It features several powerful visualization and mining methods, and can utilize background knowledge.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    MYRA

    MYRA

    A collection of ACO algorithms for the data mining classification task

    MYRA is a collection of Ant Colony Optimization (ACO) algorithms for the data mining classification task. It includes popular rule induction and decision tree induction algorithms. The algorithms are ready to be used from the command line or can be easily called from your own Java code. They are build using a modular architecture, so they can be easily extended to incorporate different procedures and/or use different parameter values.
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    Lattice Miner is a data mining prototype for creating, visualizing and exploring concept (Galois) lattices. It allows the generation of formal concepts and association rules.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    GUI Ant-Miner is a tool for extracting classification rules from data. It is an updated version of a data mining algorithm called Ant-Miner (Ant Colony-based Data Miner).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Graph-RAT
    Graph-RAT is a database abstraction layer designed to make it easy to use a large library of graph-analysis routines on a database as well as add new kinds of algorithms to data mining.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    The FAKE GAME tool uses natural evolution to evolve Data Mining models. It incorporates several preprocessing, optimization and visualization methods aimed to streamline the Knowledge Discovery process. Knowledge Extraction from data is being automated!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    GAMES ESMI

    GAMES ESMI

    The Energy Sensing and Monitoring Infrastructure of the GAMES project

    This is part of the GAMES Project, which consists of a set of innovative methodologies and Open Source ICT tools for designing and managing energy efficiency in IT Service Centres. The ESMI is the monitoring and sensing infrastructure which allows the fine-grained measurement and provides both an event-based and a real-time stream of data conveying information about the energy consumption in the key points of the overall system. It is completed by a set of Nagios plugin and an assessment tool which implements some green Performance Indicators and integrates data mining with the self adaptive controllers of the entire framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Ruleminer is an experimental tool for performing various data-mining tasks, especially focused on searching for association rules. The aim is to research the association rules mining process and to develop new approaches for solving the related problems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This project aims to be a easy-to-use toolkit of algorithms and utilities for semantic data mining. So far all algorithms are implemented as web services and we provide widgets for their use in the Orange4WS data mining platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Knime (http://www.knime.org) nodes for sequence bioinformatics. Sequime is an eclipse plug-in for the KNIME data mining platform, providing additional nodes for reading, processing and visualizing sequence information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Cougar Squared is a new Java library for machine learning and data mining research, supporting research needs of the community. It is written by researchers for researchers. It extends the WEKA and YALE machine learning frameworks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    TAXOMO
    Data mining tool for sequences (e.g. trajectories on a map, visited web pages, etc.) that creates a succinct description of the sequences, given a taxonomy (e.g. regions and sub-regions in the map, categories and sub-categories of pages, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The EDAM ENCHILADA - The Exploratory Data Analysis and Management Project's Environmental Chemistry Data Processing and Mining Application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Contextor
    Contextor is a light-weight simple-to-use Java based library to help developers and researchers working with the general concept of a resource; as examples, resources can be text resources, web resources, images and videos.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Open data mining platform. Provides common architecture for algorithms of various types. Efficient processing of arbitrarily large volumes of data thanks to data streaming. Weka and Rseslib partially integrated. (www.debellor.org)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    The Minervan project aims at aiding intelligent software development. It integrates reporting, analysis and data mining to support better decision making.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next