Showing 68 open source projects for "pdf data mining"

  • 1
    Orange Data Mining

    Orange: Interactive data analysis

    ...When teaching data mining, we like to illustrate rather than only explain.
    Downloads: 53 This Week
  • 2
    ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
    Downloads: 45 This Week
  • 3

    TabuVis

    An Interactive Visualisation for Tabular Data

    TabuVis is a comprehensive visual analysis tool that provides flexible, customizable and interactive visualization for tabular (or multidimensional) data. It uses a scatter-plot approach to provide interactive views for different attribute mappings, and offers single and multiple scatter-plots together with Map (GIS) capability. The project is available at: http://staff.scem.uws.edu.au/~vinh/projects/TabuVis/ Related publications: https://www.sciencedirect.com/science/article/pii/S2468502X20300358 https://dl.acm.org/doi/pdf/10.1145/2968220.2968248 https://link.springer.com/content/pdf/10.1007/s11432-013-4870-1.pdf https://dl.acm.org/doi/pdf/10.1145/2397696.2397705
    Downloads: 0 This Week
  • 4

    Divide and Conquer Treemaps

    Divide and Conquer Treemap Visualisation

    The project develops new techniques and algorithms for quickly partitioning and visualizing very large hierarchical structures within a variety of arbitrary shapes and spaces, using a Divide and Conquer approach for Treemaps. Related publications: https://www.sciencedirect.com/science/article/pii/S1045926X1500066X https://dl.acm.org/doi/pdf/10.1145/2493102.2493112
    Downloads: 0 This Week
  • 5
    AlphaPlot

    Interactive scientific graphing and data analysis software.

    Alpha Plot can generate different types of 2D and 3D plots (such as line, scatter, bar, pie, and surface plots) from data that is either imported from ASCII files, entered by hand, or calculated using formulas. The data is held in spreadsheets which are referred to as tables with column-based data (typically X and Y values for 2D plots) or matrices (for 3D plots). The spreadsheets as well as graphs and note windows are gathered in a project and can be organized using folders. The built-in...
    Downloads: 28 This Week
  • 6

    ADaMSoft

    Open Source and data mining software

    ADaMSoft is free and open source data mining software developed in Java. It contains data management methods and can create ready-to-use reports. It can read data from several sources and write the results in different formats.
    Downloads: 1 This Week
  • 7
    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system for exploring your personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop) search engine with enhanced functionality for file, email and blog search. See our GitLab homepage for source code and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
  • 8
    VIKAMINE is a flexible environment for visual analytics, data mining and business intelligence - implemented in pure Java. It features several powerful visualization and mining methods, and can utilize background knowledge.
    Downloads: 5 This Week
  • 9
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real-time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. The tool is being developed into a high-performance integrated data management platform that will seamlessly handle Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 8 This Week
  • 10
    QtiPlot
    QtiPlot is a user-friendly, platform independent data analysis and visualization application similar to the non-free Windows program Origin.
    Downloads: 45 This Week
  • 11

    VecText

    Converting text to a structured representation

    ...In the command-line interface mode, all options need to be specified using the command line parameters. This way of non-interactive communication enables incorporating the application into a more complicated data mining process integrating several software packages or performing multiple conversions in a batch.
    Downloads: 0 This Week
  • 12
    MYRA

    A collection of ACO algorithms for the data mining classification task

    MYRA is a collection of Ant Colony Optimization (ACO) algorithms for the data mining classification task. It includes popular rule induction and decision tree induction algorithms. The algorithms are ready to be used from the command line or can be easily called from your own Java code. They are built using a modular architecture, so they can be easily extended to incorporate different procedures and/or use different parameter values.
    Downloads: 7 This Week
  • 13
    FormRead

    Free OMR - OCR web software based on JavaScript and PHP

    ...Once you have parameterized your form, you can print many of them, give it to your students/respondents, scan and recognize them with formread, and you can finally export the data in your preferred formats (excel, pdf, csv)
    Downloads: 3 This Week
  • 14
    Lattice Miner is a data mining prototype for creating, visualizing and exploring concept (Galois) lattices. It allows the generation of formal concepts and association rules.
    Downloads: 1 This Week
  • 15
    Kohonen neural network library is a set of classes and functions for designing, training and using a Kohonen network (self-organizing map), an AI algorithm and a useful tool for data mining and knowledge discovery in data (http://knnl.sf.net).
    Downloads: 0 This Week
  • 16
    GUI Ant-Miner is a tool for extracting classification rules from data. It is an updated version of a data mining algorithm called Ant-Miner (Ant Colony-based Data Miner).
    Downloads: 1 This Week
  • 17
    Mass-based dissimilarity

    A data dependent dissimilarity measure based on mass estimation.

    This software calculates the mass-based dissimilarity matrix for data mining algorithms that rely on a distance measure. Reference: Overcoming Key Weaknesses of Distance-based Neighbourhood Methods using a Data Dependent Dissimilarity Measure. KDD 2016 http://dx.doi.org/10.1145/2939672.2939779 The source code, presentation slides and poster are attached under "Files". The KDD 2016 presentation video is published at https://youtu.be/eotD_-SuEoo .
    Downloads: 0 This Week
  • 18
    Tool support for creating FMC* diagrams [Block diagrams, Petri nets, Entity-Relationship diagrams (ERD)] in MS-Visio 2000 and newer. Features: stencils, consistency checking, Petri net simulation, exporter e.g. pdf, ... *Fundamental Modeling Concepts A stripped down version of the stencil set is available for TAM (Technical Architecture Modeling of SAP). This set uses UML notation and contains Block, Activity, Sequence, State, Class, and Component diagrams. It doesn't contain Simulation,...
    Downloads: 2 This Week
  • 19
    The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).
    Downloads: 0 This Week
  • 20
    Graph-RAT
    Graph-RAT is a database abstraction layer designed to make it easy to use a large library of graph-analysis routines on a database as well as add new kinds of algorithms to data mining.
    Downloads: 2 This Week
  • 21
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 1 This Week
  • 22
    GISBox by Citiviz

    Ready-to-use appliance for GIS, Data Mining and Analysis, Modelling...

    GISBox by Citiviz is a ready-to-use virtual appliance containing an assortment of powerful, free and open source GIS, Data Mining and Analysis, Modelling and Programming software and packages. The pre-configured and tested GISBox lets you try out, within a few minutes, a variety of software used by data scientists, engineers, physicists, mathematicians, geomatics engineers, and geographers from around the globe, without installing anything other than the Oracle VirtualBox™ software (https://www.virtualbox.org/) on your Linux, Windows or Mac computer. ...
    Downloads: 0 This Week
  • 23

    cextools

    Command line helpers for Conexp files.

    Some small command line programs and a file parser for Concept Explorer (conexp), written in C++. Current features include converters from Concept Explorer to PDF, PostScript, SVG and PovRay, and a modified 3D Freese layout.
    Downloads: 0 This Week
  • 24
    The FAKE GAME tool uses natural evolution to evolve Data Mining models. It incorporates several preprocessing, optimization and visualization methods aimed to streamline the Knowledge Discovery process. Knowledge Extraction from data is being automated!
    Downloads: 1 This Week
  • 25
    GAMES ESMI

    The Energy Sensing and Monitoring Infrastructure of the GAMES project

    This is part of the GAMES Project, which consists of a set of innovative methodologies and Open Source ICT tools for designing and managing energy efficiency in IT Service Centres. The ESMI is the monitoring and sensing infrastructure that enables fine-grained measurement and provides both an event-based and a real-time stream of data conveying information about energy consumption at the key points of the overall system. It is completed by a set of Nagios plugins and an assessment tool that implements some green Performance Indicators and integrates data mining with the self-adaptive controllers of the entire framework.
    Downloads: 0 This Week