Showing 51 open source projects for "pdf data mining"

  • 1
    Orange Data Mining

    Orange: Interactive data analysis

    ...When teaching data mining, we like to illustrate rather than only explain.
    Downloads: 52 This Week
  • 2

    TabuVis

    An Interactive Visualisation for Tabular Data

    TabuVis is a visual analysis tool that provides flexible, customizable and interactive visualization for tabular (or multidimensional) data. It uses a scatter-plot approach to provide interactive views for different attribute mappings, offering single and multiple scatter plots together with map (GIS) capability. The project is available at: http://staff.scem.uws.edu.au/~vinh/projects/TabuVis/ Related publications: https://www.sciencedirect.com/science/article/pii/S2468502X20300358 https://dl.acm.org/doi/pdf/10.1145/2968220.2968248 https://link.springer.com/content/pdf/10.1007/s11432-013-4870-1.pdf https://dl.acm.org/doi/pdf/10.1145/2397696.2397705
    Downloads: 0 This Week
  • 3

    Divide and Conquer Treemaps

    Divide and Conquer Treemap Visualisation

    The project develops new techniques and algorithms for quickly partitioning and visualizing very large hierarchical structures within a variety of arbitrary shapes and space, using a divide-and-conquer approach for treemaps. Related publications: https://www.sciencedirect.com/science/article/pii/S1045926X1500066X https://dl.acm.org/doi/pdf/10.1145/2493102.2493112
    Downloads: 0 This Week
  • 4
    AlphaPlot

    Interactive scientific graphing and data analysis software.

    Alpha Plot can generate different types of 2D and 3D plots (such as line, scatter, bar, pie, and surface plots) from data that is either imported from ASCII files, entered by hand, or calculated using formulas. The data is held in spreadsheets which are referred to as tables with column-based data (typically X and Y values for 2D plots) or matrices (for 3D plots). The spreadsheets as well as graphs and note windows are gathered in a project and can be organized using folders. The built-in...
    Downloads: 24 This Week
  • 5

    ADaMSoft

    Open Source and data mining software

    ADaMSoft is free and open source data mining software developed in Java. It contains data management methods and can create ready-to-use reports. It can read data from several sources and write the results in different formats.
    Downloads: 1 This Week
  • 6
    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system for exploring the personal information space, supporting you with the 'orienteering' search paradigm. DynaQ is a (desktop) search engine with enhanced functionality for file, email and blog search. See our GitLab homepage for source code and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
  • 7
    VIKAMINE is a flexible environment for visual analytics, data mining and business intelligence - implemented in pure Java. It features several powerful visualization and mining methods, and can utilize background knowledge.
    Downloads: 2 This Week
  • 8
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real-time alerting, basket analysis, bubble chart, Warehouse validation, single customer view etc. defined by Strategy. This tool is developing a high-performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 7 This Week
  • 9
    QtiPlot
    QtiPlot is a user-friendly, platform independent data analysis and visualization application similar to the non-free Windows program Origin.
    Downloads: 51 This Week
  • 10
    MYRA

    A collection of ACO algorithms for the data mining classification task

    MYRA is a collection of Ant Colony Optimization (ACO) algorithms for the data mining classification task. It includes popular rule induction and decision tree induction algorithms. The algorithms are ready to be used from the command line or can be easily called from your own Java code. They are built using a modular architecture, so they can be easily extended to incorporate different procedures and/or use different parameter values.
    Downloads: 7 This Week
  • 11
    FormRead

    Free OMR - OCR web software based on JavaScript and PHP

    ...Once you have parameterized your form, you can print many copies, give them to your students/respondents, scan and recognize them with FormRead, and finally export the data in your preferred format (Excel, PDF, CSV).
    Downloads: 10 This Week
  • 12
    Lattice Miner is a data mining prototype for creating, visualizing and exploring concept (Galois) lattices. It allows the generation of formal concepts and association rules.
    Downloads: 1 This Week
  • 13
    Kohonen neural network library is a set of classes and functions for designing, training and using a Kohonen network (self-organizing map), an AI algorithm and a useful tool for data mining and knowledge discovery in data (http://knnl.sf.net).
    Downloads: 0 This Week
  • 14
    GUI Ant-Miner is a tool for extracting classification rules from data. It is an updated version of a data mining algorithm called Ant-Miner (Ant Colony-based Data Miner).
    Downloads: 1 This Week
  • 15
    Mass-based dissimilarity

    A data dependent dissimilarity measure based on mass estimation.

    This software calculates the mass-based dissimilarity matrix for data mining algorithms relying on a distance measure. References: Overcoming Key Weaknesses of Distance-based Neighbourhood Methods using a Data Dependent Dissimilarity Measure. KDD 2016 http://dx.doi.org/10.1145/2939672.2939779 The source code, presentation slide and poster are attached under "Files". The presentation video in KDD 2016 is published on https://youtu.be/eotD_-SuEoo .
    Downloads: 0 This Week
  • 16
    The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).
    Downloads: 0 This Week
  • 17
    Graph-RAT
    Graph-RAT is a database abstraction layer designed to make it easy to use a large library of graph-analysis routines on a database as well as add new kinds of algorithms to data mining.
    Downloads: 2 This Week
  • 18
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 2 This Week
  • 19

    cextools

    Command line helpers for Conexp files.

    Some small command line programs and a file parser for Concept Explorer (conexp), written in C++. Current features include converters from Concept Explorer into PDF, PostScript, SVG and PovRay, and a modified 3D Freese layout.
    Downloads: 0 This Week
  • 20
    The FAKE GAME tool uses natural evolution to evolve Data Mining models. It incorporates several preprocessing, optimization and visualization methods aimed to streamline the Knowledge Discovery process. Knowledge Extraction from data is being automated!
    Downloads: 1 This Week
  • 21
    GAMES ESMI

    The Energy Sensing and Monitoring Infrastructure of the GAMES project

    This is part of the GAMES Project, which consists of a set of innovative methodologies and Open Source ICT tools for designing and managing energy efficiency in IT Service Centres. The ESMI is the monitoring and sensing infrastructure which allows fine-grained measurement and provides both an event-based and a real-time stream of data conveying information about the energy consumption in the key points of the overall system. It is completed by a set of Nagios plugins and an assessment tool which implements some green Performance Indicators and integrates data mining with the self-adaptive controllers of the entire framework.
    Downloads: 0 This Week
  • 22
    Ruleminer is an experimental tool for performing various data-mining tasks, especially focused on searching for association rules. The aim is to research the association rules mining process and to develop new approaches for solving the related problems.
    Downloads: 0 This Week
  • 23
    This project aims to be an easy-to-use toolkit of algorithms and utilities for semantic data mining. So far all algorithms are implemented as web services and we provide widgets for their use in the Orange4WS data mining platform.
    Downloads: 0 This Week
  • 24
    KNIME (http://www.knime.org) nodes for sequence bioinformatics. Sequime is an Eclipse plug-in for the KNIME data mining platform, providing additional nodes for reading, processing and visualizing sequence information.
    Downloads: 0 This Week
  • 25
    Ferda is a user friendly data mining tool. It is a modular distributed multiplatform framework based on Internet Communications Engine. Ferda is very powerful in working with association rules.
    Downloads: 2 This Week