Showing 167 open source projects for "pdf data mining"

  • 1
    A small, simple Java library for working with Jasper Reports dynamically, enabling dynamic column creation and dynamic data sets using Apache DynaBeans. The project is developed by Softberries, a small software company (www.softberries.com).
    Downloads: 0 This Week
  • 2
    Python module and command line utility that analyzes XML output from the program pdftohtml in order to extract tables from PDF files. Outputs CSV.
    Downloads: 0 This Week
  • 3
    The ProM Import Framework extracts process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 2 This Week
  • 4
    The application generates PDF reports online and splits a report along one or more dimensions: production or cost reports can run to thousands of pages, while a user needs only his own portion of the data.
    Downloads: 0 This Week
  • 5
    This project aims to implement all the capabilities specified in the PDF 1.6 Reference and to provide tools to create, read, and manipulate PDF files.
    Downloads: 0 This Week
  • 6
    DataVision is a reporting tool similar to Crystal Reports. DV supports many data sources (JDBC, files) and many output formats (HTML, XML, PDF, LaTeX, Excel, delimited files, DocBook). DV includes a GUI editor. DV is embeddable. Reports are XML-based.
    Downloads: 5 This Week
  • 7
    PHP report engine. Main Features: Group data rows with totals; Cell styles & formats; Render to HTML & PDF; No external dependencies; Easy to extend with new column, renderer, cell formats...
    Downloads: 0 This Week
  • 8
    This is a Java port of the original FPDF free PDF generation library for PHP.
    Downloads: 0 This Week
  • 9
    Genetic Programming (tree structure) predictor within Weka data mining software for both continuous and classification problems.
    Downloads: 0 This Week
  • 10
    Openminer is a data mining engine, developed in Java, for analyzing datasets with data mining methods. With Openminer, we can discover the knowledge that interests us but is hidden in the raw data.
    Downloads: 0 This Week
  • 11
    openRiverbed - the PHP5 framework. Ajax, TinyMCE, Plugins, XML based configuration, template based, XML2PDF pdf generation, multi-language support for application and content, encrypted sessions, test-driven, oo developed... Hardened by real projects.
    Downloads: 0 This Week
  • 12
    A ready-to-use DocBook project that can create html, html_single, and PDF documents immediately. All required files are included in their newest versions, so you do not need to download anything. The quick-start hints and simple samples help you start writing right away. Supports Chinese and includes Chinese help.
    Downloads: 0 This Week
  • 13
    dbtroff converts DocBook XML to PostScript or PDF using XSLT, Heirloom troff, and Ghostscript. troff markup can be used to customize the appearance of the document. Currently, only a subset of DocBook version 4 markup is supported.
    Downloads: 0 This Week
  • 14
    Agent Academy is a Java integrated development framework for creating Intelligent Agents and Multi Agent Systems. Agent Academy seamlessly integrates JADE and WEKA platforms and enables the creation of software agents that use Data Mining models.
    Downloads: 1 This Week
  • 15
    Library Of Randomized Algorithms: Randomization is a powerful idea that has applications in science and engineering. Difficult optimization problems, protein folding, and data mining are only a few of the problems that have been solved using randomization.
    Downloads: 0 This Week
  • 16
    The Source2PDF tool is a small application which converts a directory full of source code into a single PDF file. It includes a table of contents, a title page, and is chaptered by subdirectories. This allows for easy, portable source code distribution.
    Downloads: 0 This Week
  • 17
    BTL is a template language that combines the power of JSTL and XSLT to produce documents in XML, HTML, XHTML, XSL-FO, PDF, or other formats, based on JavaBean input.
    Downloads: 0 This Week
  • 18
    KNN-WEKA provides an implementation of the K-nearest neighbour algorithm for Weka. Weka is a collection of machine learning algorithms for data mining tasks. For more information on Weka, see http://www.cs.waikato.ac.nz/ml/weka/.
    Downloads: 0 This Week
  • 19
    NOD-MP stands for not another data-mining project. It is educational and scientific software to utilize data mining clustering algorithms through a user-friendly interface.
    Downloads: 0 This Week
  • 20
    DocBook sml maintains multilingual documentation, generates artifacts (html, pdf, xml, txt) fully automatically, uses DocBook XSL, Saxon, Xalan, FOP, and Lynx, is driven by Ant and Yax, supports Computer Aided Translation, and runs standalone or in an IDE like Eclipse.
    Downloads: 0 This Week
  • 21
    DMTL (Data Mining Template Library) - A generic C++ based library for mining structured patterns such as sets, sequences, trees and graphs. The library provides implementations of popular frequent pattern mining algorithms.
    Downloads: 0 This Week
  • 22
    This project intends to create an indexing search engine for knowledge management. The primary objective is to apply an information retrieval core and to implement knowledge data discovery techniques such as data mining algorithms and text mining.
    Downloads: 0 This Week
  • 23
    EfficiencyGuardian extracts callgrind efficiency measures from individual CppUnit test cases to detect efficiency regressions. It includes a data mining web tool to browse historic results and TestFarm integration for unattended execution on commit.
    Downloads: 0 This Week
  • 24
    dompdf - the PHP 5 HTML to PDF converter. dompdf is a (mostly) CSS compliant HTML rendering engine written in PHP. It supports external stylesheets, inline style tags, and the style attributes of individual HTML elements. Requires PHP 5.
    Downloads: 3 This Week
  • 25
    musicomp is a program whose most important element is an evolutionary algorithm that uses data mining methods as a fitness function to generate monophonic melodies.
    Downloads: 0 This Week