Showing 87 open source projects for "pdf data mining"

  • 1
    iText is a free open source Java-PDF library released on SF under the MPL/LGPL; iText comes with a simple GUI: the iText toolbox. The original developers of iText want to publish this toolbox as a separate project under the more permissive MIT license.
    Downloads: 0 This Week
  • 2
    SPIDR (Space Physics Interactive Data Resource) is a distributed database and application server network, built to select, visualize and model historical space weather data. SPIDR is a web-application and a grid of data mining web-services.
    Downloads: 0 This Week
  • 3
    XTRACT4J V2 is a stand-alone, pure-Java program that creates XML files from dependent or independent SQL queries. It is designed as a drop-in replacement for Oracle Report for generating XML files. It also incorporates BI Publisher to create PDF reports.
    Downloads: 0 This Week
  • 4
    Automatically embed Wikipedia topic information into PDF documents via pop-up annotations. This relies on the Wikipedia Miner service that is also available on SourceForge.
    Downloads: 0 This Week
  • 5
    Shared Questionnaire System
    Shared Questionnaire System (SQS) is a fully functional Optical Mark Reader (OMR) form processing system implemented in Java Swing, XSL-FO and AJAX with straightforward GUIs. It aims to develop a social platform for sharing knowledge about questionnaires.
    Downloads: 0 This Week
  • 6
    This simple Java application lets you overlap several different single PDF pages into one page. Of all the applications I have tried that can merge or split PDF documents, none can overlap PDF pages.
    Downloads: 0 This Week
  • 7
    The ProM Import Framework lets you extract process enactment event logs from a set of information systems. These can be exported in the MXML format, which is the standard event log data format for Process Mining analysis techniques.
    Downloads: 2 This Week
  • 8
    JODConverter automates conversions between office document formats using OpenOffice.org. Supported formats include OpenDocument, PDF, RTF, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a Web application.
    Downloads: 10 This Week
  • 9
    Template-based Excel/PDF file generator (Java/Apache POI/iText). Does not require a report designer. Templates are created in an Excel workbook. Can generate complex Excel documents. Simple to use and easy to learn.
    Downloads: 0 This Week
  • 10
    a9text is a set of text-document specifications, implementations and tools. It is easy and friendly to read, write and translate (to DocBook/HTML/PDF).
    Downloads: 0 This Week
  • 11
    Converts plain-text RFC documents into open formats such as HTML and PDF. Features: index page links, document reference links, figure/table reference links; customizable CSS.
    Downloads: 0 This Week
  • 12
    The aim of this project is to develop a Portable Document Format (PDF) importer for OpenOffice.org Writer based on XPDF. This project was inspired by the PDF importer within KWord.
    Downloads: 1 This Week
  • 13
    This is a Java port of the original FPDF free PDF generation library for PHP.
    Downloads: 0 This Week
  • 14
    XML Documentation Tool automates the production of documentation deliverables, in both PDF and HTML formats, from a set of content modules written in XML using the DocBook DTD. It enables reuse of content modules from previous projects.
    Downloads: 0 This Week
  • 15
    Knowledge management tool designed primarily for PDF files. The tool lets documents be tagged either automatically or manually. Within the PDF archive, a user will be able to retrieve desired documents quickly.
    Downloads: 0 This Week
  • 16
    PDF Annot is a piece of software that enables you to add audio and text annotations to a PDF. It uses JPedal SimpleViewer and the iText library. Annotations are supported by Adobe's official PDF Reader. Report any bugs here: krakosia[at]gmail.com
    Downloads: 0 This Week
  • 17
    PDF Merger. It merges multiple PDF files into one PDF file.
    Downloads: 0 This Week
  • 18
    "distribution" is a message and data processing tool. It processes information through a graph of processors. It may be used to build mailing lists, fax gateways, email filters, PDF mailing combinators, report systems and many other processes.
    Downloads: 0 This Week
  • 19
    QExcel Converter SQL (Qt4): imports Excel/OpenOffice 2003 XML format and SQLite3 SQL text/binary, lets you edit tables, and exports to various formats: PDF via XSL Formatting Objects (XSL-FO, Apache FOP, Java), XML/XSLT, Excel, SQLite3 SQL text dump files, and MySQL SQL to XML/XSL.
    Downloads: 0 This Week
  • 20
    The Source2PDF tool is a small application which converts a directory full of source code into a single PDF file. It includes a table of contents, a title page, and is chaptered by subdirectories. This allows for easy, portable source code distribution.
    Downloads: 0 This Week
  • 21
    BTL is a template language that combines the power of JSTL and XSLT to produce documents in XML, HTML, XHTML, XSL-FO, PDF or other formats, based on JavaBean input.
    Downloads: 0 This Week
  • 22
    DocBook sml maintains multilingual documentation, generates fully automated artifacts (HTML, PDF, XML, TXT), uses DocBook XSL, Saxon, Xalan, FOP and Lynx, is driven by Ant and Yax, supports Computer Aided Translation, and runs standalone or in an IDE like Eclipse.
    Downloads: 0 This Week
  • 23
    Visual xsltproc is a tool that helps you write XSLT files and debug them to find errors. It writes XML and generates XML (with XML syntax highlighting and line numbers). Finally, if the result is XSL-FO, it generates the PDF with Apache FOP (Java). Built on Qt 4.2.
    Downloads: 0 This Week
  • 24
    Includes tools for creating ebooks in XML format. xTrans helps create an XML ebook from plain text such as RTF or TXT, and converts XML ebooks into a final format like PDF, HTML, RTF, PDB (various forms), ...
    Downloads: 0 This Week
  • 25
    The Nheengatu Project is a Java library that provides an HTML markup abstraction, allowing you to reuse it to generate PDF files, OpenOffice documents, image files, etc. The goal of this project is to maximize the use of HTML markup procedures.
    Downloads: 0 This Week