Showing 11 open source projects for "pdf data mining"

View related business solutions
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 1
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    PDF Split and Merge

    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command line and web interface.
    Leader badge
    Downloads: 303 This Week
    Last Update:
    See Project
  • 3

    toPDF

    Online service for PDF conversion (to PDF)

    A simple online service for PDF conversion. This project is a simple library and also a web application. It offers a REST service and a simple upload service for synchronous conversion. This library/application doesn't contain conversion libraries because it's a wrapper for existing tools. toPDF currently supports the open source tool PDF Creator (http://www.pdfforge.org) and the commercial solution, easy PDF, from BCL (http://www.pdfonline.com/easypdf/sdk/).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    PdfInspector1

    PdfInspector1

    Application to inspect text and images of pdf books.

    With the application you can open and browse pdf books. In addition you will be able to inspect the codes for characters or locate the images. May be an example of basic use of pdfbox. JDK-17 compatibility You will find more about it at this web site: https://www.frojasg1.com:8443/downloads_web/web/html/pdfInspector.html?origin=sourceforge
    Downloads: 1 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    Merge to PDF

    Merge to PDF

    Simple portable Java app to merge single images and documents to PDF

    Portable Java app to merge single PDF documents and JPG images to one PDF document. Standalone app include own Java. * Application is in the BETA version - please report bugs and issues on Tickets page. *
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MagicalPdfEditor

    MagicalPdfEditor

    This is a small PDF editor based on OpenPdf core and AndroidPdfViewer

    This is a small PDF editor based on OpenPDF Core and AndroidPdfViewer. As there is not many open-source easy working PDF editors and PDF wizards, I decided to create a simple directory to resolve my issues. Here I have worked on two separate cores, add some functionality to them, and combined them together to achieve my target. I am working on this repo, any help will be appreciated. Just clone the project and trance the source code, It's really easy and clear. All functionality in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    PDF Guru

    PDF Guru

    Merge images and PDFs to a single PDF

    PDF Guru is a simple in use program for merging multiple images and PDF files into a single compact PDF file. It is capable of selecting specific PDF pages or range of pages, which lets you have more control on the output file. Be able to produce compacted, smaller sized files in any operating system. Its features makes it a great, must have, tool for everyone.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 8
    PdfJumbler
    A simple tool to rearrange/merge/delete pages from PDF files. The modular backend system uses either JPedal or JPod to display PDFs and iText or Apache PDFBox to save them. !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Development of this project has moved to GitHub. Please check https://github.com/mgropp/pdfjumbler for current releases! !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Downloads: 36 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 10
    PDFManager

    PDFManager

    Application to modify pdf files.

    Java application to modify PDF files. It may be used to merge two or more PDF files, delete pages, change page order, etc. Windows installer and source code to download. ------------------------------------- Version 2.0: You may create PDF files from *txt files; You may create PDF files from pictures; You may choose the order of the pages when merging files; New Interface; Available for Linux; -------------------------------------
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 135 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next